TY - GEN
T1 - Modeling Cross-layer Interaction for Chinese Calligraphy Style Classification
AU - Li, Zhigang
AU - Liu, Li
AU - Qiu, Taorong
AU - Lu, Yue
AU - Suen, Ching Y.
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Chinese calligraphy style classification plays a significant role in Chinese calligraphy study. It is a fine-grained classification problem since the difference among different styles is extremely subtle. We propose a novel convolutional neural network equipped with the cross-layer interaction module to address the issue of Chinese calligraphy style classification in this paper. In our proposed network, a multi-scale attention mechanism is first presented, with which the input image can be characterized at multiple levels. Then we model the interaction between any two layers in the network using Hadamard product. In addition, for each input image, we generate its profile image, which is fed to the network together with the input image. In order to evaluate the effectiveness of the proposed network, we conduct extensive experiments on two datasets. The results show that modeling cross-layer interaction is beneficial for the fine-grained Chinese calligraphy style classification task. The multi-scale attention mechanism can highlight the informative part of the image at multiple scales, which can boost the classification performance. Since the profile image can give clues about the stroke compactness of the characters, it is useful in capturing the subtle difference among different styles. The proposed network achieves the accuracies of 98.62 % and 95.92 % on the two datasets respectively, which compares favorably with state-of-the-art methods.
AB - Chinese calligraphy style classification plays a significant role in Chinese calligraphy study. It is a fine-grained classification problem since the difference among different styles is extremely subtle. We propose a novel convolutional neural network equipped with the cross-layer interaction module to address the issue of Chinese calligraphy style classification in this paper. In our proposed network, a multi-scale attention mechanism is first presented, with which the input image can be characterized at multiple levels. Then we model the interaction between any two layers in the network using Hadamard product. In addition, for each input image, we generate its profile image, which is fed to the network together with the input image. In order to evaluate the effectiveness of the proposed network, we conduct extensive experiments on two datasets. The results show that modeling cross-layer interaction is beneficial for the fine-grained Chinese calligraphy style classification task. The multi-scale attention mechanism can highlight the informative part of the image at multiple scales, which can boost the classification performance. Since the profile image can give clues about the stroke compactness of the characters, it is useful in capturing the subtle difference among different styles. The proposed network achieves the accuracies of 98.62 % and 95.92 % on the two datasets respectively, which compares favorably with state-of-the-art methods.
KW - Chinese calligraphy style classification
KW - Cross-layer interaction
KW - Multi-scale attention
KW - Profile image
UR - https://www.scopus.com/pages/publications/85173583590
U2 - 10.1007/978-3-031-41685-9_5
DO - 10.1007/978-3-031-41685-9_5
M3 - 会议稿件
AN - SCOPUS:85173583590
SN - 9783031416842
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 70
EP - 84
BT - Document Analysis and Recognition – ICDAR 2023 - 17th International Conference, Proceedings
A2 - Fink, Gernot A.
A2 - Jain, Rajiv
A2 - Kise, Koichi
A2 - Zanibbi, Richard
PB - Springer Science and Business Media Deutschland GmbH
T2 - 17th International Conference on Document Analysis and Recognition, ICDAR 2023
Y2 - 21 August 2023 through 26 August 2023
ER -