TY - JOUR
T1 - Improvement of Oracle Bone Inscription Recognition Accuracy
T2 - A Deep Learning Perspective
AU - Fu, Xuanming
AU - Yang, Zhengfeng
AU - Zeng, Zhenbing
AU - Zhang, Yidan
AU - Zhou, Qianting
N1 - Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/1
Y1 - 2022/1
N2 - Deep learning techniques have been successfully applied in handwriting recognition. Oracle bone inscriptions (OBI) are the earliest hieroglyphs in China and valuable resources for studying the etymology of Chinese characters. OBI are of important historical and cultural value in China; thus, textual research surrounding the characters of OBI is a huge challenge for archaeologists. In this work, we built a dataset named OBI-100, which contains 100 classes of oracle bone inscriptions collected from two OBI dictionaries. The dataset includes more than 128,000 character samples related to the natural environment, humans, animals, plants, etc. In addition, we propose improved models based on three typical deep convolutional network structures to recognize the OBI-100 dataset. By modifying the parameters, adjusting the network structures, and adopting optimization strategies, we demonstrate experimentally that these models perform fairly well in OBI recognition. For the 100-category OBI classification task, the optimal model achieves an accuracy of 99.5%, which shows competitive performance compared with other state-of-the-art approaches. We hope that this work can provide a valuable tool for character recognition of OBI.
AB - Deep learning techniques have been successfully applied in handwriting recognition. Oracle bone inscriptions (OBI) are the earliest hieroglyphs in China and valuable resources for studying the etymology of Chinese characters. OBI are of important historical and cultural value in China; thus, textual research surrounding the characters of OBI is a huge challenge for archaeologists. In this work, we built a dataset named OBI-100, which contains 100 classes of oracle bone inscriptions collected from two OBI dictionaries. The dataset includes more than 128,000 character samples related to the natural environment, humans, animals, plants, etc. In addition, we propose improved models based on three typical deep convolutional network structures to recognize the OBI-100 dataset. By modifying the parameters, adjusting the network structures, and adopting optimization strategies, we demonstrate experimentally that these models perform fairly well in OBI recognition. For the 100-category OBI classification task, the optimal model achieves an accuracy of 99.5%, which shows competitive performance compared with other state-of-the-art approaches. We hope that this work can provide a valuable tool for character recognition of OBI.
KW - CNN
KW - Character recognition
KW - Cultural heritage
KW - Deep learning
KW - Image classification
KW - Oracle bone inscriptions
UR - https://www.scopus.com/pages/publications/85123238373
U2 - 10.3390/ijgi11010045
DO - 10.3390/ijgi11010045
M3 - 文章
AN - SCOPUS:85123238373
SN - 2220-9964
VL - 11
JO - ISPRS International Journal of Geo-Information
JF - ISPRS International Journal of Geo-Information
IS - 1
M1 - 45
ER -