Deep Learning and Image Processing Combined Organization of Shirakawa’s Hand-Notated Documents on OBI Research

Xuebin Yue, Bing Lyu, Hengyi Li, Yoshiyuki Fujikawa, Lin Meng
{"title":"Deep Learning and Image Processing Combined Organization of Shirakawa’s Hand-Notated Documents on OBI Research","authors":"Xuebin Yue, Bing Lyu, Hengyi Li, Yoshiyuki Fujikawa, Lin Meng","doi":"10.1109/ICNSC52481.2021.9702164","DOIUrl":null,"url":null,"abstract":"The purpose of this work is to organize Professor Shirakawa’s newly discovered hand-notated documents on his Oracle Bone Inscription (OBI) research. During the second half of the 20th-century, Professor Shirakawa was a prominent researcher on Chinese culture, especially in the field of OBIs, and he left behind many research documents he had notated by hand. However, some of these documents have not been properly organized yet. The reorganization of OBIs is not only helpful for better understanding his research but also for further studying about OBIs in general and their importance in ancient Chinese history. Part of Professor Shirakawa’s hand-notated research documents on OBIs is introduced to the world for the first time in this work. For organizing these documents, Firstly, a morphology-based segmentation method is applied to segment the characters in the documents and then the paper proposes a slight neural network for removing the noise from the mis- segmented characters. Finally, a dynamic K-means method is applied for classifying the segmented characters. Specifically, the histogram of oriented gradients (HOG) descriptors are extracted as features, and the class number of K is dynamically decided by using the silhouette coefficient. The results of this evaluation showed that the accuracy of noise and character classification after segmentation achieves 96.50%, and the accuracy of character classification achieves 74.91%. The results demonstrate the effectiveness of the proposed method.","PeriodicalId":129062,"journal":{"name":"2021 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC52481.2021.9702164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The purpose of this work is to organize Professor Shirakawa’s newly discovered hand-notated documents on his Oracle Bone Inscription (OBI) research. During the second half of the 20th-century, Professor Shirakawa was a prominent researcher on Chinese culture, especially in the field of OBIs, and he left behind many research documents he had notated by hand. However, some of these documents have not been properly organized yet. The reorganization of OBIs is not only helpful for better understanding his research but also for further studying about OBIs in general and their importance in ancient Chinese history. Part of Professor Shirakawa’s hand-notated research documents on OBIs is introduced to the world for the first time in this work. For organizing these documents, Firstly, a morphology-based segmentation method is applied to segment the characters in the documents and then the paper proposes a slight neural network for removing the noise from the mis- segmented characters. Finally, a dynamic K-means method is applied for classifying the segmented characters. Specifically, the histogram of oriented gradients (HOG) descriptors are extracted as features, and the class number of K is dynamically decided by using the silhouette coefficient. The results of this evaluation showed that the accuracy of noise and character classification after segmentation achieves 96.50%, and the accuracy of character classification achieves 74.91%. The results demonstrate the effectiveness of the proposed method.
深度学习与图像处理结合组织Shirakawa手记OBI研究文献
本工作的目的是整理白川教授关于甲骨文研究的新发现的手记文献。在20世纪下半叶,白川教授是中国文化,特别是obi领域的杰出研究者,他留下了许多他亲手批注的研究文件。然而,其中一些文件还没有得到适当的组织。对obi的整理不仅有助于更好地理解他的研究,也有助于进一步研究obi的总体情况及其在中国古代史上的重要性。白川教授关于obi的部分手写研究文件在本作品中首次向世界介绍。为了组织这些文档,首先采用一种基于形态学的分割方法对文档中的字符进行分割,然后提出一种轻微的神经网络来去除错误分割字符中的噪声。最后,采用动态k均值方法对分割后的字符进行分类。具体而言,提取定向梯度直方图(HOG)描述子作为特征,并利用轮廓系数动态确定K的类数。评价结果表明,分割后的噪声和字符分类准确率达到96.50%,字符分类准确率达到74.91%。实验结果表明了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信