A Robust Segmentation Method for Chinese Inscription Images

Xi Xia, Lin Sh
{"title":"A Robust Segmentation Method for Chinese Inscription Images","authors":"Xi Xia, Lin Sh","doi":"10.1109/ICDSBA53075.2021.00043","DOIUrl":null,"url":null,"abstract":"Inscriptions are important carriers of Chinese calligraphy which has high calligraphic, artistic and cultural value. Segmentation of Chinese inscription images play a fundamental role in processing of Chinese character images. Traditional projection segmentation methods can be used for Chinese inscription image segmentation because Chinese characters in most inscription images arrange regularly. However, in an actual segmentation process, the slight offset of the glyph position or a crossing of strokes after projection caused projection segmentation method failure. In order to solve this problem, a Chinese inscription image segmentation method based on clustering algorithm was provided. Firstly, preprocessed the image, and then looked up the contour of the inscription images to exclude the obvious abnormal part of the contour size; Secondly, filled in the remaining contours to obtain a clustered sample set; Thirdly, used the DBSCAN clustering algorithm to cluster the sample set to generate several clusters which each of those clusters represented a Chinese character image; Finally, for clusters that did not satisfy the outline range of Chinese characters, adjusted the minimum neighborhood until the aspect ratio of the circumscribed rectangle of the cluster was within the range of the aspect ratio of Chinese characters. We conducted character segmentation experiments on sample inscriptions. Results showed that out method not only segmented and processed the regularly arranged inscriptions images, but also worked efficiently for irregularly arranged inscriptions images.","PeriodicalId":154348,"journal":{"name":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSBA53075.2021.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Inscriptions are important carriers of Chinese calligraphy which has high calligraphic, artistic and cultural value. Segmentation of Chinese inscription images play a fundamental role in processing of Chinese character images. Traditional projection segmentation methods can be used for Chinese inscription image segmentation because Chinese characters in most inscription images arrange regularly. However, in an actual segmentation process, the slight offset of the glyph position or a crossing of strokes after projection caused projection segmentation method failure. In order to solve this problem, a Chinese inscription image segmentation method based on clustering algorithm was provided. Firstly, preprocessed the image, and then looked up the contour of the inscription images to exclude the obvious abnormal part of the contour size; Secondly, filled in the remaining contours to obtain a clustered sample set; Thirdly, used the DBSCAN clustering algorithm to cluster the sample set to generate several clusters which each of those clusters represented a Chinese character image; Finally, for clusters that did not satisfy the outline range of Chinese characters, adjusted the minimum neighborhood until the aspect ratio of the circumscribed rectangle of the cluster was within the range of the aspect ratio of Chinese characters. We conducted character segmentation experiments on sample inscriptions. Results showed that out method not only segmented and processed the regularly arranged inscriptions images, but also worked efficiently for irregularly arranged inscriptions images.
中文题词图像的鲁棒分割方法
题词是中国书法的重要载体,具有很高的书法、艺术和文化价值。汉字图像的分割是汉字图像处理的基础。由于大多数铭文图像中的汉字排列规律,传统的投影分割方法可以用于铭文图像的分割。但在实际分割过程中,由于字形位置的轻微偏移或投影后笔画的交叉会导致投影分割方法失败。为了解决这一问题,提出了一种基于聚类算法的中文铭文图像分割方法。首先对图像进行预处理,然后查找铭文图像的轮廓,排除轮廓尺寸明显异常的部分;其次,对剩余轮廓进行填充,得到聚类样本集;第三,采用DBSCAN聚类算法对样本集进行聚类,生成多个聚类,每个聚类代表一个汉字图像;最后,对于不满足汉字轮廓范围的聚类,调整最小邻域,直到聚类的限定矩形宽高比在汉字宽高比范围内。我们对样本铭文进行了字符分割实验。结果表明,该方法不仅能对规则排列的铭文图像进行分割和处理,而且对不规则排列的铭文图像也能有效地进行分割和处理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信