{"title":"A Robust Segmentation Method for Chinese Inscription Images","authors":"Xi Xia, Lin Sh","doi":"10.1109/ICDSBA53075.2021.00043","DOIUrl":null,"url":null,"abstract":"Inscriptions are important carriers of Chinese calligraphy which has high calligraphic, artistic and cultural value. Segmentation of Chinese inscription images play a fundamental role in processing of Chinese character images. Traditional projection segmentation methods can be used for Chinese inscription image segmentation because Chinese characters in most inscription images arrange regularly. However, in an actual segmentation process, the slight offset of the glyph position or a crossing of strokes after projection caused projection segmentation method failure. In order to solve this problem, a Chinese inscription image segmentation method based on clustering algorithm was provided. Firstly, preprocessed the image, and then looked up the contour of the inscription images to exclude the obvious abnormal part of the contour size; Secondly, filled in the remaining contours to obtain a clustered sample set; Thirdly, used the DBSCAN clustering algorithm to cluster the sample set to generate several clusters which each of those clusters represented a Chinese character image; Finally, for clusters that did not satisfy the outline range of Chinese characters, adjusted the minimum neighborhood until the aspect ratio of the circumscribed rectangle of the cluster was within the range of the aspect ratio of Chinese characters. We conducted character segmentation experiments on sample inscriptions. Results showed that out method not only segmented and processed the regularly arranged inscriptions images, but also worked efficiently for irregularly arranged inscriptions images.","PeriodicalId":154348,"journal":{"name":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSBA53075.2021.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Inscriptions are important carriers of Chinese calligraphy which has high calligraphic, artistic and cultural value. Segmentation of Chinese inscription images play a fundamental role in processing of Chinese character images. Traditional projection segmentation methods can be used for Chinese inscription image segmentation because Chinese characters in most inscription images arrange regularly. However, in an actual segmentation process, the slight offset of the glyph position or a crossing of strokes after projection caused projection segmentation method failure. In order to solve this problem, a Chinese inscription image segmentation method based on clustering algorithm was provided. Firstly, preprocessed the image, and then looked up the contour of the inscription images to exclude the obvious abnormal part of the contour size; Secondly, filled in the remaining contours to obtain a clustered sample set; Thirdly, used the DBSCAN clustering algorithm to cluster the sample set to generate several clusters which each of those clusters represented a Chinese character image; Finally, for clusters that did not satisfy the outline range of Chinese characters, adjusted the minimum neighborhood until the aspect ratio of the circumscribed rectangle of the cluster was within the range of the aspect ratio of Chinese characters. We conducted character segmentation experiments on sample inscriptions. Results showed that out method not only segmented and processed the regularly arranged inscriptions images, but also worked efficiently for irregularly arranged inscriptions images.