{"title":"基于Viterbi算法的傣族棕榈叶手稿文本线分割","authors":"Ge Peng, Pengfei Yu, Haiyan Li, Lesheng He","doi":"10.1109/ICALIP.2016.7846561","DOIUrl":null,"url":null,"abstract":"The text line segmentation process is a key step in an optical character recognition (OCR) system. Several common approaches, such as projection-based methods and stochastic methods, have been put forward to fulfill this task. However, most of existing methods cannot be directly applied to process the palm leaf manuscripts of Dai which the images have poor quality and include smudges, creases, stroke deformation and character touching. To solve this problem, an improved Viterbi algorithm based on Hidden Markov Model (HMM) is proposed to find all possible segmentation paths firstly. And then, a path filtering method is used to detect the optimal paths for the segmented text blocks. The performance of the method is compared with relevant methods and the experimental results demonstrate the effectiveness of the proposed method.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Text line segmentation using Viterbi algorithm for the palm leaf manuscripts of Dai\",\"authors\":\"Ge Peng, Pengfei Yu, Haiyan Li, Lesheng He\",\"doi\":\"10.1109/ICALIP.2016.7846561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The text line segmentation process is a key step in an optical character recognition (OCR) system. Several common approaches, such as projection-based methods and stochastic methods, have been put forward to fulfill this task. However, most of existing methods cannot be directly applied to process the palm leaf manuscripts of Dai which the images have poor quality and include smudges, creases, stroke deformation and character touching. To solve this problem, an improved Viterbi algorithm based on Hidden Markov Model (HMM) is proposed to find all possible segmentation paths firstly. And then, a path filtering method is used to detect the optimal paths for the segmented text blocks. The performance of the method is compared with relevant methods and the experimental results demonstrate the effectiveness of the proposed method.\",\"PeriodicalId\":184170,\"journal\":{\"name\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICALIP.2016.7846561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text line segmentation using Viterbi algorithm for the palm leaf manuscripts of Dai
The text line segmentation process is a key step in an optical character recognition (OCR) system. Several common approaches, such as projection-based methods and stochastic methods, have been put forward to fulfill this task. However, most of existing methods cannot be directly applied to process the palm leaf manuscripts of Dai which the images have poor quality and include smudges, creases, stroke deformation and character touching. To solve this problem, an improved Viterbi algorithm based on Hidden Markov Model (HMM) is proposed to find all possible segmentation paths firstly. And then, a path filtering method is used to detect the optimal paths for the segmented text blocks. The performance of the method is compared with relevant methods and the experimental results demonstrate the effectiveness of the proposed method.