Xiangxiang Zhi, Dingguo Gao, Qijun Zhao, Shuiwang Li, Ci Qu
{"title":"Text Detection in Tibetan Ancient Books: A Benchmark","authors":"Xiangxiang Zhi, Dingguo Gao, Qijun Zhao, Shuiwang Li, Ci Qu","doi":"10.1109/PRML52754.2021.9520727","DOIUrl":null,"url":null,"abstract":"The digitization of Tibetan ancient books is of great significance to the preservation of Tibetan culture. This problem involves two tasks: Tibetan text detection and Tibetan text recognition. The former is undoubtedly crucial to automatic Tibetan text recognition. However, there are few works on Tibetan text detection, and lack of training data has always been a problem, especially for deep learning methods which require massive training data. In this paper, we introduce the TxTAB dataset for evaluating text detection methods in Tibetan ancient books. The dataset is established based upon 202 treasured handwritten ancient Tibetan text images and is densely annotated with a multi-point annotation method without limiting the number of points. This is a challenging dataset with good diversity. It contains blurred images, gray and color images, the text of different sizes, the text of different handwriting styles, etc. An extensive experimental evaluation of 3 state-of-the-art text detection algorithms on TxTAB is presented with detailed analysis, and the results demonstrate that there is still a big room for improvements particularly for detecting Tibetan text in images of low quality.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRML52754.2021.9520727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The digitization of Tibetan ancient books is of great significance to the preservation of Tibetan culture. This problem involves two tasks: Tibetan text detection and Tibetan text recognition. The former is undoubtedly crucial to automatic Tibetan text recognition. However, there are few works on Tibetan text detection, and lack of training data has always been a problem, especially for deep learning methods which require massive training data. In this paper, we introduce the TxTAB dataset for evaluating text detection methods in Tibetan ancient books. The dataset is established based upon 202 treasured handwritten ancient Tibetan text images and is densely annotated with a multi-point annotation method without limiting the number of points. This is a challenging dataset with good diversity. It contains blurred images, gray and color images, the text of different sizes, the text of different handwriting styles, etc. An extensive experimental evaluation of 3 state-of-the-art text detection algorithms on TxTAB is presented with detailed analysis, and the results demonstrate that there is still a big room for improvements particularly for detecting Tibetan text in images of low quality.