{"title":"A knowledge-based approach to Chinese archive document understanding","authors":"Shih-Shien You, Gan-How Chang, Pao-Chung Chang, Bing-Shan Chien","doi":"10.1109/ICDAR.1995.601957","DOIUrl":null,"url":null,"abstract":"The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 3rd International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.1995.601957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents.