Bo Zhao, Yu Zhou, Zhengyu Zhang, Ying Na, Tinghuai Ma
{"title":"Information Quantity Based Automatic Reconstruction of Shredded Chinese Documents","authors":"Bo Zhao, Yu Zhou, Zhengyu Zhang, Ying Na, Tinghuai Ma","doi":"10.1109/ICTAI.2014.154","DOIUrl":null,"url":null,"abstract":"The reconstruction of shredded documents has a great significance in the fields of forensics, reconstruction of historical documents, and intelligence analysis. The reconstruction of cross-cut shredded Chinese documents is presented in this paper. The Evaluation of Match Degree is divided into two sub-problems, feature and the corresponding scoring function. A new method of the Evaluation of Match Degree which is suitable for shredded Chinese documents is presented. Information Quantity is introduced to measure the reliability of each matching, instead of regarding as the same. A novel and effective algorithm of automatic reconstruction based on Information Quantity is put forward to control the serious propagation of errors caused by the matching of shreds with low Information Quantity. Not only is the propagation of errors controlled effectively, and the error ratio reduced, but also the time complexity decreases greatly. Experiments have proven the high accuracy and superiority of the algorithm proposed in this paper.","PeriodicalId":142794,"journal":{"name":"2014 IEEE 26th International Conference on Tools with Artificial Intelligence","volume":"253 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 26th International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2014.154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The reconstruction of shredded documents has a great significance in the fields of forensics, reconstruction of historical documents, and intelligence analysis. The reconstruction of cross-cut shredded Chinese documents is presented in this paper. The Evaluation of Match Degree is divided into two sub-problems, feature and the corresponding scoring function. A new method of the Evaluation of Match Degree which is suitable for shredded Chinese documents is presented. Information Quantity is introduced to measure the reliability of each matching, instead of regarding as the same. A novel and effective algorithm of automatic reconstruction based on Information Quantity is put forward to control the serious propagation of errors caused by the matching of shreds with low Information Quantity. Not only is the propagation of errors controlled effectively, and the error ratio reduced, but also the time complexity decreases greatly. Experiments have proven the high accuracy and superiority of the algorithm proposed in this paper.