普通折叠:利用四折从单个图像中去除打印文档

Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI:10.1145/3103010.3121030

Sagnik Das, Gaurav Mishra, A. Sudharshana, Roy Shilkrot

{"title":"普通折叠:利用四折从单个图像中去除打印文档","authors":"Sagnik Das, Gaurav Mishra, A. Sudharshana, Roy Shilkrot","doi":"10.1145/3103010.3121030","DOIUrl":null,"url":null,"abstract":"Handheld cameras are currently the device of choice for performing document digitization, due to their convenience, ubiquity and high performance at low cost. Software methods process a captured image, to rectify distortions and reconstruct the original document. Existing methods struggle to reconstruct a flattened version given a single image of a document distorted by folding. We propose a novel non-parametric page dewarping approach from a single image based on deep learning to identify creases due to folds on the paper. Our method then performs a 2D boundary method based on polynomial regression, and a Coons patch, to get a flattened reconstruction. We found our method improves OCR word accuracy by more than 2.5 times when compared to the original distorted image.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":"{\"title\":\"The Common Fold: Utilizing the Four-Fold to Dewarp Printed Documents from a Single Image\",\"authors\":\"Sagnik Das, Gaurav Mishra, A. Sudharshana, Roy Shilkrot\",\"doi\":\"10.1145/3103010.3121030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Handheld cameras are currently the device of choice for performing document digitization, due to their convenience, ubiquity and high performance at low cost. Software methods process a captured image, to rectify distortions and reconstruct the original document. Existing methods struggle to reconstruct a flattened version given a single image of a document distorted by folding. We propose a novel non-parametric page dewarping approach from a single image based on deep learning to identify creases due to folds on the paper. Our method then performs a 2D boundary method based on polynomial regression, and a Coons patch, to get a flattened reconstruction. We found our method improves OCR word accuracy by more than 2.5 times when compared to the original distorted image.\",\"PeriodicalId\":200469,\"journal\":{\"name\":\"Proceedings of the 2017 ACM Symposium on Document Engineering\",\"volume\":\"100 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"27\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 ACM Symposium on Document Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3103010.3121030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 ACM Symposium on Document Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3103010.3121030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

摘要

手持相机由于其便利性、普及性和低成本的高性能，是目前进行文档数字化的首选设备。软件方法处理捕获的图像，纠正失真并重建原始文档。现有的方法很难重建一个被折叠扭曲的文件图像的扁平版本。我们提出了一种基于深度学习的单幅图像的非参数页面去翘曲方法，以识别由于纸张上的折叠而产生的折痕。然后，我们的方法执行基于多项式回归的二维边界方法和Coons补丁，以获得平坦的重建。我们发现，与原始失真图像相比，我们的方法将OCR单词精度提高了2.5倍以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The Common Fold: Utilizing the Four-Fold to Dewarp Printed Documents from a Single Image

Handheld cameras are currently the device of choice for performing document digitization, due to their convenience, ubiquity and high performance at low cost. Software methods process a captured image, to rectify distortions and reconstruct the original document. Existing methods struggle to reconstruct a flattened version given a single image of a document distorted by folding. We propose a novel non-parametric page dewarping approach from a single image based on deep learning to identify creases due to folds on the paper. Our method then performs a 2D boundary method based on polynomial regression, and a Coons patch, to get a flattened reconstruction. We found our method improves OCR word accuracy by more than 2.5 times when compared to the original distorted image.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2017 ACM Symposium on Document Engineering

自引率

0.00%

发文量