{"title":"上下文驱动的几何一致的文件重建从照片","authors":"Yusuf Coşkuner, Yakup Genç","doi":"10.1109/SIU49456.2020.9302484","DOIUrl":null,"url":null,"abstract":"It is very practical to photograph and store documents using mobile phones. However, it is difficult to obtain a quality document image due to creases on the paper and limitations of the camera pose. These produce geometric distortions and irregular shadows on the document image. The rectification of geometric distortions requires an estimate of the 3D shape of the photographed document. In this study, we introduce a new approach that can estimate the 3D shape of the document using artificial neural networks. Neural network models extract geometric information from the context of the image to create a 3D shape. In addition, an adaptive thresholding algorithm was used to correct lighting-related distortions. Data reflecting actual document conditions were used to train the neural networks. Therefore, in addition to previous studies, the method can be applied to photograph samples which creased in many different ways and photographed from varying perspectives. Comparative experiments show that the method works well.","PeriodicalId":312627,"journal":{"name":"2020 28th Signal Processing and Communications Applications Conference (SIU)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Context Driven Geometry Consistent Document Reconstruction from Photographs\",\"authors\":\"Yusuf Coşkuner, Yakup Genç\",\"doi\":\"10.1109/SIU49456.2020.9302484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is very practical to photograph and store documents using mobile phones. However, it is difficult to obtain a quality document image due to creases on the paper and limitations of the camera pose. These produce geometric distortions and irregular shadows on the document image. The rectification of geometric distortions requires an estimate of the 3D shape of the photographed document. In this study, we introduce a new approach that can estimate the 3D shape of the document using artificial neural networks. Neural network models extract geometric information from the context of the image to create a 3D shape. In addition, an adaptive thresholding algorithm was used to correct lighting-related distortions. Data reflecting actual document conditions were used to train the neural networks. Therefore, in addition to previous studies, the method can be applied to photograph samples which creased in many different ways and photographed from varying perspectives. Comparative experiments show that the method works well.\",\"PeriodicalId\":312627,\"journal\":{\"name\":\"2020 28th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 28th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU49456.2020.9302484\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 28th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU49456.2020.9302484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Context Driven Geometry Consistent Document Reconstruction from Photographs
It is very practical to photograph and store documents using mobile phones. However, it is difficult to obtain a quality document image due to creases on the paper and limitations of the camera pose. These produce geometric distortions and irregular shadows on the document image. The rectification of geometric distortions requires an estimate of the 3D shape of the photographed document. In this study, we introduce a new approach that can estimate the 3D shape of the document using artificial neural networks. Neural network models extract geometric information from the context of the image to create a 3D shape. In addition, an adaptive thresholding algorithm was used to correct lighting-related distortions. Data reflecting actual document conditions were used to train the neural networks. Therefore, in addition to previous studies, the method can be applied to photograph samples which creased in many different ways and photographed from varying perspectives. Comparative experiments show that the method works well.