{"title":"An efficient deep learning-based framework for image distortion correction","authors":"Sicheng Li, Yuhui Chu, Yunpeng Zhao, Pengpeng Zhao","doi":"10.1007/s00371-024-03580-3","DOIUrl":null,"url":null,"abstract":"<p>Geometric distortions in digital images, caused by factors such as lens defects and changes in camera angles, substantially influence the fidelity of the image by altering pixel positions and shapes. Current geometric distortion correction methods, focusing on specific types of distortions and relying on high computational resources, face limitations in universality and practicality across diverse real-world applications. We propose here a two-stage distortion correction method that integrates deep learning with traditional image registration algorithms for correcting multiple types of geometric distortion. Compared to state-of-the-art correction methods, our proposed method demonstrates flexibility, capable of addressing a wide range of geometric distortions and achieves superior correction results with fewer parameters. In addition, tests performed on synthetic datasets show an improvement of 10.39% for PSNR, 30.42% for SSIM, and 85% for processing speed, compared to the best performing methods to our knowledge. Finally, experiments with handheld medical endoscopic scanners confirm the applicability and robustness of our method in real-world scenarios. Our method offers a versatile and efficient solution for geometric distortion correction, suitable for various applications, including medical imaging and resource-limited embedded systems. Code is available at https://github.com/MaybeRichard/EffiGeoNet</p>","PeriodicalId":501186,"journal":{"name":"The Visual Computer","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Visual Computer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00371-024-03580-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Geometric distortions in digital images, caused by factors such as lens defects and changes in camera angles, substantially influence the fidelity of the image by altering pixel positions and shapes. Current geometric distortion correction methods, focusing on specific types of distortions and relying on high computational resources, face limitations in universality and practicality across diverse real-world applications. We propose here a two-stage distortion correction method that integrates deep learning with traditional image registration algorithms for correcting multiple types of geometric distortion. Compared to state-of-the-art correction methods, our proposed method demonstrates flexibility, capable of addressing a wide range of geometric distortions and achieves superior correction results with fewer parameters. In addition, tests performed on synthetic datasets show an improvement of 10.39% for PSNR, 30.42% for SSIM, and 85% for processing speed, compared to the best performing methods to our knowledge. Finally, experiments with handheld medical endoscopic scanners confirm the applicability and robustness of our method in real-world scenarios. Our method offers a versatile and efficient solution for geometric distortion correction, suitable for various applications, including medical imaging and resource-limited embedded systems. Code is available at https://github.com/MaybeRichard/EffiGeoNet