Jiaxin Zhang , Peirong Zhang , Dezhi Peng , Haowei Xu , Lianwen Jin
{"title":"增强文档去翘曲评估:提高准确性和效率的新度量","authors":"Jiaxin Zhang , Peirong Zhang , Dezhi Peng , Haowei Xu , Lianwen Jin","doi":"10.1016/j.patrec.2025.04.038","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, the task of document image dewarping has garnered significant attention. With the development of a series of advanced models, the performance on various benchmark datasets has seen considerable improvement, as evidenced by the increasingly better quantitative outcomes. However, several recent studies have unveiled that the commonly used evaluation metrics may not consistently represent the dewarping performance, leading to discrepancies between evaluation results and human perceptual judgments. While some alternative metrics have been recently proposed, their efficacy has not been fully validated, and we found that their performance remains suboptimal. To address these issues, we propose a new metric, termed DocAligner Distortion (DD), to mitigate the deficiencies observed in existing metrics. We conduct comprehensive comparisons and analyses between DD and the prevailing metrics used in document image dewarping. Results demonstrate that DD significantly outperforms its predecessors with better accuracy and efficiency. Codes are available at https://github.com/ZZZHANG-jx/DocAligner-Distortion.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"195 ","pages":"Pages 51-58"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing document dewarping evaluation: A new metric with improved accuracy and efficiency\",\"authors\":\"Jiaxin Zhang , Peirong Zhang , Dezhi Peng , Haowei Xu , Lianwen Jin\",\"doi\":\"10.1016/j.patrec.2025.04.038\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recently, the task of document image dewarping has garnered significant attention. With the development of a series of advanced models, the performance on various benchmark datasets has seen considerable improvement, as evidenced by the increasingly better quantitative outcomes. However, several recent studies have unveiled that the commonly used evaluation metrics may not consistently represent the dewarping performance, leading to discrepancies between evaluation results and human perceptual judgments. While some alternative metrics have been recently proposed, their efficacy has not been fully validated, and we found that their performance remains suboptimal. To address these issues, we propose a new metric, termed DocAligner Distortion (DD), to mitigate the deficiencies observed in existing metrics. We conduct comprehensive comparisons and analyses between DD and the prevailing metrics used in document image dewarping. Results demonstrate that DD significantly outperforms its predecessors with better accuracy and efficiency. Codes are available at https://github.com/ZZZHANG-jx/DocAligner-Distortion.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"195 \",\"pages\":\"Pages 51-58\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167865525001801\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525001801","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Enhancing document dewarping evaluation: A new metric with improved accuracy and efficiency
Recently, the task of document image dewarping has garnered significant attention. With the development of a series of advanced models, the performance on various benchmark datasets has seen considerable improvement, as evidenced by the increasingly better quantitative outcomes. However, several recent studies have unveiled that the commonly used evaluation metrics may not consistently represent the dewarping performance, leading to discrepancies between evaluation results and human perceptual judgments. While some alternative metrics have been recently proposed, their efficacy has not been fully validated, and we found that their performance remains suboptimal. To address these issues, we propose a new metric, termed DocAligner Distortion (DD), to mitigate the deficiencies observed in existing metrics. We conduct comprehensive comparisons and analyses between DD and the prevailing metrics used in document image dewarping. Results demonstrate that DD significantly outperforms its predecessors with better accuracy and efficiency. Codes are available at https://github.com/ZZZHANG-jx/DocAligner-Distortion.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.