{"title":"基于学习的电子纸显示退化文档图像映射","authors":"Xianbin Zhang, Shufan Pei, Liqun Lin, Xiaoyan Zhao, Jiawei Xu, Tiesong Zhao","doi":"10.1002/jsid.2040","DOIUrl":null,"url":null,"abstract":"<p>With the widespread use of E-paper technology, numerous documents are being digitized and displayed on E-paper screens. However, the display quality of degraded document images on E-paper often suffers from a lack of detail. To address this challenge, we introduce a mapping model that converts color images into E-paper display images. This model leverages U-Net++ as its backbone, integrating residual connectivity and dual attention modules. Given the presence of varying stroke thicknesses in document images, a fixed-size convolutional kernel is insufficient. Therefore, we propose multi-branch channels and spatial attention modules (MCSAM), which combines the selective kernel network (SKNet) with a spatial attention mechanism to adaptively select the appropriate convolutional kernel size based on font size. To demonstrate its effectiveness, we tested the mapped images on a custom E-paper display platform. Experimental results highlight the superior performance of our proposed method.</p>","PeriodicalId":49979,"journal":{"name":"Journal of the Society for Information Display","volume":"33 6","pages":"801-808"},"PeriodicalIF":2.2000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning-based image mapping for degraded documents on E-paper display\",\"authors\":\"Xianbin Zhang, Shufan Pei, Liqun Lin, Xiaoyan Zhao, Jiawei Xu, Tiesong Zhao\",\"doi\":\"10.1002/jsid.2040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>With the widespread use of E-paper technology, numerous documents are being digitized and displayed on E-paper screens. However, the display quality of degraded document images on E-paper often suffers from a lack of detail. To address this challenge, we introduce a mapping model that converts color images into E-paper display images. This model leverages U-Net++ as its backbone, integrating residual connectivity and dual attention modules. Given the presence of varying stroke thicknesses in document images, a fixed-size convolutional kernel is insufficient. Therefore, we propose multi-branch channels and spatial attention modules (MCSAM), which combines the selective kernel network (SKNet) with a spatial attention mechanism to adaptively select the appropriate convolutional kernel size based on font size. To demonstrate its effectiveness, we tested the mapped images on a custom E-paper display platform. Experimental results highlight the superior performance of our proposed method.</p>\",\"PeriodicalId\":49979,\"journal\":{\"name\":\"Journal of the Society for Information Display\",\"volume\":\"33 6\",\"pages\":\"801-808\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Society for Information Display\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://sid.onlinelibrary.wiley.com/doi/10.1002/jsid.2040\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Society for Information Display","FirstCategoryId":"5","ListUrlMain":"https://sid.onlinelibrary.wiley.com/doi/10.1002/jsid.2040","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Learning-based image mapping for degraded documents on E-paper display
With the widespread use of E-paper technology, numerous documents are being digitized and displayed on E-paper screens. However, the display quality of degraded document images on E-paper often suffers from a lack of detail. To address this challenge, we introduce a mapping model that converts color images into E-paper display images. This model leverages U-Net++ as its backbone, integrating residual connectivity and dual attention modules. Given the presence of varying stroke thicknesses in document images, a fixed-size convolutional kernel is insufficient. Therefore, we propose multi-branch channels and spatial attention modules (MCSAM), which combines the selective kernel network (SKNet) with a spatial attention mechanism to adaptively select the appropriate convolutional kernel size based on font size. To demonstrate its effectiveness, we tested the mapped images on a custom E-paper display platform. Experimental results highlight the superior performance of our proposed method.
期刊介绍:
The Journal of the Society for Information Display publishes original works dealing with the theory and practice of information display. Coverage includes materials, devices and systems; the underlying chemistry, physics, physiology and psychology; measurement techniques, manufacturing technologies; and all aspects of the interaction between equipment and its users. Review articles are also published in all of these areas. Occasional special issues or sections consist of collections of papers on specific topical areas or collections of full length papers based in part on oral or poster presentations given at SID sponsored conferences.