{"title":"A non-overlapping image stitching method for reconstruction of page in ancient Chinese books","authors":"Yizhou Lan, Daoyuan Zheng, Qingwu Hu, Shaohua Wang, Shunli Wang, Tong Yue, Jiayuan Li","doi":"10.1016/j.cviu.2025.104449","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic stitching of page images in ancient Chinese books plays an important role in preservation and transmission of cultural heritage, significantly diminishing the need for manual intervention. Current methods accomplish image stitching based on their overlapping area and struggle with the ancient book pages without overlapping areas. To overcome this hurdle, this study proposes a novel deep learning based method to accurately stitch ancient book pages, which contains three key steps. Firstly, aiming to locate stitching seams precisely, a semantic segmentation model is exploited to predict the thickness masks of page images, and the non-overlapping pages can be stitched by cropping the thickness areas. Secondly, a novel multi-rule page stitching module with two creative page alignment methods is designed to align elements along the stitching seams. Lastly, the proposed method encompasses a self-assessment module, which judiciously selects the optimal stitched outcome from the multiple probable outputs generated by the multi-rule page stitching module. Experimental results demonstrate that the proposed method achieves superior performance in automatic stitching of ancient book pages. The stitching results on over 140 pages from three different ancient books show an accuracy of 82.18%, with 37.75% improvements over existing methods. This method provides a foundation for the automatic digitization of ancient Chinese books, showing significant potential applications in the field of automatic character recognition for historical documents.</div></div>","PeriodicalId":50633,"journal":{"name":"Computer Vision and Image Understanding","volume":"259 ","pages":"Article 104449"},"PeriodicalIF":3.5000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision and Image Understanding","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077314225001729","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic stitching of page images in ancient Chinese books plays an important role in preservation and transmission of cultural heritage, significantly diminishing the need for manual intervention. Current methods accomplish image stitching based on their overlapping area and struggle with the ancient book pages without overlapping areas. To overcome this hurdle, this study proposes a novel deep learning based method to accurately stitch ancient book pages, which contains three key steps. Firstly, aiming to locate stitching seams precisely, a semantic segmentation model is exploited to predict the thickness masks of page images, and the non-overlapping pages can be stitched by cropping the thickness areas. Secondly, a novel multi-rule page stitching module with two creative page alignment methods is designed to align elements along the stitching seams. Lastly, the proposed method encompasses a self-assessment module, which judiciously selects the optimal stitched outcome from the multiple probable outputs generated by the multi-rule page stitching module. Experimental results demonstrate that the proposed method achieves superior performance in automatic stitching of ancient book pages. The stitching results on over 140 pages from three different ancient books show an accuracy of 82.18%, with 37.75% improvements over existing methods. This method provides a foundation for the automatic digitization of ancient Chinese books, showing significant potential applications in the field of automatic character recognition for historical documents.
期刊介绍:
The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views.
Research Areas Include:
• Theory
• Early vision
• Data structures and representations
• Shape
• Range
• Motion
• Matching and recognition
• Architecture and languages
• Vision systems