A non-overlapping image stitching method for reconstruction of page in ancient Chinese books

IF 3.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Computer Vision and Image Understanding Pub Date : 2025-07-22 DOI:10.1016/j.cviu.2025.104449

Yizhou Lan, Daoyuan Zheng, Qingwu Hu, Shaohua Wang, Shunli Wang, Tong Yue, Jiayuan Li

{"title":"A non-overlapping image stitching method for reconstruction of page in ancient Chinese books","authors":"Yizhou Lan, Daoyuan Zheng, Qingwu Hu, Shaohua Wang, Shunli Wang, Tong Yue, Jiayuan Li","doi":"10.1016/j.cviu.2025.104449","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic stitching of page images in ancient Chinese books plays an important role in preservation and transmission of cultural heritage, significantly diminishing the need for manual intervention. Current methods accomplish image stitching based on their overlapping area and struggle with the ancient book pages without overlapping areas. To overcome this hurdle, this study proposes a novel deep learning based method to accurately stitch ancient book pages, which contains three key steps. Firstly, aiming to locate stitching seams precisely, a semantic segmentation model is exploited to predict the thickness masks of page images, and the non-overlapping pages can be stitched by cropping the thickness areas. Secondly, a novel multi-rule page stitching module with two creative page alignment methods is designed to align elements along the stitching seams. Lastly, the proposed method encompasses a self-assessment module, which judiciously selects the optimal stitched outcome from the multiple probable outputs generated by the multi-rule page stitching module. Experimental results demonstrate that the proposed method achieves superior performance in automatic stitching of ancient book pages. The stitching results on over 140 pages from three different ancient books show an accuracy of 82.18%, with 37.75% improvements over existing methods. This method provides a foundation for the automatic digitization of ancient Chinese books, showing significant potential applications in the field of automatic character recognition for historical documents.</div></div>","PeriodicalId":50633,"journal":{"name":"Computer Vision and Image Understanding","volume":"259 ","pages":"Article 104449"},"PeriodicalIF":3.5000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Vision and Image Understanding","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077314225001729","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Automatic stitching of page images in ancient Chinese books plays an important role in preservation and transmission of cultural heritage, significantly diminishing the need for manual intervention. Current methods accomplish image stitching based on their overlapping area and struggle with the ancient book pages without overlapping areas. To overcome this hurdle, this study proposes a novel deep learning based method to accurately stitch ancient book pages, which contains three key steps. Firstly, aiming to locate stitching seams precisely, a semantic segmentation model is exploited to predict the thickness masks of page images, and the non-overlapping pages can be stitched by cropping the thickness areas. Secondly, a novel multi-rule page stitching module with two creative page alignment methods is designed to align elements along the stitching seams. Lastly, the proposed method encompasses a self-assessment module, which judiciously selects the optimal stitched outcome from the multiple probable outputs generated by the multi-rule page stitching module. Experimental results demonstrate that the proposed method achieves superior performance in automatic stitching of ancient book pages. The stitching results on over 140 pages from three different ancient books show an accuracy of 82.18%, with 37.75% improvements over existing methods. This method provides a foundation for the automatic digitization of ancient Chinese books, showing significant potential applications in the field of automatic character recognition for historical documents.

查看原文本刊更多论文

一种用于古籍页面重建的无重叠图像拼接方法

中国古籍页面图像的自动拼接在文化遗产的保存和传播中发挥了重要作用，大大减少了人工干预的需要。现有的方法是基于图像的重叠区域进行图像拼接，并与没有重叠区域的古书页面进行斗争。为了克服这一障碍，本研究提出了一种基于深度学习的古书书页精确拼接方法，该方法包含三个关键步骤。首先，为了精确定位拼接缝，利用语义分割模型预测页面图像的厚度掩模，通过裁剪厚度区域对不重叠的页面进行拼接；其次，设计了一种新颖的多规则页面拼接模块，采用两种创造性的页面对齐方法，实现了元素沿拼接线对齐；最后，该方法包含一个自评估模块，该模块从多规则页面拼接模块生成的多个可能输出中明智地选择最优的拼接结果。实验结果表明，该方法在古书页面自动拼接中取得了较好的效果。对三种不同古籍140多页的拼接结果显示，拼接准确率为82.18%，比现有方法提高了37.75%。该方法为中国古籍自动数字化提供了基础，在历史文献字符自动识别领域具有重要的应用潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Vision and Image Understanding 工程技术-工程：电子与电气

CiteScore

7.80

自引率

4.40%

发文量

112

审稿时长

79 days

期刊介绍： The central focus of this journal is the computer analysis of pictorial information. Computer Vision and Image Understanding publishes papers covering all aspects of image analysis from the low-level, iconic processes of early vision to the high-level, symbolic processes of recognition and interpretation. A wide range of topics in the image understanding area is covered, including papers offering insights that differ from predominant views. Research Areas Include: • Theory • Early vision • Data structures and representations • Shape • Range • Motion • Matching and recognition • Architecture and languages • Vision systems