Automatic damage identification of Sanskrit palm leaf manuscripts with SegFormer

IF 2.6 1区 艺术学 Q2 CHEMISTRY, ANALYTICAL
Yue Wang, Ming Wen, Xiao Zhou, Feng Gao, Shuai Tian, Dan Jue, Hongmei Lu, Zhimin Zhang
{"title":"Automatic damage identification of Sanskrit palm leaf manuscripts with SegFormer","authors":"Yue Wang, Ming Wen, Xiao Zhou, Feng Gao, Shuai Tian, Dan Jue, Hongmei Lu, Zhimin Zhang","doi":"10.1186/s40494-023-01125-w","DOIUrl":null,"url":null,"abstract":"<p>Palm leaf manuscripts (PLMs) are of great importance in recording Buddhist Scriptures, medicine, history, philosophy, etc. Some damages occur during the use, spread, and preservation procedure. The comprehensive investigation of Sanskrit PLMs is a prerequisite for further conservation and restoration. However, current damage identification and investigation are carried out manually. They require strong professional skills and are extraordinarily time-consuming. In this study, PLM-SegFormer is developed to provide an automated damage segmentation for Sanskrit PLMs based on the SegFormer architecture. Firstly, a digital image dataset of Sanskrit PLMs (the PLM dataset) was obtained from the Potala Palace in Tibet. Then, the hyperparameters for pre-processing, model training, prediction, and post-processing phases were fully optimized to make the SegFormer model more suitable for the PLM damage segmentation task. The optimized segmentation model reaches 70.1% mHit and 51.2% mIoU. The proposed framework automates the damage segmentation of 10,064 folios of PLMs within 12 h. The PLM-SegFormer framework will facilitate the preservation state survey and record of the Palm-leaf manuscript and be of great value to the subsequent preservation and restoration. The source code is available at https://github.com/Ryan21wy/PLM_SegFormer.</p>","PeriodicalId":13109,"journal":{"name":"Heritage Science","volume":"72 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heritage Science","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s40494-023-01125-w","RegionNum":1,"RegionCategory":"艺术学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Palm leaf manuscripts (PLMs) are of great importance in recording Buddhist Scriptures, medicine, history, philosophy, etc. Some damages occur during the use, spread, and preservation procedure. The comprehensive investigation of Sanskrit PLMs is a prerequisite for further conservation and restoration. However, current damage identification and investigation are carried out manually. They require strong professional skills and are extraordinarily time-consuming. In this study, PLM-SegFormer is developed to provide an automated damage segmentation for Sanskrit PLMs based on the SegFormer architecture. Firstly, a digital image dataset of Sanskrit PLMs (the PLM dataset) was obtained from the Potala Palace in Tibet. Then, the hyperparameters for pre-processing, model training, prediction, and post-processing phases were fully optimized to make the SegFormer model more suitable for the PLM damage segmentation task. The optimized segmentation model reaches 70.1% mHit and 51.2% mIoU. The proposed framework automates the damage segmentation of 10,064 folios of PLMs within 12 h. The PLM-SegFormer framework will facilitate the preservation state survey and record of the Palm-leaf manuscript and be of great value to the subsequent preservation and restoration. The source code is available at https://github.com/Ryan21wy/PLM_SegFormer.

Abstract Image

利用 SegFormer 自动识别梵文棕榈叶手稿的损坏情况
棕榈叶手稿(PLM)在记录佛经、医学、历史、哲学等方面具有重要意义。在使用、传播和保存过程中会出现一些损坏。对梵文手稿进行全面调查是进一步保护和修复的先决条件。然而,目前的损坏鉴定和调查都是人工进行的。这需要很强的专业技能,而且非常耗时。本研究开发了 PLM-SegFormer,以 SegFormer 架构为基础为梵文公共图书馆提供自动损伤分割。首先,从西藏布达拉宫获得了梵文普利姆数字图像数据集(PLM 数据集)。然后,对预处理、模型训练、预测和后处理阶段的超参数进行了全面优化,使 SegFormer 模型更适合 PLM 损伤分割任务。优化后的分割模型达到了 70.1% mHit 和 51.2% mIoU。PLM-SegFormer 框架将有助于掌叶手稿的保存状态调查和记录,并对后续的保存和修复工作具有重要价值。源代码见 https://github.com/Ryan21wy/PLM_SegFormer。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Heritage Science
Heritage Science Arts and Humanities-Conservation
CiteScore
4.00
自引率
20.00%
发文量
183
审稿时长
19 weeks
期刊介绍: Heritage Science is an open access journal publishing original peer-reviewed research covering: Understanding of the manufacturing processes, provenances, and environmental contexts of material types, objects, and buildings, of cultural significance including their historical significance. Understanding and prediction of physico-chemical and biological degradation processes of cultural artefacts, including climate change, and predictive heritage studies. Development and application of analytical and imaging methods or equipments for non-invasive, non-destructive or portable analysis of artwork and objects of cultural significance to identify component materials, degradation products and deterioration markers. Development and application of invasive and destructive methods for understanding the provenance of objects of cultural significance. Development and critical assessment of treatment materials and methods for artwork and objects of cultural significance. Development and application of statistical methods and algorithms for data analysis to further understanding of culturally significant objects. Publication of reference and corpus datasets as supplementary information to the statistical and analytical studies above. Description of novel technologies that can assist in the understanding of cultural heritage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信