Generative innovations for paleography: enhancing character image synthesis through unconditional single image models

IF 2.6 1区 艺术学 Q2 CHEMISTRY, ANALYTICAL
A. Aswathy, P. Uma Maheswari
{"title":"Generative innovations for paleography: enhancing character image synthesis through unconditional single image models","authors":"A. Aswathy, P. Uma Maheswari","doi":"10.1186/s40494-024-01373-4","DOIUrl":null,"url":null,"abstract":"<p>Data scarcity in paleographic image datasets poses a significant challenge to researchers and scholars in the field. Unlike modern printed texts, historical manuscripts and documents are often scarce and fragile, making them difficult to digitize and create comprehensive datasets. Recently many innovations have been arrived on single image generative models for natural images but none of them are focused on paleographic character images and other handwritten datasets. In paleographic images like stone inscription characters, maintaining exact shape and structure of character is important unlike natural images. In this paper we propose an unconditional single image generative model, CharGAN for isolated paleographic character images. In the proposed system, augmented images are generated from a single image using generative adversarial networks, while maintaining their structure. Specifically, an external augmentation inducer is used to create higher-level augmentations in the generated images. In addition, the input to the generator is replaced with dynamic sampling from a Gaussian mixture model to make changes to the low-level features. From our experimental results, we infer that these two enhancements make single-image generative models suitable not only for natural images, but also for paleographic character images and other handwritten character datasets, the AHCD dataset, and EMNIST, where the global structure is important. Both the qualitative and quantitative results show that our approach is effective and superior in single-image generative tasks, particularly in isolated character image generation.</p>","PeriodicalId":13109,"journal":{"name":"Heritage Science","volume":"81 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heritage Science","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s40494-024-01373-4","RegionNum":1,"RegionCategory":"艺术学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Data scarcity in paleographic image datasets poses a significant challenge to researchers and scholars in the field. Unlike modern printed texts, historical manuscripts and documents are often scarce and fragile, making them difficult to digitize and create comprehensive datasets. Recently many innovations have been arrived on single image generative models for natural images but none of them are focused on paleographic character images and other handwritten datasets. In paleographic images like stone inscription characters, maintaining exact shape and structure of character is important unlike natural images. In this paper we propose an unconditional single image generative model, CharGAN for isolated paleographic character images. In the proposed system, augmented images are generated from a single image using generative adversarial networks, while maintaining their structure. Specifically, an external augmentation inducer is used to create higher-level augmentations in the generated images. In addition, the input to the generator is replaced with dynamic sampling from a Gaussian mixture model to make changes to the low-level features. From our experimental results, we infer that these two enhancements make single-image generative models suitable not only for natural images, but also for paleographic character images and other handwritten character datasets, the AHCD dataset, and EMNIST, where the global structure is important. Both the qualitative and quantitative results show that our approach is effective and superior in single-image generative tasks, particularly in isolated character image generation.

Abstract Image

古文字学的生成性创新:通过无条件的单一图像模型加强字符图像合成
古籍图像数据集的数据稀缺性给该领域的研究人员和学者带来了巨大挑战。与现代印刷文本不同,历史手稿和文献通常稀缺且易碎,因此难以数字化并创建全面的数据集。最近,人们在自然图像的单图像生成模型方面取得了许多创新成果,但没有一项成果是针对古文字图像和其他手写数据集的。在石刻文字等古文字图像中,与自然图像不同,保持文字的精确形状和结构非常重要。在本文中,我们针对孤立的古文字图像提出了一种无条件的单图像生成模型 CharGAN。在所提出的系统中,使用生成对抗网络从单一图像生成增强图像,同时保持其结构。具体来说,外部增强诱导器用于在生成的图像中创建更高级别的增强。此外,生成器的输入被高斯混合模型的动态采样所取代,从而对低级特征进行更改。根据实验结果,我们推断这两项改进使单图像生成模型不仅适用于自然图像,也适用于古文字图像和其他手写字符数据集、AHCD 数据集和 EMNIST,因为在这些数据集中,全局结构非常重要。定性和定量结果都表明,我们的方法在单图像生成任务中,尤其是在孤立字符图像生成中,是有效和优越的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Heritage Science
Heritage Science Arts and Humanities-Conservation
CiteScore
4.00
自引率
20.00%
发文量
183
审稿时长
19 weeks
期刊介绍: Heritage Science is an open access journal publishing original peer-reviewed research covering: Understanding of the manufacturing processes, provenances, and environmental contexts of material types, objects, and buildings, of cultural significance including their historical significance. Understanding and prediction of physico-chemical and biological degradation processes of cultural artefacts, including climate change, and predictive heritage studies. Development and application of analytical and imaging methods or equipments for non-invasive, non-destructive or portable analysis of artwork and objects of cultural significance to identify component materials, degradation products and deterioration markers. Development and application of invasive and destructive methods for understanding the provenance of objects of cultural significance. Development and critical assessment of treatment materials and methods for artwork and objects of cultural significance. Development and application of statistical methods and algorithms for data analysis to further understanding of culturally significant objects. Publication of reference and corpus datasets as supplementary information to the statistical and analytical studies above. Description of novel technologies that can assist in the understanding of cultural heritage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信