{"title":"ShuiAttNet: Fully convolutional attention network for Shuishu character recognition","authors":"Xiaojun Bi , Lu Han , Weizheng Qiao","doi":"10.1016/j.eswa.2025.127613","DOIUrl":null,"url":null,"abstract":"<div><div>Shuishu is one of the most representative hieroglyphs and precious cultural heritage in China, currently facing the risk of extinction. Preserving this endangered script requires innovative approaches to accurately recognize its characters. However, existing methods face significant challenges, including the inability to handle the broad diversity of Shuishu characters and the complexities of authentic ancient manuscripts. To address these issues, we present a comprehensive study that combines dataset construction and advanced deep learning methods. First, we establish the largest and most diverse Shuishu single-character dataset named S842 to date, addressing the critical lack of publicly available resources for Shuishu. Then we propose a novel Fully Convolutional Attention Network named ShuiAttNet, which is specifically designed for Shuishu character recognition. ShuiAttNet introduces two key innovations: the Attentional MBConv (AMC) block and the Fully Convolutional Attention (FCA) block. The AMC block utilizes a novel feature fusion mechanism to capture fine-grained local details while reducing feature redundancy caused by the low-rank characteristics of Shuishu characters. Meanwhile, the FCA block employs Depthwise Separable Dilated Convolution to establish long-range dependencies while preserving the two-dimensional spatial structure of the images. These components enable ShuiAttNet to achieve superior performance with significantly fewer parameters compared to existing methods. Extensive experiments validate the effectiveness and superiority of ShuiAttNet in both quantitative and qualitative assessments. Experimental results show that our proposed model achieves a Top-1 Acc of 97.04%, outperforming other state-of-the-art methods.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127613"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012357","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Shuishu is one of the most representative hieroglyphs and precious cultural heritage in China, currently facing the risk of extinction. Preserving this endangered script requires innovative approaches to accurately recognize its characters. However, existing methods face significant challenges, including the inability to handle the broad diversity of Shuishu characters and the complexities of authentic ancient manuscripts. To address these issues, we present a comprehensive study that combines dataset construction and advanced deep learning methods. First, we establish the largest and most diverse Shuishu single-character dataset named S842 to date, addressing the critical lack of publicly available resources for Shuishu. Then we propose a novel Fully Convolutional Attention Network named ShuiAttNet, which is specifically designed for Shuishu character recognition. ShuiAttNet introduces two key innovations: the Attentional MBConv (AMC) block and the Fully Convolutional Attention (FCA) block. The AMC block utilizes a novel feature fusion mechanism to capture fine-grained local details while reducing feature redundancy caused by the low-rank characteristics of Shuishu characters. Meanwhile, the FCA block employs Depthwise Separable Dilated Convolution to establish long-range dependencies while preserving the two-dimensional spatial structure of the images. These components enable ShuiAttNet to achieve superior performance with significantly fewer parameters compared to existing methods. Extensive experiments validate the effectiveness and superiority of ShuiAttNet in both quantitative and qualitative assessments. Experimental results show that our proposed model achieves a Top-1 Acc of 97.04%, outperforming other state-of-the-art methods.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.