Chronological differentiation of printed or handwritten text and stamps based on hyperspectral imaging and convolutional neural networks

IF 2.6 3区 化学 Q2 CHEMISTRY, ANALYTICAL
Xiaoquan Lu, Jianqiang Zhang, Fan Li, Jiaquan Wu, Xinyu Zhang, Huihui Ren, Hang Chen and Kun Ma
{"title":"Chronological differentiation of printed or handwritten text and stamps based on hyperspectral imaging and convolutional neural networks","authors":"Xiaoquan Lu, Jianqiang Zhang, Fan Li, Jiaquan Wu, Xinyu Zhang, Huihui Ren, Hang Chen and Kun Ma","doi":"10.1039/D5AY01131K","DOIUrl":null,"url":null,"abstract":"<p >To address the technical challenges associated with determining the chronological order of overlapping stamps and textual content in forensic document examination, this study proposes a novel non-destructive method that integrates hyperspectral imaging (HSI) with convolutional neural networks (CNNs). A multi-type cross-sequence dataset was constructed, comprising 60 samples of handwriting-stamp sequences and 20 samples of printed text-stamp sequences, all subjected to six months of natural aging. Spectral responses were collected across the 400–1000 nm range in the overlapping regions. To suppress spectral noise, multiplicative scatter correction (MSC) was employed as a preprocessing step. The proposed dual-layer CNN architecture consists of an initial convolutional layer with 64 3 × 3 kernels followed by 2 × 2 max pooling, a second convolutional layer with 128 3 × 3 kernels and another 2 × 2 pooling layer, followed by a fully connected layer with 256 neurons that integrates spatial-spectral features, culminating in a four-class classification using Softmax. The model was trained over 150 epochs using the Adam optimizer (learning rate = 0.001) and L2 regularization (<em>λ</em> = 0.001). The approach accurately distinguished between the chronological order of laser-printed toner, gel pen ink, and traditional/photo-sensitive stamp inks. Experimental results demonstrate a classification accuracy of 97.62% (AUC = 0.9965) on the printed text dataset and 96.67% (AUC = 0.9921) on the handwriting dataset, outperforming both extreme learning machine (ELM) (90.42%) and long short-term memory (LSTM) (96.43%) baselines. All pure (non-overlapping) samples were correctly classified with 100% accuracy. Feature analysis confirms the CNN's ability to extract highly discriminative spatial features, effectively overcoming the subjectivity and material-damaging limitations of traditional microscopic techniques.</p>","PeriodicalId":64,"journal":{"name":"Analytical Methods","volume":" 37","pages":" 7519-7532"},"PeriodicalIF":2.6000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Methods","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/ay/d5ay01131k","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

To address the technical challenges associated with determining the chronological order of overlapping stamps and textual content in forensic document examination, this study proposes a novel non-destructive method that integrates hyperspectral imaging (HSI) with convolutional neural networks (CNNs). A multi-type cross-sequence dataset was constructed, comprising 60 samples of handwriting-stamp sequences and 20 samples of printed text-stamp sequences, all subjected to six months of natural aging. Spectral responses were collected across the 400–1000 nm range in the overlapping regions. To suppress spectral noise, multiplicative scatter correction (MSC) was employed as a preprocessing step. The proposed dual-layer CNN architecture consists of an initial convolutional layer with 64 3 × 3 kernels followed by 2 × 2 max pooling, a second convolutional layer with 128 3 × 3 kernels and another 2 × 2 pooling layer, followed by a fully connected layer with 256 neurons that integrates spatial-spectral features, culminating in a four-class classification using Softmax. The model was trained over 150 epochs using the Adam optimizer (learning rate = 0.001) and L2 regularization (λ = 0.001). The approach accurately distinguished between the chronological order of laser-printed toner, gel pen ink, and traditional/photo-sensitive stamp inks. Experimental results demonstrate a classification accuracy of 97.62% (AUC = 0.9965) on the printed text dataset and 96.67% (AUC = 0.9921) on the handwriting dataset, outperforming both extreme learning machine (ELM) (90.42%) and long short-term memory (LSTM) (96.43%) baselines. All pure (non-overlapping) samples were correctly classified with 100% accuracy. Feature analysis confirms the CNN's ability to extract highly discriminative spatial features, effectively overcoming the subjectivity and material-damaging limitations of traditional microscopic techniques.

Abstract Image

基于高光谱成像和卷积神经网络的印刷或手写文本和邮票的时间区分。
为了解决与确定法医文件检查中重叠印章和文本内容的时间顺序相关的技术挑战,本研究提出了一种新的非破坏性方法,该方法将高光谱成像(HSI)与卷积神经网络(cnn)相结合。构建了一个多类型交叉序列数据集,包括60个手写图章序列样本和20个印刷文字图章序列样本,均经过6个月的自然老化。在重叠区域收集400 ~ 1000 nm范围内的光谱响应。为了抑制光谱噪声,采用乘法散射校正(MSC)作为预处理步骤。本文提出的双层CNN架构包括:初始卷积层包含64个3 × 3核,然后是2 × 2最大池化;第二层卷积层包含128个3 × 3核,然后是另一个2 × 2池化层;全连接层包含256个神经元,整合了空间光谱特征,最终使用Softmax进行四类分类。使用Adam优化器(学习率= 0.001)和L2正则化(λ = 0.001)对模型进行了超过150次的训练。该方法准确地区分了激光打印墨粉、凝胶笔墨水和传统/光敏邮票墨水的时间顺序。实验结果表明,该方法在印刷文本数据集上的分类准确率为97.62% (AUC = 0.9965),在手写数据集上的分类准确率为96.67% (AUC = 0.9921),优于极限学习机(ELM)(90.42%)和长短期记忆(LSTM)(96.43%)基线。所有纯(非重叠)样本都以100%的准确率正确分类。特征分析证实了CNN提取高度判别空间特征的能力,有效克服了传统显微技术的主观性和材料破坏局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Analytical Methods
Analytical Methods CHEMISTRY, ANALYTICAL-FOOD SCIENCE & TECHNOLOGY
CiteScore
5.10
自引率
3.20%
发文量
569
审稿时长
1.8 months
期刊介绍: Early applied demonstrations of new analytical methods with clear societal impact
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信