Arnet:基于多尺度扩张注意和残差动态卷积的木材CT图像分类算法研究

IF 3.1 2区 农林科学 Q1 FORESTRY
Zhishuai Zheng, Zhedong Ge, Huanqi Zheng, Xiaoxia Yang, Lipeng Qin, Xu Wang, Yucheng Zhou
{"title":"Arnet:基于多尺度扩张注意和残差动态卷积的木材CT图像分类算法研究","authors":"Zhishuai Zheng,&nbsp;Zhedong Ge,&nbsp;Huanqi Zheng,&nbsp;Xiaoxia Yang,&nbsp;Lipeng Qin,&nbsp;Xu Wang,&nbsp;Yucheng Zhou","doi":"10.1007/s00226-025-01649-3","DOIUrl":null,"url":null,"abstract":"<div><p>Addressing the challenges of low classification accuracy and protracted identification times posed by lightweight convolutional neural networks (CNNs) for wood micrograph classification, this study introduces ARNet, a novel model tailored for wood CT image analysis.ARNet significantly enhances the overall image recognition performance by boosting its dynamic feature extraction capabilities and refining its proficiency in processing salient features.The methodology employs residual dynamic convolution that dynamically aggregates convolutional kernels in response to the input image, optimizing adaptability.This optimized field of view across disparate feature layers facilitates the extraction of critical information such as wood texture, pore distribution, and cellular arrangement, thereby enhancing analytical depth.Additionally, ARNet incorporates multi-scale dilated attention mechanisms to capture nuanced feature maps across multiple scales, thereby broadening the scope of feature analysis.This approach not only achieves a profound understanding and efficient processing of the input data but also accentuates critical features, significantly enhancing the distinguishability between diverse image categories.The combination of CNNs and Transformers not only extracts rich local and global information but also captures unique features of images on a point-to-point basis, thereby improving classification accuracy. Experiments were conducted on the Mini-ImageNet, CIFAR100, and CIFAR10 public datasets. The results show that ARNet achieved top-1 accuracies of 65.21%, 78.32%, and 93.39% on Mini-ImageNet, CIFAR100, and CIFAR10, respectively, outperforming other models such as RMT, TCFormer, and SSViT. Additionally, we applied ARNet at the Shandong base of the national wood industry engineering research center to identify transverse section micrographs of 20 precious wood types, achieving an accuracy of 99.50% on the test set. After loading the parameters into the re-parameterized model, the validation set accuracy was 99.20%, with a detection time of 0.024s per image. This demonstrates that by combining residual dynamic convolution with multi-scale dilated attention, the accuracy and real-time performance of wood micrograph classification can be effectively improved.</p></div>","PeriodicalId":810,"journal":{"name":"Wood Science and Technology","volume":"59 3","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Arnet: research on wood CT image classification algorithm based on multi-scale dilated attention and residual dynamic convolution\",\"authors\":\"Zhishuai Zheng,&nbsp;Zhedong Ge,&nbsp;Huanqi Zheng,&nbsp;Xiaoxia Yang,&nbsp;Lipeng Qin,&nbsp;Xu Wang,&nbsp;Yucheng Zhou\",\"doi\":\"10.1007/s00226-025-01649-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Addressing the challenges of low classification accuracy and protracted identification times posed by lightweight convolutional neural networks (CNNs) for wood micrograph classification, this study introduces ARNet, a novel model tailored for wood CT image analysis.ARNet significantly enhances the overall image recognition performance by boosting its dynamic feature extraction capabilities and refining its proficiency in processing salient features.The methodology employs residual dynamic convolution that dynamically aggregates convolutional kernels in response to the input image, optimizing adaptability.This optimized field of view across disparate feature layers facilitates the extraction of critical information such as wood texture, pore distribution, and cellular arrangement, thereby enhancing analytical depth.Additionally, ARNet incorporates multi-scale dilated attention mechanisms to capture nuanced feature maps across multiple scales, thereby broadening the scope of feature analysis.This approach not only achieves a profound understanding and efficient processing of the input data but also accentuates critical features, significantly enhancing the distinguishability between diverse image categories.The combination of CNNs and Transformers not only extracts rich local and global information but also captures unique features of images on a point-to-point basis, thereby improving classification accuracy. Experiments were conducted on the Mini-ImageNet, CIFAR100, and CIFAR10 public datasets. The results show that ARNet achieved top-1 accuracies of 65.21%, 78.32%, and 93.39% on Mini-ImageNet, CIFAR100, and CIFAR10, respectively, outperforming other models such as RMT, TCFormer, and SSViT. Additionally, we applied ARNet at the Shandong base of the national wood industry engineering research center to identify transverse section micrographs of 20 precious wood types, achieving an accuracy of 99.50% on the test set. After loading the parameters into the re-parameterized model, the validation set accuracy was 99.20%, with a detection time of 0.024s per image. This demonstrates that by combining residual dynamic convolution with multi-scale dilated attention, the accuracy and real-time performance of wood micrograph classification can be effectively improved.</p></div>\",\"PeriodicalId\":810,\"journal\":{\"name\":\"Wood Science and Technology\",\"volume\":\"59 3\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wood Science and Technology\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00226-025-01649-3\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"FORESTRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wood Science and Technology","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s00226-025-01649-3","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FORESTRY","Score":null,"Total":0}
引用次数: 0

摘要

针对轻量级卷积神经网络(cnn)在木材显微图像分类中分类精度低、识别时间长等问题,本研究引入了一种专门用于木材CT图像分析的新型模型ARNet。ARNet通过增强其动态特征提取能力和改进其处理显著特征的熟练程度,显著提高了整体图像识别性能。该方法采用残差动态卷积,根据输入图像动态聚合卷积核,优化适应性。这种跨不同特征层的优化视野有助于提取关键信息,如木材纹理、孔隙分布和细胞排列,从而提高分析深度。此外,ARNet还结合了多尺度扩张注意机制,以捕获跨多个尺度的细微特征图,从而扩大了特征分析的范围。该方法不仅实现了对输入数据的深刻理解和高效处理,而且突出了关键特征,显著提高了不同图像类别之间的可区分性。cnn和Transformers的结合不仅提取了丰富的局部和全局信息,而且在点对点的基础上捕获了图像的独特特征,从而提高了分类精度。实验在Mini-ImageNet、CIFAR100和CIFAR10公共数据集上进行。结果表明,ARNet在Mini-ImageNet、CIFAR100和CIFAR10上的准确率分别为65.21%、78.32%和93.39%,优于RMT、TCFormer和SSViT等其他模型。此外,我们在国家木材工业工程研究中心山东基地应用ARNet对20种珍贵木材类型的横切面显微图进行了识别,在测试集上的准确率达到99.50%。将参数加载到重新参数化的模型中后,验证集的准确率为99.20%,每张图像的检测时间为0.024s。说明残差动态卷积与多尺度扩张注意相结合,可以有效提高木材显微图像分类的准确率和实时性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Arnet: research on wood CT image classification algorithm based on multi-scale dilated attention and residual dynamic convolution

Addressing the challenges of low classification accuracy and protracted identification times posed by lightweight convolutional neural networks (CNNs) for wood micrograph classification, this study introduces ARNet, a novel model tailored for wood CT image analysis.ARNet significantly enhances the overall image recognition performance by boosting its dynamic feature extraction capabilities and refining its proficiency in processing salient features.The methodology employs residual dynamic convolution that dynamically aggregates convolutional kernels in response to the input image, optimizing adaptability.This optimized field of view across disparate feature layers facilitates the extraction of critical information such as wood texture, pore distribution, and cellular arrangement, thereby enhancing analytical depth.Additionally, ARNet incorporates multi-scale dilated attention mechanisms to capture nuanced feature maps across multiple scales, thereby broadening the scope of feature analysis.This approach not only achieves a profound understanding and efficient processing of the input data but also accentuates critical features, significantly enhancing the distinguishability between diverse image categories.The combination of CNNs and Transformers not only extracts rich local and global information but also captures unique features of images on a point-to-point basis, thereby improving classification accuracy. Experiments were conducted on the Mini-ImageNet, CIFAR100, and CIFAR10 public datasets. The results show that ARNet achieved top-1 accuracies of 65.21%, 78.32%, and 93.39% on Mini-ImageNet, CIFAR100, and CIFAR10, respectively, outperforming other models such as RMT, TCFormer, and SSViT. Additionally, we applied ARNet at the Shandong base of the national wood industry engineering research center to identify transverse section micrographs of 20 precious wood types, achieving an accuracy of 99.50% on the test set. After loading the parameters into the re-parameterized model, the validation set accuracy was 99.20%, with a detection time of 0.024s per image. This demonstrates that by combining residual dynamic convolution with multi-scale dilated attention, the accuracy and real-time performance of wood micrograph classification can be effectively improved.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Wood Science and Technology
Wood Science and Technology 工程技术-材料科学:纸与木材
CiteScore
5.90
自引率
5.90%
发文量
75
审稿时长
3 months
期刊介绍: Wood Science and Technology publishes original scientific research results and review papers covering the entire field of wood material science, wood components and wood based products. Subjects are wood biology and wood quality, wood physics and physical technologies, wood chemistry and chemical technologies. Latest advances in areas such as cell wall and wood formation; structural and chemical composition of wood and wood composites and their property relations; physical, mechanical and chemical characterization and relevant methodological developments, and microbiological degradation of wood and wood based products are reported. Topics related to wood technology include machining, gluing, and finishing, composite technology, wood modification, wood mechanics, creep and rheology, and the conversion of wood into pulp and biorefinery products.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信