Wood species identification based on mask R-CNN with multi-feature extraction networks and hyperspectral imaging

IF 2.5 3区 农林科学 Q1 FORESTRY
Zhiqiang Xin, Wenshu Lin, Fulan Liao
{"title":"Wood species identification based on mask R-CNN with multi-feature extraction networks and hyperspectral imaging","authors":"Zhiqiang Xin,&nbsp;Wenshu Lin,&nbsp;Fulan Liao","doi":"10.1007/s00107-025-02297-x","DOIUrl":null,"url":null,"abstract":"<div><p>Different tree species exhibit significant variations in physical properties, uses, and economic value, making accurate species identification crucial. Traditional methods relying on human visual inspection are time-consuming and susceptible to subjective experience and fatigue. This paper proposes an RGB image expansion method based on hyperspectral data and an optimized Mask R-CNN model for wood species identification. First, 600 hyperspectral images of wood blocks of four tree species (Larch, Spruce, Birch, and Poplar) were collected. Principal Component Analysis was used to reduce the dimensionality of the hyperspectral images, followed by spectral band recombination to enhance texture features, resulting in a dataset of 1873 RGB images. Secondly, Leaky ReLU was used in place of ReLU as the activation function for the residual blocks. The ResNet50 and ResNet101 networks, combined with Feature Pyramid Networks were served as the two foundational feature extraction networks, and Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation Network (SENet) were inserted at different layers of the feature extraction network. Experimental results show that appropriate integration of attention mechanisms at different layers of the backbone can improve model accuracy and reduce loss rates. The ResNet101-CBAM3-SENet4 model exhibited the best overall performance, with precision of 0.9574, 0.9778, 0.9592, and 0.9783 for the four wood species in the test set, and an average precision of 0.9680. The mean Average Precision was calculated as 0.9657, and the mean Average Recall was 0.9806. This research provides new directions for dataset expansion in image identification and accurate identification of wood species with similar textures.</p></div>","PeriodicalId":550,"journal":{"name":"European Journal of Wood and Wood Products","volume":"83 4","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Wood and Wood Products","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s00107-025-02297-x","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FORESTRY","Score":null,"Total":0}
引用次数: 0

Abstract

Different tree species exhibit significant variations in physical properties, uses, and economic value, making accurate species identification crucial. Traditional methods relying on human visual inspection are time-consuming and susceptible to subjective experience and fatigue. This paper proposes an RGB image expansion method based on hyperspectral data and an optimized Mask R-CNN model for wood species identification. First, 600 hyperspectral images of wood blocks of four tree species (Larch, Spruce, Birch, and Poplar) were collected. Principal Component Analysis was used to reduce the dimensionality of the hyperspectral images, followed by spectral band recombination to enhance texture features, resulting in a dataset of 1873 RGB images. Secondly, Leaky ReLU was used in place of ReLU as the activation function for the residual blocks. The ResNet50 and ResNet101 networks, combined with Feature Pyramid Networks were served as the two foundational feature extraction networks, and Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation Network (SENet) were inserted at different layers of the feature extraction network. Experimental results show that appropriate integration of attention mechanisms at different layers of the backbone can improve model accuracy and reduce loss rates. The ResNet101-CBAM3-SENet4 model exhibited the best overall performance, with precision of 0.9574, 0.9778, 0.9592, and 0.9783 for the four wood species in the test set, and an average precision of 0.9680. The mean Average Precision was calculated as 0.9657, and the mean Average Recall was 0.9806. This research provides new directions for dataset expansion in image identification and accurate identification of wood species with similar textures.

基于多特征提取网络和高光谱成像掩膜R-CNN的树种识别
不同树种在物理特性、用途和经济价值上表现出显著差异,因此准确的树种鉴定至关重要。传统的方法依赖于人的视觉检测,耗时长,容易受到主观经验和疲劳的影响。提出了一种基于高光谱数据的RGB图像展开方法和优化的Mask R-CNN模型,用于树种识别。首先,采集了落叶松、云杉、桦树和杨树4种树种木块的600幅高光谱图像。采用主成分分析方法对高光谱图像进行降维处理,然后对光谱波段进行重组增强纹理特征,得到1873张RGB图像数据集。其次,用Leaky ReLU代替ReLU作为剩余块的激活函数。将ResNet50和ResNet101网络结合特征金字塔网络作为两个基本特征提取网络,并在特征提取网络的不同层插入卷积块注意模块(CBAM)和挤压激励网络(SENet)。实验结果表明,适当整合不同层次的注意力机制可以提高模型的准确性,降低模型的损失率。ResNet101-CBAM3-SENet4模型综合性能最好,4种木材的精度分别为0.9574、0.9778、0.9592和0.9783,平均精度为0.9680。平均精密度为0.9657,平均召回率为0.9806。该研究为图像识别的数据集扩展和纹理相似树种的准确识别提供了新的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Wood and Wood Products
European Journal of Wood and Wood Products 工程技术-材料科学:纸与木材
CiteScore
5.40
自引率
3.80%
发文量
124
审稿时长
6.0 months
期刊介绍: European Journal of Wood and Wood Products reports on original research and new developments in the field of wood and wood products and their biological, chemical, physical as well as mechanical and technological properties, processes and uses. Subjects range from roundwood to wood based products, composite materials and structural applications, with related jointing techniques. Moreover, it deals with wood as a chemical raw material, source of energy as well as with inter-disciplinary aspects of environmental assessment and international markets. European Journal of Wood and Wood Products aims at promoting international scientific communication and transfer of new technologies from research into practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信