Yushun Cai , Linghan Gao , Cui Jia , Xiaole Liu , Guanxing Wang , Ying Tian
{"title":"Coniferous tree species classification based on DMCA-Unet network model with UAV multispectral imagery","authors":"Yushun Cai , Linghan Gao , Cui Jia , Xiaole Liu , Guanxing Wang , Ying Tian","doi":"10.1016/j.tfp.2025.101026","DOIUrl":null,"url":null,"abstract":"<div><div>As a critical component of global forest ecosystems, the classification of coniferous tree species facilitates precise assessment of forest resources and enhances forest As critical components of global forest ecosystems, accurate classification of coniferous tree species facilitates precise assessment of forest resources and enhances forest management efficiency, with substantial implications for ecological conservation and carbon sequestration evaluation. However, existing studies predominantly rely on high-cost, processing-intensive hyperspectral data, restricting large-scale practical applications. To address conventional methods' limitations in characterizing complex coniferous species and overcome hyperspectral dependency, this study proposes DMCA-Unet—a semantic segmentation model utilizing cost-effective unmanned aerial vehicle (UAV) Multispectral imagery. Our approach exhaustively exploits spatial-semantic- spectral features of UAV images for tree species classification in heterogeneous forests. Employing multispectral data from the Millennium Xiulin Experimental Zone (Xiongan New Area), the model adopts a VGG-16-based encoder backbone. The framework integrates deep dilated convolution modules that combine depth-wise convolutions with progressive dilation rates and LeakyReLU activations, substituting standard convolution operations to elevate training efficiency, accuracy, and convergence. Furthermore, a multi-head cross-attention mechanism captures interdependencies among heterogeneous features, strengthening classification capability in complex backgrounds. For optimized parameter learning, a combined loss function integrating cross-entropy and Dice loss resolves class imbalance issues during training. Experiments demonstrate: (1) DMCA-Unet achieves 94.45 % overall accuracy, surpassing DeepLabV3+,ViT(Vision Transformer),HRNet, PSPNet, Unet-VGG16, Unet-ResNet50, and U-Net by +4.6 %, +27.78 %, +2.7 %, +11.85 %, +2.31 %, +2.42 %, and +3.54 % respectively at lower computational cost; (2) The hybrid loss markedly enhances accuracy under sample imbalance while stabilizing training. This methodology provides technical support for forest resource monitoring and sustainable management.</div></div>","PeriodicalId":36104,"journal":{"name":"Trees, Forests and People","volume":"22 ","pages":"Article 101026"},"PeriodicalIF":2.9000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Trees, Forests and People","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666719325002523","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FORESTRY","Score":null,"Total":0}
引用次数: 0
Abstract
As a critical component of global forest ecosystems, the classification of coniferous tree species facilitates precise assessment of forest resources and enhances forest As critical components of global forest ecosystems, accurate classification of coniferous tree species facilitates precise assessment of forest resources and enhances forest management efficiency, with substantial implications for ecological conservation and carbon sequestration evaluation. However, existing studies predominantly rely on high-cost, processing-intensive hyperspectral data, restricting large-scale practical applications. To address conventional methods' limitations in characterizing complex coniferous species and overcome hyperspectral dependency, this study proposes DMCA-Unet—a semantic segmentation model utilizing cost-effective unmanned aerial vehicle (UAV) Multispectral imagery. Our approach exhaustively exploits spatial-semantic- spectral features of UAV images for tree species classification in heterogeneous forests. Employing multispectral data from the Millennium Xiulin Experimental Zone (Xiongan New Area), the model adopts a VGG-16-based encoder backbone. The framework integrates deep dilated convolution modules that combine depth-wise convolutions with progressive dilation rates and LeakyReLU activations, substituting standard convolution operations to elevate training efficiency, accuracy, and convergence. Furthermore, a multi-head cross-attention mechanism captures interdependencies among heterogeneous features, strengthening classification capability in complex backgrounds. For optimized parameter learning, a combined loss function integrating cross-entropy and Dice loss resolves class imbalance issues during training. Experiments demonstrate: (1) DMCA-Unet achieves 94.45 % overall accuracy, surpassing DeepLabV3+,ViT(Vision Transformer),HRNet, PSPNet, Unet-VGG16, Unet-ResNet50, and U-Net by +4.6 %, +27.78 %, +2.7 %, +11.85 %, +2.31 %, +2.42 %, and +3.54 % respectively at lower computational cost; (2) The hybrid loss markedly enhances accuracy under sample imbalance while stabilizing training. This methodology provides technical support for forest resource monitoring and sustainable management.