{"title":"通过多分支架构和混合域关注优化AlexNet的准确树种分类","authors":"Jianjianxian Liu, Tao Xing, Xiangyu Wang","doi":"10.1007/s40747-025-01836-6","DOIUrl":null,"url":null,"abstract":"<p>Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"37 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing AlexNet for accurate tree species classification via multi-branch architecture and mixed-domain attention\",\"authors\":\"Jianjianxian Liu, Tao Xing, Xiangyu Wang\",\"doi\":\"10.1007/s40747-025-01836-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.</p>\",\"PeriodicalId\":10524,\"journal\":{\"name\":\"Complex & Intelligent Systems\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Complex & Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s40747-025-01836-6\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01836-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Optimizing AlexNet for accurate tree species classification via multi-branch architecture and mixed-domain attention
Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.
期刊介绍:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.