通过多分支架构和混合域关注优化AlexNet的准确树种分类

IF 4.6 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2025-04-16 DOI:10.1007/s40747-025-01836-6

Jianjianxian Liu, Tao Xing, Xiangyu Wang

{"title":"通过多分支架构和混合域关注优化AlexNet的准确树种分类","authors":"Jianjianxian Liu, Tao Xing, Xiangyu Wang","doi":"10.1007/s40747-025-01836-6","DOIUrl":null,"url":null,"abstract":"<p>Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"37 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing AlexNet for accurate tree species classification via multi-branch architecture and mixed-domain attention\",\"authors\":\"Jianjianxian Liu, Tao Xing, Xiangyu Wang\",\"doi\":\"10.1007/s40747-025-01836-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.</p>\",\"PeriodicalId\":10524,\"journal\":{\"name\":\"Complex & Intelligent Systems\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Complex & Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s40747-025-01836-6\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01836-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

准确的树种鉴定对有效的林业管理和保护至关重要。简单的深度学习模型，如AlexNet和VGG16，通常难以进行细粒度的纹理提取和特征区分，特别是在复杂的环境中。虽然更高级的模型，如ResNet34和更深层次的架构，提供了卓越的特征提取能力，但它们的代价是更长的训练时间和更高的计算成本。为了解决树种分类中的这些挑战，提出了一种优化的AlexNet架构MMCAlexNet来解决树种分类中的这些挑战。该模型集成了多分支卷积模块、混合域关注模块和联合损失函数，提高了特征提取和类分离能力。多分支卷积模块通过处理不同核大小分支的输入来提取不同的特征，同时捕获精细和全局细节。混合域注意模块通过关注关键的空间和通道特征来增强特征表示。联合损失函数通过平衡分类精度和特征一致性来保证更好的分类分离，防止过拟合。实验结果表明，MMCAlexNet的分类准确率比基线AlexNet高出7.61% ~ 9.69%。虽然优化结构的引入增加了计算复杂度，但它将模型大小减少了12MB。此外，与VGG16相比，MMCAlexNet减少了306.24MB的模型大小，提高了3.94%-6.35%的精度，在提高精度和计算效率之间取得了平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimizing AlexNet for accurate tree species classification via multi-branch architecture and mixed-domain attention

Accurate identification of tree species is essential for effective forestry management and conservation. Simple deep-learning models, such as AlexNet and VGG16, often struggle with fine-grained texture extraction and feature distinction, especially in complex environments. While more advanced models, such as ResNet34 and deeper architectures, offer superior feature extraction capabilities, they come with the trade-off of significantly longer training times and higher computational costs. To address these challenges in tree species classification, an optimized AlexNet architecture, MMCAlexNet, is proposed to address these challenges for tree species classification. The model integrates a multi-branch convolutional module, a mixeddomain attention module, and a joint loss function to improve feature extraction and class separation. The multi-branch convolutional module extracts diverse features by processing input with branches of different kernel sizes, capturing both fine and global details. The mixeddomain attention module enhances feature representation by focusing on critical spatial and channel-wise features. The joint loss function ensures better class separation and prevents overfitting by balancing classification accuracy and feature consistency. Experimental results demonstrate that MMCAlexNet outperforms the baseline AlexNet by 7.61%–9.69% in classification accuracy. Although the introduction of optimized structures increases computational complexity, it reduces the model size by 12MB. Furthermore, MMCAlexNet decreases the model size by 306.24MB and improves accuracy by 3.94%–6.35% compared to VGG16, demonstrating a balance between improved accuracy and computational efficiency.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.