{"title":"基于多任务深度学习的蝴蝶生态图像分析","authors":"Kunkun Zhang , Xin Chen , Bin Wang","doi":"10.1016/j.dsp.2025.105168","DOIUrl":null,"url":null,"abstract":"<div><div>Butterfly ecological image analysis (BEIA) is an exciting and essential field where computer vision can significantly aid in ecological research and biodiversity conservation. Although deep learning has made significant strides in BEIA, the existing models still handle tasks such as segmentation and classification independently, which constrains the potential performance improvements gained by exploiting the correlations between these tasks. In this paper, we design a multi-task deep learning model, named MdeBEIA, to perform both segmentation and classification tasks for BEIA. The MdeBEIA model features a unified encoder that extracts global semantics and spatial information from butterfly images, creating a shared feature representation for both tasks. This approach leverages the intrinsic correlations between segmentation and classification to enhance feature learning. To further boost classification performance, we integrate a Region of Interest Guidance Module (RIGM), which uses intermediate segmentation masks and a self-attention mechanism to refine feature learning by emphasizing contextual relationships. Additionally, we employ a deep mutual learning strategy to improve the model's performance and generalization ability. Experimental results show that MdeBEIA achieves a Jaccard score of 94.70 % in segmentation, surpassing the state-of-the-art by 0.93 %, with comparable inference speeds. In classification, it outperforms the state-of-the-art by 0.81 %, reaching 98.34 %.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"162 ","pages":"Article 105168"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MdeBEIA: Multi-task deep leaning for butterfly ecological image analysis\",\"authors\":\"Kunkun Zhang , Xin Chen , Bin Wang\",\"doi\":\"10.1016/j.dsp.2025.105168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Butterfly ecological image analysis (BEIA) is an exciting and essential field where computer vision can significantly aid in ecological research and biodiversity conservation. Although deep learning has made significant strides in BEIA, the existing models still handle tasks such as segmentation and classification independently, which constrains the potential performance improvements gained by exploiting the correlations between these tasks. In this paper, we design a multi-task deep learning model, named MdeBEIA, to perform both segmentation and classification tasks for BEIA. The MdeBEIA model features a unified encoder that extracts global semantics and spatial information from butterfly images, creating a shared feature representation for both tasks. This approach leverages the intrinsic correlations between segmentation and classification to enhance feature learning. To further boost classification performance, we integrate a Region of Interest Guidance Module (RIGM), which uses intermediate segmentation masks and a self-attention mechanism to refine feature learning by emphasizing contextual relationships. Additionally, we employ a deep mutual learning strategy to improve the model's performance and generalization ability. Experimental results show that MdeBEIA achieves a Jaccard score of 94.70 % in segmentation, surpassing the state-of-the-art by 0.93 %, with comparable inference speeds. In classification, it outperforms the state-of-the-art by 0.81 %, reaching 98.34 %.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"162 \",\"pages\":\"Article 105168\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425001903\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425001903","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
MdeBEIA: Multi-task deep leaning for butterfly ecological image analysis
Butterfly ecological image analysis (BEIA) is an exciting and essential field where computer vision can significantly aid in ecological research and biodiversity conservation. Although deep learning has made significant strides in BEIA, the existing models still handle tasks such as segmentation and classification independently, which constrains the potential performance improvements gained by exploiting the correlations between these tasks. In this paper, we design a multi-task deep learning model, named MdeBEIA, to perform both segmentation and classification tasks for BEIA. The MdeBEIA model features a unified encoder that extracts global semantics and spatial information from butterfly images, creating a shared feature representation for both tasks. This approach leverages the intrinsic correlations between segmentation and classification to enhance feature learning. To further boost classification performance, we integrate a Region of Interest Guidance Module (RIGM), which uses intermediate segmentation masks and a self-attention mechanism to refine feature learning by emphasizing contextual relationships. Additionally, we employ a deep mutual learning strategy to improve the model's performance and generalization ability. Experimental results show that MdeBEIA achieves a Jaccard score of 94.70 % in segmentation, surpassing the state-of-the-art by 0.93 %, with comparable inference speeds. In classification, it outperforms the state-of-the-art by 0.81 %, reaching 98.34 %.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,