{"title":"基于深度学习的多模态无人机荔枝果检测选择特征融合","authors":"Debarun Chakraborty;Bhabesh Deka","doi":"10.1109/TAI.2025.3532205","DOIUrl":null,"url":null,"abstract":"In the field of precision agriculture, accurate crop detection is crucial for crop yield estimation, and health monitoring using photogrammetric measurements. Achieving high precision requires advance object detection models and multiscale feature fusion. This article addresses key research gaps in litchi crop monitoring, including the lack of a suitable dataset for litchi detection in natural environment and the limitations of conventional deep learning models in handling challenges such as occlusion, overlapping, and background complexities. First, we prepare high-resolution litchi dataset called “UAVLitchi” of 5000 images that include both RGB and multispectral images and next, we propose a selective feature fusion (SFF)-based architecture for litchi detection. By utilizing both RGB and multispectral images, this architecture effectively mitigates the challenges of visual detection arising from the complex cluster growth structure of litchis, offering a robust solution for accurate detection. The integration of SFF within a dual-channel mask-region based convolutional neural network (Mask-RCNN) leading to significant improvements in feature extraction for litchi detection. Experimental results demonstrate impressive performance, achieving an mean average precession (mAP50) of 94.65%, mAP75 of 89.23%, recall of 90.16%, and F1-score of 91.44%.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 7","pages":"1932-1942"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Learning-Based Selective Feature Fusion for Litchi Fruit Detection Using Multimodal UAV Sensor Measurements\",\"authors\":\"Debarun Chakraborty;Bhabesh Deka\",\"doi\":\"10.1109/TAI.2025.3532205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the field of precision agriculture, accurate crop detection is crucial for crop yield estimation, and health monitoring using photogrammetric measurements. Achieving high precision requires advance object detection models and multiscale feature fusion. This article addresses key research gaps in litchi crop monitoring, including the lack of a suitable dataset for litchi detection in natural environment and the limitations of conventional deep learning models in handling challenges such as occlusion, overlapping, and background complexities. First, we prepare high-resolution litchi dataset called “UAVLitchi” of 5000 images that include both RGB and multispectral images and next, we propose a selective feature fusion (SFF)-based architecture for litchi detection. By utilizing both RGB and multispectral images, this architecture effectively mitigates the challenges of visual detection arising from the complex cluster growth structure of litchis, offering a robust solution for accurate detection. The integration of SFF within a dual-channel mask-region based convolutional neural network (Mask-RCNN) leading to significant improvements in feature extraction for litchi detection. Experimental results demonstrate impressive performance, achieving an mean average precession (mAP50) of 94.65%, mAP75 of 89.23%, recall of 90.16%, and F1-score of 91.44%.\",\"PeriodicalId\":73305,\"journal\":{\"name\":\"IEEE transactions on artificial intelligence\",\"volume\":\"6 7\",\"pages\":\"1932-1942\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on artificial intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10850648/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10850648/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning-Based Selective Feature Fusion for Litchi Fruit Detection Using Multimodal UAV Sensor Measurements
In the field of precision agriculture, accurate crop detection is crucial for crop yield estimation, and health monitoring using photogrammetric measurements. Achieving high precision requires advance object detection models and multiscale feature fusion. This article addresses key research gaps in litchi crop monitoring, including the lack of a suitable dataset for litchi detection in natural environment and the limitations of conventional deep learning models in handling challenges such as occlusion, overlapping, and background complexities. First, we prepare high-resolution litchi dataset called “UAVLitchi” of 5000 images that include both RGB and multispectral images and next, we propose a selective feature fusion (SFF)-based architecture for litchi detection. By utilizing both RGB and multispectral images, this architecture effectively mitigates the challenges of visual detection arising from the complex cluster growth structure of litchis, offering a robust solution for accurate detection. The integration of SFF within a dual-channel mask-region based convolutional neural network (Mask-RCNN) leading to significant improvements in feature extraction for litchi detection. Experimental results demonstrate impressive performance, achieving an mean average precession (mAP50) of 94.65%, mAP75 of 89.23%, recall of 90.16%, and F1-score of 91.44%.