Heguang Sun , Huanming Mai , Yanzhi Mao , Qingquan Li , Mei Guo , Yongshun Liu , Ziheng Feng , Haikuan Feng , Wei Guo , Guijun Yang , Xiaoling Deng , XiaoYu Song
{"title":"基于改进SMOTE-CS小样本建模和深度特征学习的无人机多品种马铃薯晚疫病监测","authors":"Heguang Sun , Huanming Mai , Yanzhi Mao , Qingquan Li , Mei Guo , Yongshun Liu , Ziheng Feng , Haikuan Feng , Wei Guo , Guijun Yang , Xiaoling Deng , XiaoYu Song","doi":"10.1016/j.eja.2025.127702","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and non-destructive monitoring of potato late blight (PLB) using unmanned aerial vehicle (UAV) remote sensing data is of great significance for field management. However, during disease outbreaks, there is a lack of universally applicable rapid monitoring models. On the one hand, different varieties exhibit varying levels of resistance and disease monitoring progression, which can be attributed to genetic and environmental factors. On the other hand, the heterogeneity, imbalance, and noise in spectral and textural data across regions pose significant challenges for disease monitoring. To address these issues, this study first improves upon the noise problem in Synthetic Minority Over-sampling Technique (SMOTE) by employing an enhanced feature selection algorithm based on the Feature Selection with Compactness and Separability (FS-CS) principle. Subsequently, the feature ranking is then used with the Importance-Ordered Weighted Averaging (IOWA) operator to calculate the induced Minkowski OWA distance (IMOWAD), replacing the nearest neighbor distance used in SMOTE. This refinement emphasizes the boundaries of synthetic sample regions and mitigates noise-related issues. This improved method is referred to as SMOTE-CS. Secondly, nine models were constructed to evaluate the effectiveness of FS-CS in feature selection when integrating multiple datasets. Compared to mRmR and ReliefF, FS-CS achieved higher accuracy with a smaller number of features. Finally, to address varietal and environmental differences, modeling was conducted using a shallow transfer learning 1D-CNN model and a deep DRSN model incorporating nonlinear soft thresholding processing, respectively. The results indicate that the 1D-CNN model achieved overall accuracies (OA) of 0.99 and 0.93 on the two datasets, respectively. However, its performance was affected by the poor interpretability of the boundary between the synthetic source and target domain samples. The integration of nonlinear soft-thresholding into the DRSN model enhanced its feature extraction capability and noise suppression. It demonstrated strong performance on the two datasets, achieving an OA of 0.91 and a Kappa coefficient of 0.86. Compared to the original SMOTE version, the proposed approach exhibited superior generalization ability. The results of this study provide new insights into the problems of small sample imbalance, noise, and technical support for multi-species PLB monitoring in different regions.</div></div>","PeriodicalId":51045,"journal":{"name":"European Journal of Agronomy","volume":"169 ","pages":"Article 127702"},"PeriodicalIF":4.5000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-variety monitoring of potato late blight severity using UAV data with improved SMOTE-CS for small sample modeling and deep feature learning\",\"authors\":\"Heguang Sun , Huanming Mai , Yanzhi Mao , Qingquan Li , Mei Guo , Yongshun Liu , Ziheng Feng , Haikuan Feng , Wei Guo , Guijun Yang , Xiaoling Deng , XiaoYu Song\",\"doi\":\"10.1016/j.eja.2025.127702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate and non-destructive monitoring of potato late blight (PLB) using unmanned aerial vehicle (UAV) remote sensing data is of great significance for field management. However, during disease outbreaks, there is a lack of universally applicable rapid monitoring models. On the one hand, different varieties exhibit varying levels of resistance and disease monitoring progression, which can be attributed to genetic and environmental factors. On the other hand, the heterogeneity, imbalance, and noise in spectral and textural data across regions pose significant challenges for disease monitoring. To address these issues, this study first improves upon the noise problem in Synthetic Minority Over-sampling Technique (SMOTE) by employing an enhanced feature selection algorithm based on the Feature Selection with Compactness and Separability (FS-CS) principle. Subsequently, the feature ranking is then used with the Importance-Ordered Weighted Averaging (IOWA) operator to calculate the induced Minkowski OWA distance (IMOWAD), replacing the nearest neighbor distance used in SMOTE. This refinement emphasizes the boundaries of synthetic sample regions and mitigates noise-related issues. This improved method is referred to as SMOTE-CS. Secondly, nine models were constructed to evaluate the effectiveness of FS-CS in feature selection when integrating multiple datasets. Compared to mRmR and ReliefF, FS-CS achieved higher accuracy with a smaller number of features. Finally, to address varietal and environmental differences, modeling was conducted using a shallow transfer learning 1D-CNN model and a deep DRSN model incorporating nonlinear soft thresholding processing, respectively. The results indicate that the 1D-CNN model achieved overall accuracies (OA) of 0.99 and 0.93 on the two datasets, respectively. However, its performance was affected by the poor interpretability of the boundary between the synthetic source and target domain samples. The integration of nonlinear soft-thresholding into the DRSN model enhanced its feature extraction capability and noise suppression. It demonstrated strong performance on the two datasets, achieving an OA of 0.91 and a Kappa coefficient of 0.86. Compared to the original SMOTE version, the proposed approach exhibited superior generalization ability. The results of this study provide new insights into the problems of small sample imbalance, noise, and technical support for multi-species PLB monitoring in different regions.</div></div>\",\"PeriodicalId\":51045,\"journal\":{\"name\":\"European Journal of Agronomy\",\"volume\":\"169 \",\"pages\":\"Article 127702\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Agronomy\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1161030125001984\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Agronomy","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1161030125001984","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
Multi-variety monitoring of potato late blight severity using UAV data with improved SMOTE-CS for small sample modeling and deep feature learning
Accurate and non-destructive monitoring of potato late blight (PLB) using unmanned aerial vehicle (UAV) remote sensing data is of great significance for field management. However, during disease outbreaks, there is a lack of universally applicable rapid monitoring models. On the one hand, different varieties exhibit varying levels of resistance and disease monitoring progression, which can be attributed to genetic and environmental factors. On the other hand, the heterogeneity, imbalance, and noise in spectral and textural data across regions pose significant challenges for disease monitoring. To address these issues, this study first improves upon the noise problem in Synthetic Minority Over-sampling Technique (SMOTE) by employing an enhanced feature selection algorithm based on the Feature Selection with Compactness and Separability (FS-CS) principle. Subsequently, the feature ranking is then used with the Importance-Ordered Weighted Averaging (IOWA) operator to calculate the induced Minkowski OWA distance (IMOWAD), replacing the nearest neighbor distance used in SMOTE. This refinement emphasizes the boundaries of synthetic sample regions and mitigates noise-related issues. This improved method is referred to as SMOTE-CS. Secondly, nine models were constructed to evaluate the effectiveness of FS-CS in feature selection when integrating multiple datasets. Compared to mRmR and ReliefF, FS-CS achieved higher accuracy with a smaller number of features. Finally, to address varietal and environmental differences, modeling was conducted using a shallow transfer learning 1D-CNN model and a deep DRSN model incorporating nonlinear soft thresholding processing, respectively. The results indicate that the 1D-CNN model achieved overall accuracies (OA) of 0.99 and 0.93 on the two datasets, respectively. However, its performance was affected by the poor interpretability of the boundary between the synthetic source and target domain samples. The integration of nonlinear soft-thresholding into the DRSN model enhanced its feature extraction capability and noise suppression. It demonstrated strong performance on the two datasets, achieving an OA of 0.91 and a Kappa coefficient of 0.86. Compared to the original SMOTE version, the proposed approach exhibited superior generalization ability. The results of this study provide new insights into the problems of small sample imbalance, noise, and technical support for multi-species PLB monitoring in different regions.
期刊介绍:
The European Journal of Agronomy, the official journal of the European Society for Agronomy, publishes original research papers reporting experimental and theoretical contributions to field-based agronomy and crop science. The journal will consider research at the field level for agricultural, horticultural and tree crops, that uses comprehensive and explanatory approaches. The EJA covers the following topics:
crop physiology
crop production and management including irrigation, fertilization and soil management
agroclimatology and modelling
plant-soil relationships
crop quality and post-harvest physiology
farming and cropping systems
agroecosystems and the environment
crop-weed interactions and management
organic farming
horticultural crops
papers from the European Society for Agronomy bi-annual meetings
In determining the suitability of submitted articles for publication, particular scrutiny is placed on the degree of novelty and significance of the research and the extent to which it adds to existing knowledge in agronomy.