{"title":"基于过采样深度卷积网络的近红外高光谱图像小麦品种无损识别","authors":"Nitin Tyagi, Sarvagya Porwal, Pradeep Singh, Balasubramanian Raman, Neerja Garg","doi":"10.1007/s10921-024-01143-z","DOIUrl":null,"url":null,"abstract":"<div><p>Differentiating between wheat species poses a significant challenge to the Indian grain industry. Visual inspection of wheat species has drawbacks, including inconsistency, low throughput, and labor intensiveness. In this study, near-infrared hyperspectral imaging (NIR-HSI) was utilized in conjunction with a deep learning approach to achieve precise predictions of wheat at the species level. A dataset comprising 40 different varieties from four Indian wheat species, namely <i>Triticum aestivum</i> (<i>T. aestivum</i>), <i>Triticum durum</i> (<i>T. durum</i>), <i>Triticum dicocccum</i> (<i>T. dicoccum</i>), and <i>Triticale</i>, was prepared using a NIR-HSI system that encompassed the wavelength ranging from 900–1700 nm. The imbalanced dataset is a common problem in the classification task, making it harder for the classifier to classify minority class data correctly. To address this issue, oversampling techniques such as Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic sampling (ADASYN) were employed. For the classification task, a 1D Convolutional Neural Network (1D-CNN), a 1D-ResNet, and four traditional machine learning models: Naive Bayes (NB), K-Nearest Neighbor (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) are utilized and compared. The performance of these models was assessed using both imbalanced and balanced datasets. The 1D-CNN model outperformed traditional machine learning models, achieving impressive test accuracies of 98.25% and 98.43% with the SMOTE and ADASYN approach, respectively. These findings underscore the efficacy of NIR-HSI in conjunction with an end-to-end 1D-CNN and oversampling techniques as a reliable and efficient method for the rapid, accurate, and nondestructive identification of various wheat species. The code is available at https://github.com/nitintyagi007-iitr/Wheat_species_classification</p></div>","PeriodicalId":655,"journal":{"name":"Journal of Nondestructive Evaluation","volume":"44 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Nondestructive Identification of Wheat Species using Deep Convolutional Networks with Oversampling Strategies on Near-Infrared Hyperspectral Imagery\",\"authors\":\"Nitin Tyagi, Sarvagya Porwal, Pradeep Singh, Balasubramanian Raman, Neerja Garg\",\"doi\":\"10.1007/s10921-024-01143-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Differentiating between wheat species poses a significant challenge to the Indian grain industry. Visual inspection of wheat species has drawbacks, including inconsistency, low throughput, and labor intensiveness. In this study, near-infrared hyperspectral imaging (NIR-HSI) was utilized in conjunction with a deep learning approach to achieve precise predictions of wheat at the species level. A dataset comprising 40 different varieties from four Indian wheat species, namely <i>Triticum aestivum</i> (<i>T. aestivum</i>), <i>Triticum durum</i> (<i>T. durum</i>), <i>Triticum dicocccum</i> (<i>T. dicoccum</i>), and <i>Triticale</i>, was prepared using a NIR-HSI system that encompassed the wavelength ranging from 900–1700 nm. The imbalanced dataset is a common problem in the classification task, making it harder for the classifier to classify minority class data correctly. To address this issue, oversampling techniques such as Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic sampling (ADASYN) were employed. For the classification task, a 1D Convolutional Neural Network (1D-CNN), a 1D-ResNet, and four traditional machine learning models: Naive Bayes (NB), K-Nearest Neighbor (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) are utilized and compared. The performance of these models was assessed using both imbalanced and balanced datasets. The 1D-CNN model outperformed traditional machine learning models, achieving impressive test accuracies of 98.25% and 98.43% with the SMOTE and ADASYN approach, respectively. These findings underscore the efficacy of NIR-HSI in conjunction with an end-to-end 1D-CNN and oversampling techniques as a reliable and efficient method for the rapid, accurate, and nondestructive identification of various wheat species. The code is available at https://github.com/nitintyagi007-iitr/Wheat_species_classification</p></div>\",\"PeriodicalId\":655,\"journal\":{\"name\":\"Journal of Nondestructive Evaluation\",\"volume\":\"44 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Nondestructive Evaluation\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10921-024-01143-z\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, CHARACTERIZATION & TESTING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nondestructive Evaluation","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s10921-024-01143-z","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, CHARACTERIZATION & TESTING","Score":null,"Total":0}
引用次数: 0
摘要
区分小麦品种对印度粮食工业构成了重大挑战。小麦品种目测检测存在不一致、产量低、劳动强度大等缺点。在这项研究中,近红外高光谱成像(NIR-HSI)与深度学习方法相结合,在品种水平上实现了小麦的精确预测。利用NIR-HSI系统制备了一个数据集,该数据集包括来自4个印度小麦品种的40个不同品种,即Triticum aestivum (T. aestivum)、Triticum durum (T. durum)、Triticum dicoccum (T. dicoccum)和Triticale,其波长范围为900-1700 nm。数据集不平衡是分类任务中常见的问题,使得分类器难以正确分类少数类数据。为了解决这个问题,采用了合成少数过采样技术(SMOTE)和自适应合成采样(ADASYN)等过采样技术。对于分类任务,使用1D卷积神经网络(1D- cnn), 1D- resnet和四种传统机器学习模型:朴素贝叶斯(NB), k -近邻(KNN),随机森林(RF)和极端梯度增强(XGBoost)进行比较。使用不平衡和平衡数据集评估这些模型的性能。1D-CNN模型优于传统的机器学习模型,使用SMOTE和ADASYN方法分别实现了令人印象深刻的98.25%和98.43%的测试准确率。这些发现强调了NIR-HSI结合端到端1D-CNN和过采样技术作为快速、准确和无损鉴定各种小麦品种的可靠有效方法的有效性。代码可在https://github.com/nitintyagi007-iitr/Wheat_species_classification上获得
Nondestructive Identification of Wheat Species using Deep Convolutional Networks with Oversampling Strategies on Near-Infrared Hyperspectral Imagery
Differentiating between wheat species poses a significant challenge to the Indian grain industry. Visual inspection of wheat species has drawbacks, including inconsistency, low throughput, and labor intensiveness. In this study, near-infrared hyperspectral imaging (NIR-HSI) was utilized in conjunction with a deep learning approach to achieve precise predictions of wheat at the species level. A dataset comprising 40 different varieties from four Indian wheat species, namely Triticum aestivum (T. aestivum), Triticum durum (T. durum), Triticum dicocccum (T. dicoccum), and Triticale, was prepared using a NIR-HSI system that encompassed the wavelength ranging from 900–1700 nm. The imbalanced dataset is a common problem in the classification task, making it harder for the classifier to classify minority class data correctly. To address this issue, oversampling techniques such as Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic sampling (ADASYN) were employed. For the classification task, a 1D Convolutional Neural Network (1D-CNN), a 1D-ResNet, and four traditional machine learning models: Naive Bayes (NB), K-Nearest Neighbor (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) are utilized and compared. The performance of these models was assessed using both imbalanced and balanced datasets. The 1D-CNN model outperformed traditional machine learning models, achieving impressive test accuracies of 98.25% and 98.43% with the SMOTE and ADASYN approach, respectively. These findings underscore the efficacy of NIR-HSI in conjunction with an end-to-end 1D-CNN and oversampling techniques as a reliable and efficient method for the rapid, accurate, and nondestructive identification of various wheat species. The code is available at https://github.com/nitintyagi007-iitr/Wheat_species_classification
期刊介绍:
Journal of Nondestructive Evaluation provides a forum for the broad range of scientific and engineering activities involved in developing a quantitative nondestructive evaluation (NDE) capability. This interdisciplinary journal publishes papers on the development of new equipment, analyses, and approaches to nondestructive measurements.