{"title":"Feasibility Study on Identifying Seed Variety of Soybean With Hyperspectral Imaging and Deep Learning","authors":"Lei Pang, Zhen Wang, Siyan Mi, Hui Li","doi":"10.1002/cem.70035","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Seed variety purity is an important indicator of seed quality, and mixing soybean seeds at different maturity stages can affect crop growth and food quality. This study investigated the feasibility of recognizing five soybean varieties at different maturity stages using hyperspectral imaging. Hyperspectral data from 3600 soybean seeds were collected in the range of 395.5–1003.7 nm. First, the potential to qualitatively distinguish the five soybean varieties was assessed using visual cluster analyses based on principal component analysis (PCA), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP). Next, the performance of four classification models—random forest (RF), extreme learning machine (ELM), partial least squares discriminant analysis (PLS-DA), and one-dimensional convolutional neural network (1DCNN)—was compared. Multiplicative scatter correction (MSC) preprocessing significantly improved the recognition effect of all four models, with the 1DCNN model demonstrating the highest accuracy and most stable recognition performance. The effects of feature bands extracted using competitive adaptive reweighted sampling (CARS), variable importance in projection (VIP), and local linear embedding (LLE) on the four models were also compared. The accuracy of all four feature band sets, when combined with the MSC+1DCNN model, exceeded 96% in identifying soybean varieties. Therefore, these results indicate that the 1DCNN discriminant analysis model is suitable for spectral data analysis in soybean seed variety classification and can significantly enhance classification accuracy.</p>\n </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 5","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cem.70035","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}
引用次数: 0
Abstract
Seed variety purity is an important indicator of seed quality, and mixing soybean seeds at different maturity stages can affect crop growth and food quality. This study investigated the feasibility of recognizing five soybean varieties at different maturity stages using hyperspectral imaging. Hyperspectral data from 3600 soybean seeds were collected in the range of 395.5–1003.7 nm. First, the potential to qualitatively distinguish the five soybean varieties was assessed using visual cluster analyses based on principal component analysis (PCA), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP). Next, the performance of four classification models—random forest (RF), extreme learning machine (ELM), partial least squares discriminant analysis (PLS-DA), and one-dimensional convolutional neural network (1DCNN)—was compared. Multiplicative scatter correction (MSC) preprocessing significantly improved the recognition effect of all four models, with the 1DCNN model demonstrating the highest accuracy and most stable recognition performance. The effects of feature bands extracted using competitive adaptive reweighted sampling (CARS), variable importance in projection (VIP), and local linear embedding (LLE) on the four models were also compared. The accuracy of all four feature band sets, when combined with the MSC+1DCNN model, exceeded 96% in identifying soybean varieties. Therefore, these results indicate that the 1DCNN discriminant analysis model is suitable for spectral data analysis in soybean seed variety classification and can significantly enhance classification accuracy.
期刊介绍:
The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.