Shengqi Yan , Xin Zhao , Qibing Zhu , Min Huang , Xinnian Guo
{"title":"A method for constructing optical detection model of wheat seed purity based on sample generation and contrastive learning strategy","authors":"Shengqi Yan , Xin Zhao , Qibing Zhu , Min Huang , Xinnian Guo","doi":"10.1016/j.jfca.2024.107022","DOIUrl":null,"url":null,"abstract":"<div><div>Seed purity is an essential indicator of seed quality, and use of optical techniques for seed purity testing has been extensively reported. Currently, most optical detection models for seed purity rely on the assumptions that the training set used to develop detection model contains all possible impurity varieties, and that the number of positive (the variety of interest) and negative samples (all possible impurities contained in that variety) in the training set is balanced. This assumption is often difficult to fulfil in practice due to the complexity of the seed production process. To address this issue, an optical detection model construction method for wheat seed purity based on sample generation and contrastive learning strategy was proposed in this study. The proposed method firstly employed a linear generation strategy to expand the impurity samples in training set, thus enhancing the diversity of impurity samples and improving the unbalance between the positive and negative samples. Thereafter, the contrastive learning loss function was introduced to train the optical detection model based on the deep convolutional neural network, so as to enhance the feature differences between positive and negative samples, and to improve the model's recognition accuracy for unknown impurity samples. Hyperspectral images of 4200 wheat seeds from six varieties were acquired, and a purity detection model for each variety was built by using the average spectra of the seed as input. The proposed method achieved average accuracy of 95.33 %, with improvements of 21.18 % over SVDD (78.67 %), 12.13 % over LSSVM (85.02 %), and 13.06 % over CNN (84.2 %), respectively. Further studies shown that the proposed method still maintains good detection accuracy under the condition that the test set contains multiply unknown impurities. The proposed method provides a feasible way for the construction of an optical detection model for seed purity in a real scenario.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"138 ","pages":"Article 107022"},"PeriodicalIF":4.0000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157524010561","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Seed purity is an essential indicator of seed quality, and use of optical techniques for seed purity testing has been extensively reported. Currently, most optical detection models for seed purity rely on the assumptions that the training set used to develop detection model contains all possible impurity varieties, and that the number of positive (the variety of interest) and negative samples (all possible impurities contained in that variety) in the training set is balanced. This assumption is often difficult to fulfil in practice due to the complexity of the seed production process. To address this issue, an optical detection model construction method for wheat seed purity based on sample generation and contrastive learning strategy was proposed in this study. The proposed method firstly employed a linear generation strategy to expand the impurity samples in training set, thus enhancing the diversity of impurity samples and improving the unbalance between the positive and negative samples. Thereafter, the contrastive learning loss function was introduced to train the optical detection model based on the deep convolutional neural network, so as to enhance the feature differences between positive and negative samples, and to improve the model's recognition accuracy for unknown impurity samples. Hyperspectral images of 4200 wheat seeds from six varieties were acquired, and a purity detection model for each variety was built by using the average spectra of the seed as input. The proposed method achieved average accuracy of 95.33 %, with improvements of 21.18 % over SVDD (78.67 %), 12.13 % over LSSVM (85.02 %), and 13.06 % over CNN (84.2 %), respectively. Further studies shown that the proposed method still maintains good detection accuracy under the condition that the test set contains multiply unknown impurities. The proposed method provides a feasible way for the construction of an optical detection model for seed purity in a real scenario.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.