{"title":"基于荧光光谱数据的机器学习韭菜种子分类","authors":"Ewa Ropelewska, Kadir Sabanci, Vanya Slavova, Stefka Genova","doi":"10.1007/s00217-023-04361-3","DOIUrl":null,"url":null,"abstract":"<div><p>The objective of this study was to distinguish leek seeds belonging to the <i>Starozagorski kamush</i> variety and two breeding lines based on the selected fluorescence spectroscopic data. The classification models were developed for three classes of <i>Starozagorski kamush</i> vs. breeding line 4 vs. breeding line 39 and pairs of classes of <i>Starozagorski kamush</i> vs. breeding line 4, <i>Starozagorski kamush</i> vs. breeding line 39, and breeding line 4 vs. breeding line 39. The traditional machine learning algorithms, such as PART, Logistic, Naive Bayes, Random Forest, IBk, and Filtered Classifier were applied. All three classes were distinguished with an average accuracy of up to 93.33% for models built using IBk and Filtered Classifier. In the case of each model, <i>Starozagorski kamush</i> variety was completely different (accuracy of 100%, precision, and F-measure, MCC (Matthews correlation coefficient), and ROC (receiver operating characteristic) area of 1.000) from breeding lines, and the mixing of cases was observed between breeding line 4 and breeding line 39. The models built for pairs of leek seed classes distinguished <i>Starozagorski kamush</i> and breeding line 4 with an average accuracy reaching 100% (Logistic, Naive Bayes, Random Forest, IBk). The classification accuracy of <i>Starozagorski kamush</i> and breeding line 39 also reached 100% (Logistic, Naive Bayes, Random Forest, IBk), whereas breeding line 4 and breeding line 39 were classified with an average accuracy of up to 80% (Logistic, Naive Bayes, Random Forest, Filtered Classifier). The proposed approach combining fluorescence spectroscopy and machine learning may be used in practice to distinguish leek seed varieties and breeding lines.</p></div>","PeriodicalId":549,"journal":{"name":"European Food Research and Technology","volume":"249 12","pages":"3217 - 3226"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00217-023-04361-3.pdf","citationCount":"0","resultStr":"{\"title\":\"The classification of leek seeds based on fluorescence spectroscopic data using machine learning\",\"authors\":\"Ewa Ropelewska, Kadir Sabanci, Vanya Slavova, Stefka Genova\",\"doi\":\"10.1007/s00217-023-04361-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The objective of this study was to distinguish leek seeds belonging to the <i>Starozagorski kamush</i> variety and two breeding lines based on the selected fluorescence spectroscopic data. The classification models were developed for three classes of <i>Starozagorski kamush</i> vs. breeding line 4 vs. breeding line 39 and pairs of classes of <i>Starozagorski kamush</i> vs. breeding line 4, <i>Starozagorski kamush</i> vs. breeding line 39, and breeding line 4 vs. breeding line 39. The traditional machine learning algorithms, such as PART, Logistic, Naive Bayes, Random Forest, IBk, and Filtered Classifier were applied. All three classes were distinguished with an average accuracy of up to 93.33% for models built using IBk and Filtered Classifier. In the case of each model, <i>Starozagorski kamush</i> variety was completely different (accuracy of 100%, precision, and F-measure, MCC (Matthews correlation coefficient), and ROC (receiver operating characteristic) area of 1.000) from breeding lines, and the mixing of cases was observed between breeding line 4 and breeding line 39. The models built for pairs of leek seed classes distinguished <i>Starozagorski kamush</i> and breeding line 4 with an average accuracy reaching 100% (Logistic, Naive Bayes, Random Forest, IBk). The classification accuracy of <i>Starozagorski kamush</i> and breeding line 39 also reached 100% (Logistic, Naive Bayes, Random Forest, IBk), whereas breeding line 4 and breeding line 39 were classified with an average accuracy of up to 80% (Logistic, Naive Bayes, Random Forest, Filtered Classifier). The proposed approach combining fluorescence spectroscopy and machine learning may be used in practice to distinguish leek seed varieties and breeding lines.</p></div>\",\"PeriodicalId\":549,\"journal\":{\"name\":\"European Food Research and Technology\",\"volume\":\"249 12\",\"pages\":\"3217 - 3226\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2023-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s00217-023-04361-3.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Food Research and Technology\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00217-023-04361-3\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Food Research and Technology","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1007/s00217-023-04361-3","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
The classification of leek seeds based on fluorescence spectroscopic data using machine learning
The objective of this study was to distinguish leek seeds belonging to the Starozagorski kamush variety and two breeding lines based on the selected fluorescence spectroscopic data. The classification models were developed for three classes of Starozagorski kamush vs. breeding line 4 vs. breeding line 39 and pairs of classes of Starozagorski kamush vs. breeding line 4, Starozagorski kamush vs. breeding line 39, and breeding line 4 vs. breeding line 39. The traditional machine learning algorithms, such as PART, Logistic, Naive Bayes, Random Forest, IBk, and Filtered Classifier were applied. All three classes were distinguished with an average accuracy of up to 93.33% for models built using IBk and Filtered Classifier. In the case of each model, Starozagorski kamush variety was completely different (accuracy of 100%, precision, and F-measure, MCC (Matthews correlation coefficient), and ROC (receiver operating characteristic) area of 1.000) from breeding lines, and the mixing of cases was observed between breeding line 4 and breeding line 39. The models built for pairs of leek seed classes distinguished Starozagorski kamush and breeding line 4 with an average accuracy reaching 100% (Logistic, Naive Bayes, Random Forest, IBk). The classification accuracy of Starozagorski kamush and breeding line 39 also reached 100% (Logistic, Naive Bayes, Random Forest, IBk), whereas breeding line 4 and breeding line 39 were classified with an average accuracy of up to 80% (Logistic, Naive Bayes, Random Forest, Filtered Classifier). The proposed approach combining fluorescence spectroscopy and machine learning may be used in practice to distinguish leek seed varieties and breeding lines.
期刊介绍:
The journal European Food Research and Technology publishes state-of-the-art research papers and review articles on fundamental and applied food research. The journal''s mission is the fast publication of high quality papers on front-line research, newest techniques and on developing trends in the following sections:
-chemistry and biochemistry-
technology and molecular biotechnology-
nutritional chemistry and toxicology-
analytical and sensory methodologies-
food physics.
Out of the scope of the journal are:
- contributions which are not of international interest or do not have a substantial impact on food sciences,
- submissions which comprise merely data collections, based on the use of routine analytical or bacteriological methods,
- contributions reporting biological or functional effects without profound chemical and/or physical structure characterization of the compound(s) under research.