非线性机器学习耦合近红外光谱增强了咖啡原产地溯源模型的性能和洞察力

IF 1.6 4区 化学 Q3 CHEMISTRY, APPLIED
Joy Sim, Cushla McGoverin, Indrawati Oey, Russell Frew, Biniam Kebede
{"title":"非线性机器学习耦合近红外光谱增强了咖啡原产地溯源模型的性能和洞察力","authors":"Joy Sim, Cushla McGoverin, Indrawati Oey, Russell Frew, Biniam Kebede","doi":"10.1177/09670335241269014","DOIUrl":null,"url":null,"abstract":"Over the past decade, there has been overwhelming interest in rapid and routine origin tracing and authentication methods, such as near infrared (NIR) spectroscopy. In a systematic and comprehensive approach, this study coupled NIR with advanced machine learning models to explore the origin classification of coffee at various scales (continental to regional level). Speciality green coffee beans were sourced from three continents, eight countries, and 22 regions. The dispersive bulk NIR spectra were used for spectral registration in the reflectance mode, and the obtained spectra were preprocessed with extended multiplicative scatter correction and mean centering. The classical linear partial least squares-discriminant analysis (PLS-DA) adequately predicted origin at the continental and country level, and showed promise at the regional level. Non-linear machine learning models improved predictions further, with the best accuracy found using random forest with accuracies up to 0.99. Discriminating wavelength regions and constituents were identified at each origin scale, with more minor wavelength regions selected by random forest. This proof of concept work demonstrated the potential of NIR spectroscopy coupled with machine learning for rapid origin classification of coffee from the continental to the regional level.","PeriodicalId":16551,"journal":{"name":"Journal of Near Infrared Spectroscopy","volume":"43 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Non-linear machine learning coupled near infrared spectroscopy enhanced model performance and insights for coffee origin traceability\",\"authors\":\"Joy Sim, Cushla McGoverin, Indrawati Oey, Russell Frew, Biniam Kebede\",\"doi\":\"10.1177/09670335241269014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the past decade, there has been overwhelming interest in rapid and routine origin tracing and authentication methods, such as near infrared (NIR) spectroscopy. In a systematic and comprehensive approach, this study coupled NIR with advanced machine learning models to explore the origin classification of coffee at various scales (continental to regional level). Speciality green coffee beans were sourced from three continents, eight countries, and 22 regions. The dispersive bulk NIR spectra were used for spectral registration in the reflectance mode, and the obtained spectra were preprocessed with extended multiplicative scatter correction and mean centering. The classical linear partial least squares-discriminant analysis (PLS-DA) adequately predicted origin at the continental and country level, and showed promise at the regional level. Non-linear machine learning models improved predictions further, with the best accuracy found using random forest with accuracies up to 0.99. Discriminating wavelength regions and constituents were identified at each origin scale, with more minor wavelength regions selected by random forest. This proof of concept work demonstrated the potential of NIR spectroscopy coupled with machine learning for rapid origin classification of coffee from the continental to the regional level.\",\"PeriodicalId\":16551,\"journal\":{\"name\":\"Journal of Near Infrared Spectroscopy\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Near Infrared Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1177/09670335241269014\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Near Infrared Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1177/09670335241269014","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

在过去的十年中,人们对快速和常规的原产地追踪和鉴定方法(如近红外光谱法)产生了极大的兴趣。本研究采用系统而全面的方法,将近红外光谱与先进的机器学习模型相结合,探索不同尺度(从大陆到地区)的咖啡原产地分类。特种绿色咖啡豆来自三大洲、八个国家和 22 个地区。色散大块近红外光谱用于反射模式下的光谱配准,获得的光谱经过扩展乘法散度校正和均值居中预处理。经典的线性偏最小二乘判别分析(PLS-DA)可充分预测大陆和国家层面的原产地,并在区域层面显示出前景。非线性机器学习模型进一步提高了预测结果,其中使用随机森林的预测准确率最高,可达 0.99。在每个起源尺度上都确定了可区分的波长区域和成分,随机森林选择了更多的次要波长区域。这项概念验证工作证明了近红外光谱与机器学习相结合,在从大陆到地区一级对咖啡进行快速原产地分类方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Non-linear machine learning coupled near infrared spectroscopy enhanced model performance and insights for coffee origin traceability
Over the past decade, there has been overwhelming interest in rapid and routine origin tracing and authentication methods, such as near infrared (NIR) spectroscopy. In a systematic and comprehensive approach, this study coupled NIR with advanced machine learning models to explore the origin classification of coffee at various scales (continental to regional level). Speciality green coffee beans were sourced from three continents, eight countries, and 22 regions. The dispersive bulk NIR spectra were used for spectral registration in the reflectance mode, and the obtained spectra were preprocessed with extended multiplicative scatter correction and mean centering. The classical linear partial least squares-discriminant analysis (PLS-DA) adequately predicted origin at the continental and country level, and showed promise at the regional level. Non-linear machine learning models improved predictions further, with the best accuracy found using random forest with accuracies up to 0.99. Discriminating wavelength regions and constituents were identified at each origin scale, with more minor wavelength regions selected by random forest. This proof of concept work demonstrated the potential of NIR spectroscopy coupled with machine learning for rapid origin classification of coffee from the continental to the regional level.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.30
自引率
5.60%
发文量
35
审稿时长
6 months
期刊介绍: JNIRS — Journal of Near Infrared Spectroscopy is a peer reviewed journal, publishing original research papers, short communications, review articles and letters concerned with near infrared spectroscopy and technology, its application, new instrumentation and the use of chemometric and data handling techniques within NIR.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信