基于可解释集成学习方法的傅里叶变换近红外光谱检测玉米油中矿物油污染

IF 4 2区 农林科学 Q2 CHEMISTRY, APPLIED
Jihong Deng , Hui Jiang , Quansheng Chen
{"title":"基于可解释集成学习方法的傅里叶变换近红外光谱检测玉米油中矿物油污染","authors":"Jihong Deng ,&nbsp;Hui Jiang ,&nbsp;Quansheng Chen","doi":"10.1016/j.jfca.2025.107594","DOIUrl":null,"url":null,"abstract":"<div><div>Corn oil is rich in unsaturated fatty acids and antioxidants, which contribute to cardiovascular health. This has led to its widespread use in food processing and cooking. However, during the production, transportation, and storage of corn oil, it can be exposed to mineral oil contamination. Therefore, ensuring the quality and safety of corn oil is crucial. At the same time, there is growing attention on developing rapid and environmentally friendly analytical monitoring tools to screen edible oils for impurities, ensuring their quality. This study introduced an innovative method that combines explainable artificial intelligence with Fourier Transform Near-Infrared Spectroscopy (FT-NIR) to detect mineral oil contaminants in corn oil. Five types of mineral oils were selected as potential pollutants, and spectral data from contaminated and uncontaminated corn oil samples were collected. Partial Least Squares Discriminant Analysis (PLS-DA) and ensemble learning methods, AdaBoost, XGBoost, LightGBM, and CatBoost, were applied to address two qualitative objectives. The results showed that PLS-DA effectively captured the spectral differences between normal and contaminated samples, achieving 100 % classification accuracy. Following this, four classifiers were developed using spectral data selected by Competitive Adaptive Reweighted Sampling (CARS) to identify specific contaminants in corn oil. LightGBM demonstrated the best performance, achieving 100 % accuracy, precision, recall, and F1 score across all contaminant categories. The Shapley Additive Explanations (SHAP) algorithm was also used to enhance model interpretability. This algorithm identified the key spectral wavelengths contributing to the classification of each contaminant category. The findings demonstrate that combining FT-NIR, feature selection, and explainable models provides a fast, accurate, and environmentally friendly method for assessing the quality of corn oil. This approach improves contamination detection and enhances consumer confidence in edible oil products. Future work should focus on extending this method to other edible oil safety applications and integrating it into real-time on-site monitoring systems for edible oil production.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"143 ","pages":"Article 107594"},"PeriodicalIF":4.0000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Fourier Transform Near-infrared Spectroscopy with Explainable Ensemble Learning Methods for Detecting Mineral Oil Contamination in Corn Oil\",\"authors\":\"Jihong Deng ,&nbsp;Hui Jiang ,&nbsp;Quansheng Chen\",\"doi\":\"10.1016/j.jfca.2025.107594\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Corn oil is rich in unsaturated fatty acids and antioxidants, which contribute to cardiovascular health. This has led to its widespread use in food processing and cooking. However, during the production, transportation, and storage of corn oil, it can be exposed to mineral oil contamination. Therefore, ensuring the quality and safety of corn oil is crucial. At the same time, there is growing attention on developing rapid and environmentally friendly analytical monitoring tools to screen edible oils for impurities, ensuring their quality. This study introduced an innovative method that combines explainable artificial intelligence with Fourier Transform Near-Infrared Spectroscopy (FT-NIR) to detect mineral oil contaminants in corn oil. Five types of mineral oils were selected as potential pollutants, and spectral data from contaminated and uncontaminated corn oil samples were collected. Partial Least Squares Discriminant Analysis (PLS-DA) and ensemble learning methods, AdaBoost, XGBoost, LightGBM, and CatBoost, were applied to address two qualitative objectives. The results showed that PLS-DA effectively captured the spectral differences between normal and contaminated samples, achieving 100 % classification accuracy. Following this, four classifiers were developed using spectral data selected by Competitive Adaptive Reweighted Sampling (CARS) to identify specific contaminants in corn oil. LightGBM demonstrated the best performance, achieving 100 % accuracy, precision, recall, and F1 score across all contaminant categories. The Shapley Additive Explanations (SHAP) algorithm was also used to enhance model interpretability. This algorithm identified the key spectral wavelengths contributing to the classification of each contaminant category. The findings demonstrate that combining FT-NIR, feature selection, and explainable models provides a fast, accurate, and environmentally friendly method for assessing the quality of corn oil. This approach improves contamination detection and enhances consumer confidence in edible oil products. Future work should focus on extending this method to other edible oil safety applications and integrating it into real-time on-site monitoring systems for edible oil production.</div></div>\",\"PeriodicalId\":15867,\"journal\":{\"name\":\"Journal of Food Composition and Analysis\",\"volume\":\"143 \",\"pages\":\"Article 107594\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-04-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Food Composition and Analysis\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0889157525004090\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157525004090","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

玉米油富含不饱和脂肪酸和抗氧化剂,有助于心血管健康。这导致了它在食品加工和烹饪中的广泛应用。然而,在玉米油的生产、运输和储存过程中,玉米油可能会受到矿物油的污染。因此,保证玉米油的质量安全至关重要。与此同时,人们越来越关注开发快速和环保的分析监测工具来筛选食用油中的杂质,以确保其质量。本研究介绍了一种将可解释人工智能与傅里叶变换近红外光谱(FT-NIR)相结合的创新方法,用于检测玉米油中的矿物油污染物。选取5种矿物油作为潜在污染物,采集了污染玉米油和未污染玉米油样品的光谱数据。偏最小二乘判别分析(PLS-DA)和集成学习方法AdaBoost, XGBoost, LightGBM和CatBoost应用于解决两个定性目标。结果表明,PLS-DA有效地捕获了正常和污染样品之间的光谱差异,分类精度达到100% %。在此之后,利用竞争自适应重加权采样(CARS)选择的光谱数据开发了四种分类器,以识别玉米油中的特定污染物。LightGBM表现出最好的性能,在所有污染物类别中达到100 %的准确度、精密度、召回率和F1分数。Shapley加性解释(SHAP)算法也被用来提高模型的可解释性。该算法确定了有助于对每种污染物类别进行分类的关键光谱波长。研究结果表明,结合FT-NIR,特征选择和可解释模型为评估玉米油的质量提供了一种快速,准确和环保的方法。这种方法改进了污染检测,增强了消费者对食用油产品的信心。未来的工作重点是将该方法扩展到其他食用油安全应用中,并将其集成到食用油生产的实时现场监控系统中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing Fourier Transform Near-infrared Spectroscopy with Explainable Ensemble Learning Methods for Detecting Mineral Oil Contamination in Corn Oil
Corn oil is rich in unsaturated fatty acids and antioxidants, which contribute to cardiovascular health. This has led to its widespread use in food processing and cooking. However, during the production, transportation, and storage of corn oil, it can be exposed to mineral oil contamination. Therefore, ensuring the quality and safety of corn oil is crucial. At the same time, there is growing attention on developing rapid and environmentally friendly analytical monitoring tools to screen edible oils for impurities, ensuring their quality. This study introduced an innovative method that combines explainable artificial intelligence with Fourier Transform Near-Infrared Spectroscopy (FT-NIR) to detect mineral oil contaminants in corn oil. Five types of mineral oils were selected as potential pollutants, and spectral data from contaminated and uncontaminated corn oil samples were collected. Partial Least Squares Discriminant Analysis (PLS-DA) and ensemble learning methods, AdaBoost, XGBoost, LightGBM, and CatBoost, were applied to address two qualitative objectives. The results showed that PLS-DA effectively captured the spectral differences between normal and contaminated samples, achieving 100 % classification accuracy. Following this, four classifiers were developed using spectral data selected by Competitive Adaptive Reweighted Sampling (CARS) to identify specific contaminants in corn oil. LightGBM demonstrated the best performance, achieving 100 % accuracy, precision, recall, and F1 score across all contaminant categories. The Shapley Additive Explanations (SHAP) algorithm was also used to enhance model interpretability. This algorithm identified the key spectral wavelengths contributing to the classification of each contaminant category. The findings demonstrate that combining FT-NIR, feature selection, and explainable models provides a fast, accurate, and environmentally friendly method for assessing the quality of corn oil. This approach improves contamination detection and enhances consumer confidence in edible oil products. Future work should focus on extending this method to other edible oil safety applications and integrating it into real-time on-site monitoring systems for edible oil production.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Food Composition and Analysis
Journal of Food Composition and Analysis 工程技术-食品科技
CiteScore
6.20
自引率
11.60%
发文量
601
审稿时长
53 days
期刊介绍: The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects. The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信