Wan Yi Li , Mei Di Cheng , Ting Wu , Jun Quan Lin , Li Lin , Ling Yang , Juan Zou
{"title":"Machine learning-based malachite green and leucomalachite green content in large-mouth bass prediction using Raman spectroscopy","authors":"Wan Yi Li , Mei Di Cheng , Ting Wu , Jun Quan Lin , Li Lin , Ling Yang , Juan Zou","doi":"10.1016/j.jfca.2025.108052","DOIUrl":null,"url":null,"abstract":"<div><div>Carcinogenic malachite green (MG) and its metabolite leucomalachite green (LMG) in aquatic products present significant food safety risks. Conventional detection methods for MG and LMG, such as the liquid chromatography-tandem mass spectrometry (LC-MS/MS), are often time-consuming and destructive due to their laborious multi-step sample extraction and purification procedures, thereby hindering rapid screening in the aquaculture industry. This study addresses this need by developing a non-destructive method using Raman spectroscopy combined with machine learning to predict MG and LMG concentrations in large-mouth bass muscle tissue. Muscle samples were analyzed at multiple time points (0–480 min) post-MG/LMG exposure. Among evaluated preprocessing techniques (Savitzky-Golay smoothing, multiple scatter correction, standard normal variate, normalization), Savitzky-Golay smoothing proved most effective. Four machine learning regression models were tested: Partial Least-Squares Regression (PLSR), Random Forest Regression (RFR), XGBoost, and Back Propagation Neural Network (BPNN). RFR yielded the best predictions for MG (validation R² = 0.8090), while PLSR was optimal for LMG (validation R² = 0.8435). This Raman/ML approach enables rapid, non-invasive quantification of these banned residues, offering a practical tool for enhanced food safety monitoring in aquaculture.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"147 ","pages":"Article 108052"},"PeriodicalIF":4.0000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157525008671","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Carcinogenic malachite green (MG) and its metabolite leucomalachite green (LMG) in aquatic products present significant food safety risks. Conventional detection methods for MG and LMG, such as the liquid chromatography-tandem mass spectrometry (LC-MS/MS), are often time-consuming and destructive due to their laborious multi-step sample extraction and purification procedures, thereby hindering rapid screening in the aquaculture industry. This study addresses this need by developing a non-destructive method using Raman spectroscopy combined with machine learning to predict MG and LMG concentrations in large-mouth bass muscle tissue. Muscle samples were analyzed at multiple time points (0–480 min) post-MG/LMG exposure. Among evaluated preprocessing techniques (Savitzky-Golay smoothing, multiple scatter correction, standard normal variate, normalization), Savitzky-Golay smoothing proved most effective. Four machine learning regression models were tested: Partial Least-Squares Regression (PLSR), Random Forest Regression (RFR), XGBoost, and Back Propagation Neural Network (BPNN). RFR yielded the best predictions for MG (validation R² = 0.8090), while PLSR was optimal for LMG (validation R² = 0.8435). This Raman/ML approach enables rapid, non-invasive quantification of these banned residues, offering a practical tool for enhanced food safety monitoring in aquaculture.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.