{"title":"Geographical origin identification of sweet cherry based on quality traits combined with DD-SIMCA and XGBoost","authors":"Linxia Wu, Ziye Liu, Meng Wang","doi":"10.1016/j.foodchem.2025.145525","DOIUrl":null,"url":null,"abstract":"<div><div>Geographical origin identification technologies based on physical and nutritional characteristics have recently been developed and applied. This study evaluated the feasibility of identifying the geographical origin of sweet cherries using organoleptic traits and phenolic compound profiles. Data-driven soft independent modeling of class analogy (DD-SIMCA) and extreme gradient boosting (XGBoost) were applied to 170 sweet cherry samples collected in 2023 and 2024 from Beijing, Dalian, Tianshui, and Yantai, China. Measurements included transverse diameter, longitudinal diameter, fruit weight, soluble solid content, titratable acidity, organic acids, ascorbic acid, and 14 phenolic compounds. The DD-SIMCA model showed high sensitivity (98.00 %) and specificity (100.00 %). XGBoost yielded a prediction accuracy of 94.12 %, outperforming LDA (82.35 %), RF (88.24 %), and k-NN (82.35 %). Key discriminatory features included malic acid, quinic acid, citric acid, kaempferol-3-<em>O</em>-rutinoside, titratable acidity, and cyanidin-3-<em>O</em>-rutinoside. These findings indicate that DD-SIMCA and XGBoost are effective methods for the geographical origin identification of sweet cherries based on quality attributes. This approach supports quality assurance and control in regional production systems.</div></div>","PeriodicalId":318,"journal":{"name":"Food Chemistry","volume":"492 ","pages":"Article 145525"},"PeriodicalIF":9.8000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Chemistry","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0308814625027761","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Geographical origin identification technologies based on physical and nutritional characteristics have recently been developed and applied. This study evaluated the feasibility of identifying the geographical origin of sweet cherries using organoleptic traits and phenolic compound profiles. Data-driven soft independent modeling of class analogy (DD-SIMCA) and extreme gradient boosting (XGBoost) were applied to 170 sweet cherry samples collected in 2023 and 2024 from Beijing, Dalian, Tianshui, and Yantai, China. Measurements included transverse diameter, longitudinal diameter, fruit weight, soluble solid content, titratable acidity, organic acids, ascorbic acid, and 14 phenolic compounds. The DD-SIMCA model showed high sensitivity (98.00 %) and specificity (100.00 %). XGBoost yielded a prediction accuracy of 94.12 %, outperforming LDA (82.35 %), RF (88.24 %), and k-NN (82.35 %). Key discriminatory features included malic acid, quinic acid, citric acid, kaempferol-3-O-rutinoside, titratable acidity, and cyanidin-3-O-rutinoside. These findings indicate that DD-SIMCA and XGBoost are effective methods for the geographical origin identification of sweet cherries based on quality attributes. This approach supports quality assurance and control in regional production systems.
期刊介绍:
Food Chemistry publishes original research papers dealing with the advancement of the chemistry and biochemistry of foods or the analytical methods/ approach used. All papers should focus on the novelty of the research carried out.