Bagging and boosting machine learning algorithms for modelling sensory perception from simple chemical variables: Wine mouthfeel as a case study

IF 4.9 · Zone 1 (Agricultural and Forestry Sciences) · JCR Q1 (Food Science & Technology)
María-Pilar Sáenz-Navajas, Chelo Ferreira, Susan E.P. Bastian, David W. Jeffery
Food Quality and Preference, Volume 129, Article 105494. Published 2025-03-01. DOI: 10.1016/j.foodqual.2025.105494
https://www.sciencedirect.com/science/article/pii/S0950329325000692
Citations: 0

Abstract

Aiming to predict sensory properties from chemical data, the application of bagging and boosting machine learning (ML) algorithms was comprehensively investigated and applied to modelling of red wine mouthfeel from simple chemical measurements. A panel of 15 Australian winemakers described the mouthfeel properties of a total of 30 commercial red wines from Australia and Spain using rate-all-that-apply sensory methodology. In parallel, linear sweep voltammetry signals and excitation-emission matrix (EEM) and absorbance data were acquired for the wines. Data were analysed following unsupervised statistical strategies including principal component analysis (PCA with varimax rotation) to simplify the interpretation of sensory variables, along with supervised regression models based on ML, namely random forest (RF) and extreme gradient boosting (XGBoost). PCA results showed that four independent and uncorrelated sensory dimensions mainly related to perceptions of ‘drying’, ‘full body’, ‘velvety’, and ‘gummy’ differentiated among the wines. The RF and XGBoost algorithms yielded superior validated regression models compared to classical partial least squares (PLS) modelling. The ML algorithms exhibited strong predictive performance on test data, with an average value exceeding 80% accuracy for any of the three sets of chemical variables employed. Although XGBoost provided slightly better models, the low computational effort required by RF is advantageous. Key variables included in the models are discussed along with the importance of controlling overfitting. Overall, absorbance, voltammetric or EEM signals coupled with RF or XGBoost algorithms are presented as cheap, easy-to-use, and rapid approaches to predicting sensory properties from chemical signals in complex matrices such as wine.
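
The bagging and boosting ideas named in the abstract can be illustrated with a toy sketch. This is not the authors' pipeline: the data are synthetic (30 samples, loosely mirroring the 30 wines), the base learner is a one-split regression stump rather than a full tree, and the helper names `fit_stump`, `fit_bagging`, `fit_boosting`, and `rmse` are hypothetical. It uses only the Python standard library; a real analysis would more likely rely on library implementations of RF and XGBoost.

```python
import random
from statistics import mean


def fit_stump(X, y):
    """Fit a one-split regression stump that minimises squared error."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        # Split at each distinct value except the largest, so both sides are non-empty.
        for t in values[:-1]:
            left = [yi for row, yi in zip(X, y) if row[j] <= t]
            right = [yi for row, yi in zip(X, y) if row[j] > t]
            ml, mr = mean(left), mean(right)
            sse = sum((v - ml) ** 2 for v in left) + sum((v - mr) ** 2 for v in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, ml, mr)
    if best is None:  # no valid split (all rows identical): predict the mean
        m = mean(y)
        return lambda row: m
    _, j, t, ml, mr = best
    return lambda row: ml if row[j] <= t else mr


def fit_bagging(X, y, n_estimators=50, rng=None):
    """Bagging: average stumps fitted on bootstrap resamples of the data."""
    rng = rng or random.Random(0)
    n = len(X)
    stumps = []
    for _ in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]  # sample rows with replacement
        stumps.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    return lambda row: mean(s(row) for s in stumps)


def fit_boosting(X, y, n_estimators=50, learning_rate=0.1):
    """Gradient boosting for squared error: each stump fits the current residuals."""
    base = mean(y)
    residuals = [yi - base for yi in y]
    stumps = []
    for _ in range(n_estimators):
        s = fit_stump(X, residuals)
        stumps.append(s)
        residuals = [r - learning_rate * s(row) for r, row in zip(residuals, X)]
    return lambda row: base + learning_rate * sum(s(row) for s in stumps)


def rmse(model, X, y):
    """Root-mean-square error of a model on a data set."""
    return (sum((model(r) - yi) ** 2 for r, yi in zip(X, y)) / len(y)) ** 0.5


# Synthetic stand-in data: 3 "chemical" features, 1 "sensory" score (assumption).
rng = random.Random(42)
X = [[rng.random() for _ in range(3)] for _ in range(30)]
y = [2.0 * r[0] + (1.0 if r[1] > 0.5 else 0.0) + rng.gauss(0, 0.1) for r in X]
X_train, y_train, X_test, y_test = X[:22], y[:22], X[22:], y[22:]

bagged = fit_bagging(X_train, y_train, rng=rng)
boosted = fit_boosting(X_train, y_train)
baseline = lambda row: mean(y_train)  # always predicts the training mean

for name, model in [("baseline", baseline), ("bagging", bagged), ("boosting", boosted)]:
    print(name, "test RMSE:", round(rmse(model, X_test, y_test), 3))
```

Bagging fits each learner independently on a resampled data set and averages the results, which mainly reduces variance. Boosting fits learners sequentially on the errors left by the previous ones, which is why the abstract's caution about controlling overfitting applies in particular to the number of boosting rounds and the learning rate.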
Source journal: Food Quality and Preference (Engineering & Technology: Food Science & Technology)
CiteScore: 10.40
Self-citation rate: 15.10%
Articles published: 263
Review time: 38 days
About the journal: Food Quality and Preference is a journal devoted to sensory, consumer and behavioural research in food and non-food products. It publishes original research, critical reviews, and short communications in sensory and consumer science, and sensometrics. In addition, the journal publishes special invited issues on important timely topics and from relevant conferences. These are aimed at bridging the gap between research and application, bringing together authors and readers in consumer and market research, sensory science, sensometrics and sensory evaluation, nutrition and food choice, as well as food research, product development and sensory quality assurance. Submissions to Food Quality and Preference are limited to papers that include some form of human measurement; papers that are limited to physical/chemical measures or the routine application of sensory, consumer or econometric analysis will not be considered unless they specifically make a novel scientific contribution in line with the journal's coverage.