Wine quality analysis by the structural causal model (SCM)

Ž. Kurtanjek
{"title":"Wine quality analysis by the structural causal model (SCM)","authors":"Ž. Kurtanjek","doi":"10.17508/cjfst.2023.15.2.05","DOIUrl":null,"url":null,"abstract":"Bayes network modelling for structural causal analysis between wine physicochemical data and quantitative human quality blind assessments is applied. The large dataset of white and red \"Vinho Verde'' wine samples from Portugal, which was available from an open data repository for machine learning at the University of California at Irving, was analysed. The dataset contains 4898 white and 1599 red samples evaluated by blind tastes by a minimum of 3 sensory assessors and 12 physicochemical properties. The casual effects of wine analytic data on human quality evaluations are evaluated numerically by Bayes neural networks for adjusted sets of the covariates as marginal distributions and presented graphically as partial dependence plots. Structural causal analysis revealed important differences between the most important variables for quality predictions and the individual causal effects. Bayes neural network models of the partial dependencies show more pronounced nonlinear effects for red wines compared to white wine quality. The artificial intelligence models with boosted random decision tree forests for untrained wine samples yield a 5% relative standard error of predictions compared to 12% for the linear models and ordinary least squares estimation. For red wine, the most important direct causal quality effects are caused by alcohol, volatile acidity, and sulphates. Alcohol improves quality with a maximum plateau at 14%, while volatile acidity has a strong proportional negative effect. The effect of sulphates is highly nonlinear with maximum positive effect at a concentration of 1 g/L of K2SO4. For the white wine samples causal effects are linear with positive effects of alcohol and negative effects of volatile and fixed acidity. The developed structural causal model enables evaluation of targeted wine production interventions, named as “doing x, do(x) models”, as restructured adjusted Bayes networks. It leads to potential applications of artificial intelligence in wine production technology and process quality control.","PeriodicalId":10771,"journal":{"name":"Croatian journal of food science and technology","volume":"171 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Croatian journal of food science and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17508/cjfst.2023.15.2.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bayes network modelling for structural causal analysis between wine physicochemical data and quantitative human quality blind assessments is applied. The large dataset of white and red "Vinho Verde'' wine samples from Portugal, which was available from an open data repository for machine learning at the University of California at Irving, was analysed. The dataset contains 4898 white and 1599 red samples evaluated by blind tastes by a minimum of 3 sensory assessors and 12 physicochemical properties. The casual effects of wine analytic data on human quality evaluations are evaluated numerically by Bayes neural networks for adjusted sets of the covariates as marginal distributions and presented graphically as partial dependence plots. Structural causal analysis revealed important differences between the most important variables for quality predictions and the individual causal effects. Bayes neural network models of the partial dependencies show more pronounced nonlinear effects for red wines compared to white wine quality. The artificial intelligence models with boosted random decision tree forests for untrained wine samples yield a 5% relative standard error of predictions compared to 12% for the linear models and ordinary least squares estimation. For red wine, the most important direct causal quality effects are caused by alcohol, volatile acidity, and sulphates. Alcohol improves quality with a maximum plateau at 14%, while volatile acidity has a strong proportional negative effect. The effect of sulphates is highly nonlinear with maximum positive effect at a concentration of 1 g/L of K2SO4. For the white wine samples causal effects are linear with positive effects of alcohol and negative effects of volatile and fixed acidity. The developed structural causal model enables evaluation of targeted wine production interventions, named as “doing x, do(x) models”, as restructured adjusted Bayes networks. It leads to potential applications of artificial intelligence in wine production technology and process quality control.
通过结构因果模型(SCM)进行葡萄酒质量分析
应用贝叶斯网络模型对葡萄酒理化数据和定量人类质量盲评之间的因果关系进行结构分析。分析了来自葡萄牙的 "Vinho Verde "白葡萄酒和红葡萄酒样本的大型数据集,该数据集可从加利福尼亚大学欧文分校的机器学习开放数据存储库中获取。该数据集包含 4898 个白葡萄酒样本和 1599 个红葡萄酒样本,由至少 3 位感官评估员通过盲品和 12 种理化特性进行评估。葡萄酒分析数据对人类质量评价的偶然影响是通过贝叶斯神经网络以边际分布的方式对调整后的协变量集进行数值评估的,并以部分依赖图的方式进行图形展示。结构因果分析揭示了对质量预测最重要的变量与单个因果效应之间的重要差异。贝叶斯神经网络部分依赖关系模型显示,与白葡萄酒质量相比,红葡萄酒的非线性效应更为明显。对于未经训练的葡萄酒样本,采用提升随机决策树森林的人工智能模型得出的预测结果的相对标准误差为 5%,而线性模型和普通最小二乘法估计得出的预测结果的相对标准误差为 12%。对于红葡萄酒来说,最重要的直接质量影响因素是酒精、挥发性酸度和硫酸盐。酒精对质量的改善作用在 14% 时达到最大值,而挥发性酸度则有很强的比例性负面影响。硫酸盐的影响是高度非线性的,当 K2SO4 浓度为 1 克/升时,正效应最大。白葡萄酒样品的因果效应是线性的,酒精有正效应,挥发性酸度和固定酸度有负效应。所开发的结构因果模型可以评估有针对性的葡萄酒生产干预措施,被命名为 "做 x,做(x)模型",作为重组调整贝叶斯网络。这为人工智能在葡萄酒生产技术和过程质量控制方面的应用提供了可能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
16
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信