{"title":"Wine quality analysis by the structural causal model (SCM)","authors":"Ž. Kurtanjek","doi":"10.17508/cjfst.2023.15.2.05","DOIUrl":null,"url":null,"abstract":"Bayes network modelling for structural causal analysis between wine physicochemical data and quantitative human quality blind assessments is applied. The large dataset of white and red \"Vinho Verde'' wine samples from Portugal, which was available from an open data repository for machine learning at the University of California at Irving, was analysed. The dataset contains 4898 white and 1599 red samples evaluated by blind tastes by a minimum of 3 sensory assessors and 12 physicochemical properties. The casual effects of wine analytic data on human quality evaluations are evaluated numerically by Bayes neural networks for adjusted sets of the covariates as marginal distributions and presented graphically as partial dependence plots. Structural causal analysis revealed important differences between the most important variables for quality predictions and the individual causal effects. Bayes neural network models of the partial dependencies show more pronounced nonlinear effects for red wines compared to white wine quality. The artificial intelligence models with boosted random decision tree forests for untrained wine samples yield a 5% relative standard error of predictions compared to 12% for the linear models and ordinary least squares estimation. For red wine, the most important direct causal quality effects are caused by alcohol, volatile acidity, and sulphates. Alcohol improves quality with a maximum plateau at 14%, while volatile acidity has a strong proportional negative effect. The effect of sulphates is highly nonlinear with maximum positive effect at a concentration of 1 g/L of K2SO4. For the white wine samples causal effects are linear with positive effects of alcohol and negative effects of volatile and fixed acidity. The developed structural causal model enables evaluation of targeted wine production interventions, named as “doing x, do(x) models”, as restructured adjusted Bayes networks. It leads to potential applications of artificial intelligence in wine production technology and process quality control.","PeriodicalId":10771,"journal":{"name":"Croatian journal of food science and technology","volume":"171 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Croatian journal of food science and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17508/cjfst.2023.15.2.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Bayes network modelling for structural causal analysis between wine physicochemical data and quantitative human quality blind assessments is applied. The large dataset of white and red "Vinho Verde'' wine samples from Portugal, which was available from an open data repository for machine learning at the University of California at Irving, was analysed. The dataset contains 4898 white and 1599 red samples evaluated by blind tastes by a minimum of 3 sensory assessors and 12 physicochemical properties. The casual effects of wine analytic data on human quality evaluations are evaluated numerically by Bayes neural networks for adjusted sets of the covariates as marginal distributions and presented graphically as partial dependence plots. Structural causal analysis revealed important differences between the most important variables for quality predictions and the individual causal effects. Bayes neural network models of the partial dependencies show more pronounced nonlinear effects for red wines compared to white wine quality. The artificial intelligence models with boosted random decision tree forests for untrained wine samples yield a 5% relative standard error of predictions compared to 12% for the linear models and ordinary least squares estimation. For red wine, the most important direct causal quality effects are caused by alcohol, volatile acidity, and sulphates. Alcohol improves quality with a maximum plateau at 14%, while volatile acidity has a strong proportional negative effect. The effect of sulphates is highly nonlinear with maximum positive effect at a concentration of 1 g/L of K2SO4. For the white wine samples causal effects are linear with positive effects of alcohol and negative effects of volatile and fixed acidity. The developed structural causal model enables evaluation of targeted wine production interventions, named as “doing x, do(x) models”, as restructured adjusted Bayes networks. It leads to potential applications of artificial intelligence in wine production technology and process quality control.