{"title":"Dive into diagnostics for DOE model discrepancies","authors":"Shari Kraber","doi":"10.1177/87560879231204142","DOIUrl":null,"url":null,"abstract":"The purpose of running a statistically designed experiment (DOE) is to take a strategically selected small sample of data from a larger system, and then extract a prediction equation that appropriately models the overall system. The statistical tool used to relate the independent factors to the dependent responses is analysis of variance (ANOVA). This article will lay out the key assumptions for ANOVA and how to verify them using graphical diagnostic plots. The first assumption (and one that is often overlooked) is that the chosen model is correct. This means that the terms in the model explain the relationship between the factors and the response, and there are not too many terms (over-fitting), or too few terms (underfitting). One should not use the raw R-squared statistic as it is misleading. Please see Journal of Plastic Film& Sheeting Volume 37, Number 4, page 405, “R-squared mysteries solved” which explains how to interpret raw versus adjusted versus predicted R-squared.","PeriodicalId":16823,"journal":{"name":"Journal of Plastic Film & Sheeting","volume":"20 1","pages":"0"},"PeriodicalIF":2.1000,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Plastic Film & Sheeting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/87560879231204142","RegionNum":4,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATERIALS SCIENCE, COATINGS & FILMS","Score":null,"Total":0}
引用次数: 0
Abstract
The purpose of running a statistically designed experiment (DOE) is to take a strategically selected small sample of data from a larger system, and then extract a prediction equation that appropriately models the overall system. The statistical tool used to relate the independent factors to the dependent responses is analysis of variance (ANOVA). This article will lay out the key assumptions for ANOVA and how to verify them using graphical diagnostic plots. The first assumption (and one that is often overlooked) is that the chosen model is correct. This means that the terms in the model explain the relationship between the factors and the response, and there are not too many terms (over-fitting), or too few terms (underfitting). One should not use the raw R-squared statistic as it is misleading. Please see Journal of Plastic Film& Sheeting Volume 37, Number 4, page 405, “R-squared mysteries solved” which explains how to interpret raw versus adjusted versus predicted R-squared.
期刊介绍:
The Journal of Plastic Film and Sheeting improves communication concerning plastic film and sheeting with major emphasis on the propogation of knowledge which will serve to advance the science and technology of these products and thus better serve industry and the ultimate consumer. The journal reports on the wide variety of advances that are rapidly taking place in the technology of plastic film and sheeting. This journal is a member of the Committee on Publication Ethics (COPE).