Exploring quality dimensions in trustworthy Machine Learning in the context of official statistics: model explainability and uncertainty quantification
{"title":"Exploring quality dimensions in trustworthy Machine Learning in the context of official statistics: model explainability and uncertainty quantification","authors":"Saeid Molladavoudi, Wesley Yung","doi":"10.1007/s11943-023-00331-z","DOIUrl":null,"url":null,"abstract":"<div><p>Despite the fact that National Statistical Offices (NSOs) continue to embrace and adopt Machine Learning (ML) methods and tools in a variety of areas of their operations, including data collection, integration, and processing, it is still not clear how these complex and prediction-oriented approaches can be incorporated into the quality standards and frameworks within NSOs or if the frameworks themselves need to be modified. This article focuses on and builds upon two of the quality dimensions proposed in the Quality Framework for Statistical Algorithms (QF4SA): model explainability and accuracy (including uncertainty). The implications of the current methods for explainable ML and uncertainty quantification will be examined in further detail, as well as their possible uses in statistical production, such as continuous model monitoring in intermediate ML classifications and auto-coding phases. This strategy will ensure that human subject-matter experts, who are an essential component of every statistical program, are effectively integrated into the life cycle of ML projects. It will also guarantee to maintain the quality of ML models in production, adhere to the current quality frameworks within NSOs, and ultimately boost confidence and trust in these emerging technologies.</p></div>","PeriodicalId":100134,"journal":{"name":"AStA Wirtschafts- und Sozialstatistisches Archiv","volume":"17 3-4","pages":"223 - 252"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AStA Wirtschafts- und Sozialstatistisches Archiv","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s11943-023-00331-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Despite the fact that National Statistical Offices (NSOs) continue to embrace and adopt Machine Learning (ML) methods and tools in a variety of areas of their operations, including data collection, integration, and processing, it is still not clear how these complex and prediction-oriented approaches can be incorporated into the quality standards and frameworks within NSOs or if the frameworks themselves need to be modified. This article focuses on and builds upon two of the quality dimensions proposed in the Quality Framework for Statistical Algorithms (QF4SA): model explainability and accuracy (including uncertainty). The implications of the current methods for explainable ML and uncertainty quantification will be examined in further detail, as well as their possible uses in statistical production, such as continuous model monitoring in intermediate ML classifications and auto-coding phases. This strategy will ensure that human subject-matter experts, who are an essential component of every statistical program, are effectively integrated into the life cycle of ML projects. It will also guarantee to maintain the quality of ML models in production, adhere to the current quality frameworks within NSOs, and ultimately boost confidence and trust in these emerging technologies.
尽管各国国家统计局(NSO)在数据收集、整合和处理等多个业务领域不断接受并采用机器学习(ML)方法和工具,但目前仍不清楚如何将这些复杂且以预测为导向的方法纳入国家统计局的质量标准和框架,或者是否需要对框架本身进行修改。本文重点关注并借鉴了统计算法质量框架(QF4SA)中提出的两个质量维度:模型的可解释性和准确性(包括不确定性)。本文将进一步详细分析当前可解释 ML 和不确定性量化方法的影响,以及这些方法在统计生产中的可能用途,例如在中间 ML 分类和自动编码阶段对模型进行持续监控。这一战略将确保作为每个统计程序重要组成部分的人类主题专家有效融入 ML 项目的生命周期。它还将保证在生产过程中保持 ML 模型的质量,遵守国家统计局当前的质量框架,并最终增强人们对这些新兴技术的信心和信任。