{"title":"Accuracy of categorical measurements: Nominal scale","authors":"Tamar Gadrich, Yariv N. Marmor, Emil Bashkansky","doi":"10.1016/j.measurement.2025.117044","DOIUrl":null,"url":null,"abstract":"<div><div>Classification accuracy, considered from a categorical measurement point of view, is particularly crucial in scenarios where the cost of false output is high. Categorical measurement means that the object’s property under study is presented on a scale consisting of <em>K</em> exclusive classes/categories, forming a comprehensive spectrum of this property. Often such classifications are provided by a group of measurers/classifiers participating in collaborative study (e.g. inter-laboratory or inter-operator comparisons, proficiency testing projects, comparison of different methods or algorithms etc.). Previously, only the metrological properties of the fixed-factor model were studied. A random factors statistical model for analyzing accuracy from collaborative studies is presented in this paper. We assume that due to measurement/classification errors, a property belonging to category <em>i</em> will be classified by a collaborator as category <em>k</em> with probabilities <span><math><msub><mi>p</mi><mrow><mi>k</mi><mfenced><mrow><mi>i</mi></mrow></mfenced></mrow></msub></math></span> (confusion matrix), distributed between collaborators according to the Dirichlet distribution for every given <em>i</em>, whereas category counts of repeated classifications by every collaborator are distributed according to corresponding multinomial distribution. We propose unbiased estimators for repeatability and classifiers’ components of the total precision as well as trueness, based on total categorical variation decomposition and the distance from an ideal classification metrics. We discuss possible options for statistical homogeneity/heterogeneity tests. In the framework of the proposed model, the issue of assessing concordance and discordance between classifiers is also discussed.</div></div>","PeriodicalId":18349,"journal":{"name":"Measurement","volume":"250 ","pages":"Article 117044"},"PeriodicalIF":5.2000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0263224125004038","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Classification accuracy, considered from a categorical measurement point of view, is particularly crucial in scenarios where the cost of false output is high. Categorical measurement means that the object’s property under study is presented on a scale consisting of K exclusive classes/categories, forming a comprehensive spectrum of this property. Often such classifications are provided by a group of measurers/classifiers participating in collaborative study (e.g. inter-laboratory or inter-operator comparisons, proficiency testing projects, comparison of different methods or algorithms etc.). Previously, only the metrological properties of the fixed-factor model were studied. A random factors statistical model for analyzing accuracy from collaborative studies is presented in this paper. We assume that due to measurement/classification errors, a property belonging to category i will be classified by a collaborator as category k with probabilities (confusion matrix), distributed between collaborators according to the Dirichlet distribution for every given i, whereas category counts of repeated classifications by every collaborator are distributed according to corresponding multinomial distribution. We propose unbiased estimators for repeatability and classifiers’ components of the total precision as well as trueness, based on total categorical variation decomposition and the distance from an ideal classification metrics. We discuss possible options for statistical homogeneity/heterogeneity tests. In the framework of the proposed model, the issue of assessing concordance and discordance between classifiers is also discussed.
期刊介绍:
Contributions are invited on novel achievements in all fields of measurement and instrumentation science and technology. Authors are encouraged to submit novel material, whose ultimate goal is an advancement in the state of the art of: measurement and metrology fundamentals, sensors, measurement instruments, measurement and estimation techniques, measurement data processing and fusion algorithms, evaluation procedures and methodologies for plants and industrial processes, performance analysis of systems, processes and algorithms, mathematical models for measurement-oriented purposes, distributed measurement systems in a connected world.