{"title":"当涉及子类时,评估多类分类的准确性。","authors":"Nan Nan, Lili Tian","doi":"10.1177/09622802251343600","DOIUrl":null,"url":null,"abstract":"<p><p>Classifications that involve subclasses are common in many applied fields. \"Compound multi-class classification\" refers to the settings which involve three or more main classes and at least one of the main classes has multiple subclasses. In this paper, we propose an accuracy metric proper for \"compound <math><mi>M</mi></math>-class classification,\" namely \"hypervolume under compound <math><mrow><mi>R</mi><mi>O</mi><mi>C</mi></mrow></math> manifold <math><mo>(</mo><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub><mo>)</mo></math>.\" The proposed <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> evaluates the overall accuracy of a biomarker measured on continuous scale correctly identifying <math><mi>M</mi></math> main classes without requiring specification of an ordering in terms of marker values for subclasses relative to each other within each main class. The probabilistic interpretation of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> is analytically derived. A network-based computing algorithm which enables efficient computation of the empirical estimate of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> is developed. Non-parametric bootstrap percentile confidence intervals of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> are assessed through extensive simulation studies. Lastly, a real data example is included to illustrate the usage of our proposed method.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251343600"},"PeriodicalIF":1.6000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing accuracy for multi-class classification when subclasses are involved.\",\"authors\":\"Nan Nan, Lili Tian\",\"doi\":\"10.1177/09622802251343600\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Classifications that involve subclasses are common in many applied fields. \\\"Compound multi-class classification\\\" refers to the settings which involve three or more main classes and at least one of the main classes has multiple subclasses. In this paper, we propose an accuracy metric proper for \\\"compound <math><mi>M</mi></math>-class classification,\\\" namely \\\"hypervolume under compound <math><mrow><mi>R</mi><mi>O</mi><mi>C</mi></mrow></math> manifold <math><mo>(</mo><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub><mo>)</mo></math>.\\\" The proposed <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> evaluates the overall accuracy of a biomarker measured on continuous scale correctly identifying <math><mi>M</mi></math> main classes without requiring specification of an ordering in terms of marker values for subclasses relative to each other within each main class. The probabilistic interpretation of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> is analytically derived. A network-based computing algorithm which enables efficient computation of the empirical estimate of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> is developed. Non-parametric bootstrap percentile confidence intervals of <math><mi>H</mi><mi>U</mi><msub><mi>M</mi><mrow><mi>C</mi><mo>,</mo><mi>M</mi></mrow></msub></math> are assessed through extensive simulation studies. Lastly, a real data example is included to illustrate the usage of our proposed method.</p>\",\"PeriodicalId\":22038,\"journal\":{\"name\":\"Statistical Methods in Medical Research\",\"volume\":\" \",\"pages\":\"9622802251343600\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Methods in Medical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/09622802251343600\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Methods in Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/09622802251343600","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Assessing accuracy for multi-class classification when subclasses are involved.
Classifications that involve subclasses are common in many applied fields. "Compound multi-class classification" refers to the settings which involve three or more main classes and at least one of the main classes has multiple subclasses. In this paper, we propose an accuracy metric proper for "compound -class classification," namely "hypervolume under compound manifold ." The proposed evaluates the overall accuracy of a biomarker measured on continuous scale correctly identifying main classes without requiring specification of an ordering in terms of marker values for subclasses relative to each other within each main class. The probabilistic interpretation of is analytically derived. A network-based computing algorithm which enables efficient computation of the empirical estimate of is developed. Non-parametric bootstrap percentile confidence intervals of are assessed through extensive simulation studies. Lastly, a real data example is included to illustrate the usage of our proposed method.
期刊介绍:
Statistical Methods in Medical Research is a peer reviewed scholarly journal and is the leading vehicle for articles in all the main areas of medical statistics and an essential reference for all medical statisticians. This unique journal is devoted solely to statistics and medicine and aims to keep professionals abreast of the many powerful statistical techniques now available to the medical profession. This journal is a member of the Committee on Publication Ethics (COPE)