{"title":"在没有金标准的医疗专家系统验证中控制机会一致性:肺炎和雷诺阿的重新审视","authors":"M. Martı́n-Baranera , J.J. Sancho, F. Sanz","doi":"10.1006/cbmr.2000.1552","DOIUrl":null,"url":null,"abstract":"<div><p>In the validation of medical expert systems, agreement among different human specialists on a random sample of cases may be taken as a substitute to a missing gold standard. Distance measures between pairs of experts, extensively described in previous studies, do not take into account the influence of chance-expected agreement. A weighted kappa index, with three different weighting schemes, is proposed as an alternative to be applied in the general situation of <em>N</em> cases assessed by <em>E</em> experts about <em>K</em> possible diagnoses, each of them qualified with one of <em>G</em> ordinal categories. A hierarchical cluster analysis, applied to the kappa matrices generated, allows for the classification of the expert system among clinical specialists, providing a relative assessment of its diagnostic ability. The above methodology is applied to the validation of two medical expert systems, PNEUMON-IA and RENOIR.</p></div>","PeriodicalId":75733,"journal":{"name":"Computers and biomedical research, an international journal","volume":"33 6","pages":"Pages 380-397"},"PeriodicalIF":0.0000,"publicationDate":"2000-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/cbmr.2000.1552","citationCount":"6","resultStr":"{\"title\":\"Controlling for Chance Agreement in the Validation of Medical Expert Systems with No Gold Standard: PNEUMON-IA and RENOIR Revisited\",\"authors\":\"M. Martı́n-Baranera , J.J. Sancho, F. Sanz\",\"doi\":\"10.1006/cbmr.2000.1552\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the validation of medical expert systems, agreement among different human specialists on a random sample of cases may be taken as a substitute to a missing gold standard. Distance measures between pairs of experts, extensively described in previous studies, do not take into account the influence of chance-expected agreement. A weighted kappa index, with three different weighting schemes, is proposed as an alternative to be applied in the general situation of <em>N</em> cases assessed by <em>E</em> experts about <em>K</em> possible diagnoses, each of them qualified with one of <em>G</em> ordinal categories. A hierarchical cluster analysis, applied to the kappa matrices generated, allows for the classification of the expert system among clinical specialists, providing a relative assessment of its diagnostic ability. The above methodology is applied to the validation of two medical expert systems, PNEUMON-IA and RENOIR.</p></div>\",\"PeriodicalId\":75733,\"journal\":{\"name\":\"Computers and biomedical research, an international journal\",\"volume\":\"33 6\",\"pages\":\"Pages 380-397\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1006/cbmr.2000.1552\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and biomedical research, an international journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010480900915520\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and biomedical research, an international journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010480900915520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Controlling for Chance Agreement in the Validation of Medical Expert Systems with No Gold Standard: PNEUMON-IA and RENOIR Revisited
In the validation of medical expert systems, agreement among different human specialists on a random sample of cases may be taken as a substitute to a missing gold standard. Distance measures between pairs of experts, extensively described in previous studies, do not take into account the influence of chance-expected agreement. A weighted kappa index, with three different weighting schemes, is proposed as an alternative to be applied in the general situation of N cases assessed by E experts about K possible diagnoses, each of them qualified with one of G ordinal categories. A hierarchical cluster analysis, applied to the kappa matrices generated, allows for the classification of the expert system among clinical specialists, providing a relative assessment of its diagnostic ability. The above methodology is applied to the validation of two medical expert systems, PNEUMON-IA and RENOIR.