{"title":"Methodology for establishing a 'gold standard' (for a medical expert system)","authors":"D. C. Georgakis, C. Georgakis","doi":"10.1109/CBMS.1992.244926","DOIUrl":null,"url":null,"abstract":"Presents a statistical method for establishing a consensus diagnosis among several experts, which is the common 'gold standard' used for evaluating the diagnostic performance of a medical expert system. This method involves the use of the extended kappa statistic developed by J.L. Fleiss (1971) and R.J. Light (1971) in the social sciences for the study of a similar problem. The method is carried out in two stages. First, the existence of an overall agreement or disagreement among the medical experts in their diagnoses of a sample of patients is established. Second, in the case of overall agreement, one selects the particular disorders that have a significant level of agreement among the experts, and uses the experts' diagnoses as the 'gold standard' for the sample of patients that were classified in those disorders.<<ETX>>","PeriodicalId":197891,"journal":{"name":"[1992] Proceedings Fifth Annual IEEE Symposium on Computer-Based Medical Systems","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1992] Proceedings Fifth Annual IEEE Symposium on Computer-Based Medical Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.1992.244926","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Presents a statistical method for establishing a consensus diagnosis among several experts, which is the common 'gold standard' used for evaluating the diagnostic performance of a medical expert system. This method involves the use of the extended kappa statistic developed by J.L. Fleiss (1971) and R.J. Light (1971) in the social sciences for the study of a similar problem. The method is carried out in two stages. First, the existence of an overall agreement or disagreement among the medical experts in their diagnoses of a sample of patients is established. Second, in the case of overall agreement, one selects the particular disorders that have a significant level of agreement among the experts, and uses the experts' diagnoses as the 'gold standard' for the sample of patients that were classified in those disorders.<>