{"title":"在使用共识统计来评估绩效时使用适当的统计技术","authors":"Daniel Tholen, Piotr Robouch","doi":"10.1007/s00769-024-01607-9","DOIUrl":null,"url":null,"abstract":"<div><p>A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the <i>z’</i> score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.</p></div>","PeriodicalId":454,"journal":{"name":"Accreditation and Quality Assurance","volume":"29 5-6","pages":"425 - 431"},"PeriodicalIF":1.0000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Appropriate statistical techniques when using consensus statistics to evaluate performance\",\"authors\":\"Daniel Tholen, Piotr Robouch\",\"doi\":\"10.1007/s00769-024-01607-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the <i>z’</i> score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.</p></div>\",\"PeriodicalId\":454,\"journal\":{\"name\":\"Accreditation and Quality Assurance\",\"volume\":\"29 5-6\",\"pages\":\"425 - 431\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accreditation and Quality Assurance\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00769-024-01607-9\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accreditation and Quality Assurance","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s00769-024-01607-9","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Appropriate statistical techniques when using consensus statistics to evaluate performance
A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the z’ score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.
期刊介绍:
Accreditation and Quality Assurance has established itself as the leading information and discussion forum for all aspects relevant to quality, transparency and reliability of measurement results in chemical and biological sciences. The journal serves the information needs of researchers, practitioners and decision makers dealing with quality assurance and quality management, including the development and application of metrological principles and concepts such as traceability or measurement uncertainty in the following fields: environment, nutrition, consumer protection, geology, metallurgy, pharmacy, forensics, clinical chemistry and laboratory medicine, and microbiology.