在使用共识统计来评估绩效时使用适当的统计技术

IF 1 4区工程技术 Q4 CHEMISTRY, ANALYTICAL

Accreditation and Quality Assurance Pub Date : 2024-06-24 DOI:10.1007/s00769-024-01607-9

Daniel Tholen, Piotr Robouch

{"title":"在使用共识统计来评估绩效时使用适当的统计技术","authors":"Daniel Tholen, Piotr Robouch","doi":"10.1007/s00769-024-01607-9","DOIUrl":null,"url":null,"abstract":"<div><p>A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the <i>z’</i> score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.</p></div>","PeriodicalId":454,"journal":{"name":"Accreditation and Quality Assurance","volume":"29 5-6","pages":"425 - 431"},"PeriodicalIF":1.0000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Appropriate statistical techniques when using consensus statistics to evaluate performance\",\"authors\":\"Daniel Tholen, Piotr Robouch\",\"doi\":\"10.1007/s00769-024-01607-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the <i>z’</i> score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.</p></div>\",\"PeriodicalId\":454,\"journal\":{\"name\":\"Accreditation and Quality Assurance\",\"volume\":\"29 5-6\",\"pages\":\"425 - 431\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accreditation and Quality Assurance\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00769-024-01607-9\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accreditation and Quality Assurance","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s00769-024-01607-9","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}

引用次数: 0

摘要

各种各样的统计方法可以用于能力测试（PT）计划在实验室测试的各个领域。在ISO 13528和其他国际标准中描述的统计方法在各种各样的应用中解决了PT。统计技术方面最显著的差别在于，业绩评价是利用当前一轮的协商一致统计数据根据参与者的结果确定的，还是独立确定业绩标准的。对于协商一致评价的方案，下一个最重要的因素是方案及其参与者的经验。缺乏能力、新入组或不理解所提供说明的参与者报告的结果比例证明了这一点。例如，对于新方案（例如，第一次运行）所必需的统计技术可能不适用于具有相同参与者的几轮之后的类似方案。同样，不同的技术可以适用于封闭方案，这些方案对有限的有经验的实验室进行定期技术审查。考虑到不称职或没有经验的参与者的高水平“污染”的技术，例如具有共识赋值的z '分数，不应该在使用共识统计的有经验的方案中使用。对有经验的实验室应采用对小系统误差更敏感的其他技术进行更密切的监测，包括考虑结果测量不确定度的统计技术。成熟的PT方案和特殊用途的封闭方案应评估实验室在合格评定决策中使用的任何PT方案中参与者结果的测量不确定度，或者改进参与者协议是该方案的目标。要求遵守ISO/IEC 17043的监督机构应考虑这些建议，以更好地确保测量的全球兼容性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Appropriate statistical techniques when using consensus statistics to evaluate performance

A large variety of statistical methods can be used for proficiency testing (PT) programs in various areas of laboratory testing. Statistical methods described in ISO 13528 and other international standards address PT in a wide variety of applications. The most significant difference in statistical techniques is whether performance evaluations are determined from the participant results using consensus statistics from the current round, or whether the performance criteria are determined independently. For schemes evaluated by consensus, the next most significant factor is the experience of both the scheme and its participants. This is evidenced in the proportion of results reported by participants who lack competence, are newly enrolled, or do not understand the instructions provided. For example, statistical techniques that are necessary for novel schemes (e.g. run for the first time) may not be appropriate for a similar scheme after several rounds with the same participants. Similarly, different techniques may apply for closed schemes that have regular technical review of a limited group of experienced laboratories. Techniques that make allowances for high levels of “contamination” from incompetent or inexperienced participants, such as the z’ score with consensus assigned values, should not be used in experienced schemes using consensus statistics. Other techniques that are more sensitive to small systematic errors should be employed for closer monitoring of experienced laboratories, including statistical techniques that consider the measurement uncertainty of the results. Mature PT schemes and closed schemes for special purposes should evaluate the measurement uncertainty of participant results in any PT scheme used by laboratories that make decisions on conformity assessment, or where improvement of participant agreement is an objective for the scheme. Oversight bodies that require compliance with ISO/IEC 17043 should consider these recommendations, to better ensure global compatibility of measurements.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Accreditation and Quality Assurance 工程技术-分析化学

CiteScore

1.80

自引率

22.20%

发文量

审稿时长

6-12 weeks

期刊介绍： Accreditation and Quality Assurance has established itself as the leading information and discussion forum for all aspects relevant to quality, transparency and reliability of measurement results in chemical and biological sciences. The journal serves the information needs of researchers, practitioners and decision makers dealing with quality assurance and quality management, including the development and application of metrological principles and concepts such as traceability or measurement uncertainty in the following fields: environment, nutrition, consumer protection, geology, metallurgy, pharmacy, forensics, clinical chemistry and laboratory medicine, and microbiology.