在预先确定的变更控制计划下优化全新的基于人工智能的医疗设备:提高检测或排除儿童自闭症的能力

Intelligence-based medicine Pub Date : 2023-01-01 DOI:10.1016/j.ibmed.2023.100102

Dennis P. Wall , Stuart Liu-Mayo , Carmela Salomon , Jennifer Shannon , Sharief Taraman

{"title":"在预先确定的变更控制计划下优化全新的基于人工智能的医疗设备:提高检测或排除儿童自闭症的能力","authors":"Dennis P. Wall , Stuart Liu-Mayo , Carmela Salomon , Jennifer Shannon , Sharief Taraman","doi":"10.1016/j.ibmed.2023.100102","DOIUrl":null,"url":null,"abstract":"<div><p>A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"8 ","pages":"Article 100102"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing a de novo artificial intelligence-based medical device under a predetermined change control plan: Improved ability to detect or rule out pediatric autism\",\"authors\":\"Dennis P. Wall , Stuart Liu-Mayo , Carmela Salomon , Jennifer Shannon , Sharief Taraman\",\"doi\":\"10.1016/j.ibmed.2023.100102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.</p></div>\",\"PeriodicalId\":73399,\"journal\":{\"name\":\"Intelligence-based medicine\",\"volume\":\"8 \",\"pages\":\"Article 100102\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligence-based medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666521223000169\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521223000169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

越来越多的基于人工智能的医疗设备正在获得美国食品药品监督管理局（FDA）的批准。关于监管和安全监督此类设备的最佳实践，人们展开了争论，这些设备的功能如果“解锁”，包括迭代学习和适应新数据。美国食品药品监督管理局提出的一种监管机制是预先确定的变更控制计划（PCCP）。这项分析提供了我们认为是第一个在实践中如何利用PCCP来提高新自闭症诊断设备性能的例子。根据第一代设备营销授权中包含的PCCP模型更新程序（“算法V1”），我们进行了算法阈值优化程序，以提高设备在不改变设备准确性或预期用途的情况下检测或排除18-72个月儿童自闭症的能力。决策阈值优化是在722名18-22个月的发育迟缓儿童（28%为自闭症，22%为神经典型，50%为其他发育迟缓，平均年龄3.6岁，39%为女性）的数据集上使用重复训练/测试验证程序实现的。在1000次重复中，选择70%的样本进行阈值优化，30%进行评估，确保测试集中没有出现训练数据。通过评估测试集上选择的阈值对并将新对的性能度量与同一测试集上对应的V1度量进行比较来估计样本外性能。该设备具有优化的决策阈值，为66.5%（95%置信区间，62.5–71.0）的儿童产生了确定的输出。阳性预测值（PPV）和阴性预测值（PPV）分别为87.5%（95%CI，82.5–96.7）和95.6%（95%CI，93.7–97.9）。阈值优化提高了该设备在更大比例的儿童中准确检测或排除自闭症的能力。鉴于目前美国自闭症评估的等待名单危机，优化阈值提供的覆盖范围的潜在增加是有希望的，并强调了监管机制的价值，该机制允许作为医疗设备的软件在给定真实世界数据的情况下安全、适当地适应。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimizing a de novo artificial intelligence-based medical device under a predetermined change control plan: Improved ability to detect or rule out pediatric autism

A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Intelligence-based medicine Health Informatics

CiteScore

5.00

自引率

0.00%

发文量

审稿时长

187 days