Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis.

IF 0.7 4区 生物学 Q4 PLANT SCIENCES
Patrícia Pita Ferreira, Diogo Godinho Simões, Constança Pinto de Carvalho, Francisco Duarte, Eugénia Fernandes, Pedro Casaca Carvalho, José Francisco Loff, Ana Paula Soares, Maria João Albuquerque, Pedro Pinto-Leite, André Peralta-Santos
{"title":"Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis.","authors":"Patrícia Pita Ferreira, Diogo Godinho Simões, Constança Pinto de Carvalho, Francisco Duarte, Eugénia Fernandes, Pedro Casaca Carvalho, José Francisco Loff, Ana Paula Soares, Maria João Albuquerque, Pedro Pinto-Leite, André Peralta-Santos","doi":"10.2196/40965","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians' death certificates (DCs). Although AUTOCOD's performance has been established, it remains unclear whether its performance remains consistent over time, particularly during periods of excess mortality.</p><p><strong>Objective: </strong>This study aims to assess the sensitivity and other performance metrics of AUTOCOD in classifying underlying causes of death compared with manual coding to identify specific causes of death during periods of excess mortality.</p><p><strong>Methods: </strong>We included all DCs between 2016 and 2019. AUTOCOD's performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F<sub>1</sub>-score, using a confusion matrix. This compared International Statistical Classification of Diseases and Health-Related Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score >4 SDs, and extreme excess mortality as 2 consecutive days with a Z score >6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters focusing on block-level classification.</p><p><strong>Results: </strong>We analyzed a large data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (≥0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the more prevalent chapters (chapter II-\"Neoplasms,\" chapter IX-\"Diseases of the circulatory system,\" and chapter X-\"Diseases of the respiratory system\"), accounting for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. When considering block classification within the 3 most common ICD-10 chapters, AUTOCOD maintained a high performance, demonstrating high sensitivity (≥0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality.</p><p><strong>Conclusions: </strong>Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD's performance remains unaffected by potential text quality degradation because of pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.</p>","PeriodicalId":50817,"journal":{"name":"American Fern Journal","volume":"63 1","pages":"e40965"},"PeriodicalIF":0.7000,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11041420/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Fern Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/40965","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians' death certificates (DCs). Although AUTOCOD's performance has been established, it remains unclear whether its performance remains consistent over time, particularly during periods of excess mortality.

Objective: This study aims to assess the sensitivity and other performance metrics of AUTOCOD in classifying underlying causes of death compared with manual coding to identify specific causes of death during periods of excess mortality.

Methods: We included all DCs between 2016 and 2019. AUTOCOD's performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F1-score, using a confusion matrix. This compared International Statistical Classification of Diseases and Health-Related Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score >4 SDs, and extreme excess mortality as 2 consecutive days with a Z score >6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters focusing on block-level classification.

Results: We analyzed a large data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (≥0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the more prevalent chapters (chapter II-"Neoplasms," chapter IX-"Diseases of the circulatory system," and chapter X-"Diseases of the respiratory system"), accounting for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. When considering block classification within the 3 most common ICD-10 chapters, AUTOCOD maintained a high performance, demonstrating high sensitivity (≥0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality.

Conclusions: Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD's performance remains unaffected by potential text quality degradation because of pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.

利用人工智能对死因进行实时分类:敏感性分析。
背景:2021 年,欧盟报告的超额死亡人数超过 27 万,其中葡萄牙超过 1.6 万。葡萄牙卫生总局开发了深度神经网络 AUTOCOD,该网络通过分析医生死亡证明(DCs)的自由文本来确定死亡的主要原因。虽然 AUTOCOD 的性能已经得到证实,但其性能是否会随着时间的推移而保持稳定,尤其是在死亡率过高的时期,目前仍不清楚:本研究旨在评估 AUTOCOD 在分类基本死因方面的灵敏度和其他性能指标,并与人工编码进行比较,以确定死亡率过高期间的具体死因:我们纳入了 2016 年至 2019 年期间的所有 DC。通过使用混淆矩阵计算灵敏度、特异性、阳性预测值 (PPV) 和 F1 分数等各种性能指标来评估 AUTOCOD 的性能。我们比较了《国际疾病和健康相关问题统计分类》第 10 次修订版 (ICD-10)、AUTOCOD 对 DC 的分类和卫生总局人工编码员对 DC 的分类(黄金标准)。随后,我们将没有超额死亡率的时期与超额、严重和极度超额死亡率的时期进行了比较。我们将 Z 评分超过 95% 基线限值的连续 2 天定义为超额死亡率,将 Z 评分超过 4 SDs 的连续 2 天定义为严重超额死亡率,将 Z 评分超过 6 SDs 的连续 2 天定义为极度超额死亡率。最后,我们对 ICD-10 中最常见的 3 个章节进行了重复分析,重点关注区块级分类:我们分析了一个大型数据集,其中包括由人工编码员和 AUTOCOD 分类的 330,098 例 DC。AUTOCOD 对所研究的 10 个 ICD-10 章节显示出较高的灵敏度(≥0.75),其中较普遍的章节(第 II 章--"肿瘤"、第 IX 章--"循环系统疾病 "和第 X 章--"呼吸系统疾病")的灵敏度超过了 0.90,占所有人工编码死因的 67.69%(223,459/330,098)。将没有超额死亡率的时期与超额、严重和极度超额死亡率的时期进行比较,没有发现这些高灵敏度值有实质性差异。特异性和 PPV 值也是如此,特异性在所研究的所有章节中都超过了 0.96,而 PPV 值在 9 个章节中超过了 0.75,其中包括较普遍的章节。当考虑在 3 个最常见的 ICD-10 章节中进行区块分类时,AUTOCOD 保持了较高的性能,在 13 个 ICD-10 区块中显示出较高的灵敏度(≥0.75),在 9 个区块中显示出较高的 PPV,在所有区块中显示出大于 0.98 的特异性,在没有超常死亡率和超常死亡率的时期之间没有显著差异:我们的研究结果表明,在死亡率过高和极度过高的时期,AUTOCOD 的性能不受医疗服务压力导致的潜在文本质量下降的影响。因此,即使在死亡率极度超标的情况下,AUTOCOD 也可以可靠地用于实时病因死亡率监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
American Fern Journal
American Fern Journal 生物-植物科学
CiteScore
1.20
自引率
0.00%
发文量
28
审稿时长
6 months
期刊介绍: The American Fern Journal is a peer-reviewed journal focused on the biology of ferns and lycophytes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信