Context-aware automated ICD coding: A semantic-driven approach

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems Pub Date : 2025-03-04 DOI:10.1016/j.is.2025.102539

O.K. Reshma, N. Saleena, K.A. Abdul Nazeer

{"title":"Context-aware automated ICD coding: A semantic-driven approach","authors":"O.K. Reshma, N. Saleena, K.A. Abdul Nazeer","doi":"10.1016/j.is.2025.102539","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying the exact International Classification of Diseases (ICD) codes describing a patient’ s health condition is essential in classifying patients with similar disease conditions. Numerous studies have devised automated approaches to retrieve the ICD codes from patients’ health records. However, majority of these methodologies have considered ICD codes solely as alphanumeric codes, overlooking their descriptions and thus neglecting the inherent semantics. Also, these methodologies overlook the one-to-many semantic relationships between diagnosis and assigned ICD code descriptions. Subsequently, this constrains these approaches from effectively assigning ICD codes with meaningful context. This work addresses these limitations by capturing the semantic similarity between the diagnosis and ICD code descriptions, while utilising the inherent one-to-many relationships between them, to accurately assign ICD codes. For this, we formulate the ICD coding problem as a Semantic Text Similarity task. The proposed approach uses a siamese stacked Bi-LSTM network to learn context-aware representations of diagnoses and ICD code descriptions. We transform each patient-visit data into sentence pairs by considering the one-to-many relationships between diagnosis and assigned ICD code descriptions. Further, we compute their semantic similarity and classify them as similar or dissimilar. The proposed approach was evaluated using 5-fold cross-validation on MIMIC-III dataset and achieved the highest evaluation metric scores (F1-score 0.66, precision 0.67, recall 0.84) compared with other sequential models. The per-label evaluation demonstrates the performance of the proposed approach for each ICD code. Furthermore, the proposed approach outperformed several existing attention-based models, demonstrating the potential use of semantics in automated ICD coding.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"132 ","pages":"Article 102539"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306437925000249","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Identifying the exact International Classification of Diseases (ICD) codes describing a patient’ s health condition is essential in classifying patients with similar disease conditions. Numerous studies have devised automated approaches to retrieve the ICD codes from patients’ health records. However, majority of these methodologies have considered ICD codes solely as alphanumeric codes, overlooking their descriptions and thus neglecting the inherent semantics. Also, these methodologies overlook the one-to-many semantic relationships between diagnosis and assigned ICD code descriptions. Subsequently, this constrains these approaches from effectively assigning ICD codes with meaningful context. This work addresses these limitations by capturing the semantic similarity between the diagnosis and ICD code descriptions, while utilising the inherent one-to-many relationships between them, to accurately assign ICD codes. For this, we formulate the ICD coding problem as a Semantic Text Similarity task. The proposed approach uses a siamese stacked Bi-LSTM network to learn context-aware representations of diagnoses and ICD code descriptions. We transform each patient-visit data into sentence pairs by considering the one-to-many relationships between diagnosis and assigned ICD code descriptions. Further, we compute their semantic similarity and classify them as similar or dissimilar. The proposed approach was evaluated using 5-fold cross-validation on MIMIC-III dataset and achieved the highest evaluation metric scores (F1-score 0.66, precision 0.67, recall 0.84) compared with other sequential models. The per-label evaluation demonstrates the performance of the proposed approach for each ICD code. Furthermore, the proposed approach outperformed several existing attention-based models, demonstrating the potential use of semantics in automated ICD coding.

Abstract Image

查看原文本刊更多论文

上下文感知的自动ICD编码：语义驱动的方法

确定描述患者健康状况的准确国际疾病分类（ICD）代码对于对患有类似疾病的患者进行分类至关重要。许多研究已经设计出从患者健康记录中检索ICD代码的自动化方法。然而，这些方法中的大多数只将ICD代码视为字母数字代码，忽略了它们的描述，从而忽略了其固有的语义。此外，这些方法忽略了诊断和分配的ICD代码描述之间的一对多语义关系。随后，这限制了这些方法有效地分配具有有意义上下文的ICD代码。这项工作通过捕获诊断和ICD代码描述之间的语义相似性来解决这些限制，同时利用它们之间固有的一对多关系来准确地分配ICD代码。为此，我们将ICD编码问题表述为语义文本相似度任务。提出的方法使用连体堆叠Bi-LSTM网络来学习诊断和ICD代码描述的上下文感知表示。我们通过考虑诊断和分配的ICD代码描述之间的一对多关系，将每个患者访问数据转换为句子对。进一步，我们计算它们的语义相似度，并将它们分类为相似或不相似。在MIMIC-III数据集上进行5倍交叉验证，与其他序列模型相比，该方法获得了最高的评价指标得分（f1得分0.66，精度0.67，召回率0.84）。每个标签的评估证明了所建议的方法对每个ICD代码的性能。此外，所提出的方法优于几种现有的基于注意力的模型，证明了语义在自动ICD编码中的潜在用途。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Systems 工程技术-计算机：信息系统

CiteScore

9.40

自引率

2.70%

发文量

112

审稿时长

53 days

期刊介绍： Information systems are the software and hardware systems that support data-intensive applications. The journal Information Systems publishes articles concerning the design and implementation of languages, data models, process models, algorithms, software and hardware for information systems. Subject areas include data management issues as presented in the principal international database conferences (e.g., ACM SIGMOD/PODS, VLDB, ICDE and ICDT/EDBT) as well as data-related issues from the fields of data mining/machine learning, information retrieval coordinated with structured data, internet and cloud data management, business process management, web semantics, visual and audio information systems, scientific computing, and data science. Implementation papers having to do with massively parallel data management, fault tolerance in practice, and special purpose hardware for data-intensive systems are also welcome. Manuscripts from application domains, such as urban informatics, social and natural science, and Internet of Things, are also welcome. All papers should highlight innovative solutions to data management problems such as new data models, performance enhancements, and show how those innovations contribute to the goals of the application.