[Automatic ICD-10 coding : Natural language processing for German MRI reports].

Radiologie (Heidelberg, Germany) Pub Date : 2024-10-01 Epub Date: 2024-08-09 DOI:10.1007/s00117-024-01349-2

Andreas Mittermeier, Matthias Aßenmacher, Balthasar Schachtner, Sergio Grosu, Vladana Dakovic, Viktar Kandratovich, Bastian Sabel, Michael Ingrisch

{"title":"[Automatic ICD-10 coding : Natural language processing for German MRI reports].","authors":"Andreas Mittermeier, Matthias Aßenmacher, Balthasar Schachtner, Sergio Grosu, Vladana Dakovic, Viktar Kandratovich, Bastian Sabel, Michael Ingrisch","doi":"10.1007/s00117-024-01349-2","DOIUrl":null,"url":null,"abstract":"Background: The medical coding of radiology reports is essential for a good quality of care and correct billing, but at the same time a complex and error-prone task.Objective: To assess the performance of natural language processing (NLP) for ICD-10 coding of German radiology reports using fine tuning of suitable language models.Material and methods: This retrospective study included all magnetic resonance imaging (MRI) radiology reports acquired at our institution between 2010 and 2020. The codes on discharge ICD-10 were matched to the corresponding reports to construct a dataset for multiclass classification. Fine tuning of GermanBERT and flanT5 was carried out on the total dataset (dstotal) containing 1035 different ICD-10 codes and 2 reduced subsets containing the 100 (ds100) and 50 (ds50) most frequent codes. The performance of the model was assessed using top‑k accuracy for k = 1, 3 and 5. In an ablation study both models were trained on the accompanying metadata and the radiology report alone.Results: The total dataset consisted of 100,672 radiology reports, the reduced subsets ds100 of 68,103 and ds50 of 52,293 reports. The performance of the model increased when several of the best predictions of the model were taken into consideration, when the number of target classes was reduced and the metadata were combined with the report. The flanT5 outperformed GermanBERT across all datasets and metrics and was is suited as a medical coding assistant, achieving a top 3 accuracy of nearly 70% in the real-world dataset dstotal.Conclusion: Finely tuned language models can reliably predict ICD-10 codes of German magnetic resonance imaging (MRI) radiology reports across various settings. As a coding assistant flanT5 can guide medical coders to make informed decisions and potentially reduce the workload.","PeriodicalId":74635,"journal":{"name":"Radiologie (Heidelberg, Germany)","volume":" ","pages":"793-800"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologie (Heidelberg, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00117-024-01349-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/9 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The medical coding of radiology reports is essential for a good quality of care and correct billing, but at the same time a complex and error-prone task.

Objective: To assess the performance of natural language processing (NLP) for ICD-10 coding of German radiology reports using fine tuning of suitable language models.

Material and methods: This retrospective study included all magnetic resonance imaging (MRI) radiology reports acquired at our institution between 2010 and 2020. The codes on discharge ICD-10 were matched to the corresponding reports to construct a dataset for multiclass classification. Fine tuning of GermanBERT and flanT5 was carried out on the total dataset (ds_total) containing 1035 different ICD-10 codes and 2 reduced subsets containing the 100 (ds₁₀₀) and 50 (ds₅₀) most frequent codes. The performance of the model was assessed using top‑k accuracy for k = 1, 3 and 5. In an ablation study both models were trained on the accompanying metadata and the radiology report alone.

Results: The total dataset consisted of 100,672 radiology reports, the reduced subsets ds₁₀₀ of 68,103 and ds₅₀ of 52,293 reports. The performance of the model increased when several of the best predictions of the model were taken into consideration, when the number of target classes was reduced and the metadata were combined with the report. The flanT5 outperformed GermanBERT across all datasets and metrics and was is suited as a medical coding assistant, achieving a top 3 accuracy of nearly 70% in the real-world dataset ds_total.

Conclusion: Finely tuned language models can reliably predict ICD-10 codes of German magnetic resonance imaging (MRI) radiology reports across various settings. As a coding assistant flanT5 can guide medical coders to make informed decisions and potentially reduce the workload.

查看原文本刊更多论文

[自动 ICD-10 编码：德国 MRI 报告的自然语言处理]。

背景：放射学报告的医学编码对于提高医疗质量和正确计费至关重要，但同时也是一项复杂且容易出错的任务：目的：通过微调合适的语言模型，评估自然语言处理（NLP）在德国放射学报告 ICD-10 编码中的性能：这项回顾性研究包括我院在 2010 年至 2020 年期间获得的所有磁共振成像（MRI）放射学报告。出院ICD-10的代码与相应的报告相匹配，以构建多类分类的数据集。在包含 1035 个不同 ICD-10 代码的总数据集 (dstotal) 和包含 100 个最常见代码 (ds100) 和 50 个最常见代码 (ds50) 的两个缩小子集上对 GermanBERT 和 flanT5 进行了微调。模型的性能使用 k = 1、3 和 5 的 top-k 精确度进行评估。在一项消融研究中，两个模型都只对随附的元数据和放射学报告进行了训练：总数据集包括 100,672 份放射学报告，缩小子集 ds100 包括 68,103 份报告，ds50 包括 52,293 份报告。当考虑到模型的几个最佳预测、目标类别数量减少以及元数据与报告相结合时，模型的性能有所提高。在所有数据集和指标上，flanT5 的表现都优于 GermanBERT，适合作为医疗编码助手，在现实世界的数据集 dstotal 中，前三名的准确率接近 70%：结论：经过微调的语言模型可以在各种环境下可靠地预测德国磁共振成像（MRI）放射学报告的 ICD-10 编码。作为编码助手，flanT5 可以指导医疗编码员做出明智的决定，并有可能减少工作量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Radiologie (Heidelberg, Germany)

自引率

0.00%

发文量