Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports

IF 1.8 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

European Journal of Radiology Open Pub Date : 2023-07-25 DOI:10.1016/j.ejro.2023.100512

Salvatore Claudio Fanni , Chiara Romei , Giovanni Ferrando , Federica Volpi , Caterina Aida D’Amore , Claudio Bedini , Sandro Ubbiali , Salvatore Valentino , Emanuele Neri

{"title":"Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports","authors":"Salvatore Claudio Fanni , Chiara Romei , Giovanni Ferrando , Federica Volpi , Caterina Aida D’Amore , Claudio Bedini , Sandro Ubbiali , Salvatore Valentino , Emanuele Neri","doi":"10.1016/j.ejro.2023.100512","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Structured reporting has been demonstrated to increase report completeness and to reduce error rate, also enabling data mining of radiological reports. Still, structured reporting is perceived by radiologists as a fragmented reporting style, limiting their freedom of expression.</p></div><div><h3>Purpose</h3><p>A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.</p></div><div><h3>Methods</h3><p>Two hundred-two COVID-19 chest CT were retrospectively reviewed by two experienced radiologists, who wrote for each exam a free-form text radiological report and coherently filled the template provided by the Italian Society of Medical and Interventional Radiology, used as ground-truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried-out, the first without fine-tuning, the second one performing a fine-tuning. The performance was measured using the mean accuracy and the F1 mean score. An error analysis was performed to identify errors entirely attributable to incorrect processing of the model.</p></div><div><h3>Results</h3><p>The algorithm achieved a mean accuracy of 93.7% and an F1 score 93.8% in the first iteration. Most of the errors were exclusively attributable to wrong inference (46%). In the second iteration the model achieved for both parameters 95,8% and percentage of errors attributable to wrong inference decreased to 26%.</p></div><div><h3>Conclusions</h3><p>The convolutional neural network achieved an optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming all the limitation attributed to structured reporting and finally paving the way for data mining of radiological report.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"11 ","pages":"Article 100512"},"PeriodicalIF":1.8000,"publicationDate":"2023-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/6e/3e/main.PMC10413059.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352047723000382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Structured reporting has been demonstrated to increase report completeness and to reduce error rate, also enabling data mining of radiological reports. Still, structured reporting is perceived by radiologists as a fragmented reporting style, limiting their freedom of expression.

Purpose

A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.

Methods

Two hundred-two COVID-19 chest CT were retrospectively reviewed by two experienced radiologists, who wrote for each exam a free-form text radiological report and coherently filled the template provided by the Italian Society of Medical and Interventional Radiology, used as ground-truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried-out, the first without fine-tuning, the second one performing a fine-tuning. The performance was measured using the mean accuracy and the F1 mean score. An error analysis was performed to identify errors entirely attributable to incorrect processing of the model.

Results

The algorithm achieved a mean accuracy of 93.7% and an F1 score 93.8% in the first iteration. Most of the errors were exclusively attributable to wrong inference (46%). In the second iteration the model achieved for both parameters 95,8% and percentage of errors attributable to wrong inference decreased to 26%.

Conclusions

The convolutional neural network achieved an optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming all the limitation attributed to structured reporting and finally paving the way for data mining of radiological report.

Abstract Image

查看原文本刊更多论文

自然语言处理，将非结构化新冠肺炎胸部CT报告转换为结构化报告

背景结构化报告已被证明可以提高报告的完整性并降低错误率，还可以实现放射性报告的数据挖掘。尽管如此，放射科医生认为结构化报告是一种零散的报告风格，限制了他们的表达自由。目的开发一种基于深度学习的自然语言处理方法，将非结构化新冠肺炎胸部CT报告自动转换为结构化报告。方法由两名经验丰富的放射科医生回顾性检查两例新冠肺炎胸部CT，他们为每次检查编写一份自由文本的放射学报告，并连贯地填写意大利医学和介入放射学会提供的模板，作为基础。采用半监督卷积神经网络从报告中提取62个分类变量。进行了两次迭代，第一次没有微调，第二次进行微调。使用平均准确度和F1平均得分来测量性能。进行了误差分析，以确定完全可归因于模型处理错误的误差。结果该算法在第一次迭代中的平均准确率为93.7%，F1得分为93.8%。大多数错误完全归因于错误推断（46%）。在第二次迭代中，该模型对两个参数都达到了95,8%，错误推理导致的错误百分比降至26%。结论卷积神经网络在将自由格式文本自动转换为结构化放射学报告方面取得了最佳性能，克服了结构化报告的所有局限性，最终为放射学报告的数据挖掘铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊