Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports

IF 1.8 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Salvatore Claudio Fanni , Chiara Romei , Giovanni Ferrando , Federica Volpi , Caterina Aida D’Amore , Claudio Bedini , Sandro Ubbiali , Salvatore Valentino , Emanuele Neri
{"title":"Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports","authors":"Salvatore Claudio Fanni ,&nbsp;Chiara Romei ,&nbsp;Giovanni Ferrando ,&nbsp;Federica Volpi ,&nbsp;Caterina Aida D’Amore ,&nbsp;Claudio Bedini ,&nbsp;Sandro Ubbiali ,&nbsp;Salvatore Valentino ,&nbsp;Emanuele Neri","doi":"10.1016/j.ejro.2023.100512","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Structured reporting has been demonstrated to increase report completeness and to reduce error rate, also enabling data mining of radiological reports. Still, structured reporting is perceived by radiologists as a fragmented reporting style, limiting their freedom of expression.</p></div><div><h3>Purpose</h3><p>A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.</p></div><div><h3>Methods</h3><p>Two hundred-two COVID-19 chest CT were retrospectively reviewed by two experienced radiologists, who wrote for each exam a free-form text radiological report and coherently filled the template provided by the Italian Society of Medical and Interventional Radiology, used as ground-truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried-out, the first without fine-tuning, the second one performing a fine-tuning. The performance was measured using the mean accuracy and the F1 mean score. An error analysis was performed to identify errors entirely attributable to incorrect processing of the model.</p></div><div><h3>Results</h3><p>The algorithm achieved a mean accuracy of 93.7% and an F1 score 93.8% in the first iteration. Most of the errors were exclusively attributable to wrong inference (46%). In the second iteration the model achieved for both parameters 95,8% and percentage of errors attributable to wrong inference decreased to 26%.</p></div><div><h3>Conclusions</h3><p>The convolutional neural network achieved an optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming all the limitation attributed to structured reporting and finally paving the way for data mining of radiological report.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2023-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/6e/3e/main.PMC10413059.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352047723000382","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Structured reporting has been demonstrated to increase report completeness and to reduce error rate, also enabling data mining of radiological reports. Still, structured reporting is perceived by radiologists as a fragmented reporting style, limiting their freedom of expression.

Purpose

A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.

Methods

Two hundred-two COVID-19 chest CT were retrospectively reviewed by two experienced radiologists, who wrote for each exam a free-form text radiological report and coherently filled the template provided by the Italian Society of Medical and Interventional Radiology, used as ground-truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried-out, the first without fine-tuning, the second one performing a fine-tuning. The performance was measured using the mean accuracy and the F1 mean score. An error analysis was performed to identify errors entirely attributable to incorrect processing of the model.

Results

The algorithm achieved a mean accuracy of 93.7% and an F1 score 93.8% in the first iteration. Most of the errors were exclusively attributable to wrong inference (46%). In the second iteration the model achieved for both parameters 95,8% and percentage of errors attributable to wrong inference decreased to 26%.

Conclusions

The convolutional neural network achieved an optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming all the limitation attributed to structured reporting and finally paving the way for data mining of radiological report.

Abstract Image

自然语言处理,将非结构化新冠肺炎胸部CT报告转换为结构化报告
背景结构化报告已被证明可以提高报告的完整性并降低错误率,还可以实现放射性报告的数据挖掘。尽管如此,放射科医生认为结构化报告是一种零散的报告风格,限制了他们的表达自由。目的开发一种基于深度学习的自然语言处理方法,将非结构化新冠肺炎胸部CT报告自动转换为结构化报告。方法由两名经验丰富的放射科医生回顾性检查两例新冠肺炎胸部CT,他们为每次检查编写一份自由文本的放射学报告,并连贯地填写意大利医学和介入放射学会提供的模板,作为基础。采用半监督卷积神经网络从报告中提取62个分类变量。进行了两次迭代,第一次没有微调,第二次进行微调。使用平均准确度和F1平均得分来测量性能。进行了误差分析,以确定完全可归因于模型处理错误的误差。结果该算法在第一次迭代中的平均准确率为93.7%,F1得分为93.8%。大多数错误完全归因于错误推断(46%)。在第二次迭代中,该模型对两个参数都达到了95,8%,错误推理导致的错误百分比降至26%。结论卷积神经网络在将自由格式文本自动转换为结构化放射学报告方面取得了最佳性能,克服了结构化报告的所有局限性,最终为放射学报告的数据挖掘铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Radiology Open
European Journal of Radiology Open Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
4.10
自引率
5.00%
发文量
55
审稿时长
51 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信