Natural language processing in the classification of radiology reports in benign gallbladder diseases.

Q3 Medicine
Radiologia Brasileira Pub Date : 2024-06-26 eCollection Date: 2024-01-01 DOI:10.1590/0100-3984.2023.0096-en
Lislie Gabriela Santin, Henrique Min Ho Lee, Viviane Mariano da Silva, Ellison Fernando Cardoso, Murilo Gleyson Gazzola
{"title":"Natural language processing in the classification of radiology reports in benign gallbladder diseases.","authors":"Lislie Gabriela Santin, Henrique Min Ho Lee, Viviane Mariano da Silva, Ellison Fernando Cardoso, Murilo Gleyson Gazzola","doi":"10.1590/0100-3984.2023.0096-en","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop a natural language processing application capable of automatically identifying benign gallbladder diseases that require surgery, from radiology reports.</p><p><strong>Materials and methods: </strong>We developed a text classifier to classify reports as describing benign diseases of the gallbladder that do or do not require surgery. We randomly selected 1,200 reports describing the gallbladder from our database, including different modalities. Four radiologists classified the reports as describing benign disease that should or should not be treated surgically. Two deep learning architectures were trained for classification: a convolutional neural network (CNN) and a bidirectional long short-term memory (BiLSTM) network. In order to represent words in vector form, the models included a Word2Vec representation, with dimensions of 300 or 1,000. The models were trained and evaluated by dividing the dataset into training, validation, and subsets (80/10/10).</p><p><strong>Results: </strong>The CNN and BiLSTM performed well in both dimensional spaces. For the 300- and 1,000-dimensional spaces, respectively, the F1-scores were 0.95945 and 0.95302 for the CNN model, compared with 0.96732 and 0.96732 for the BiLSTM model.</p><p><strong>Conclusion: </strong>Our models achieved high performance, regardless of the architecture and dimensional space employed.</p>","PeriodicalId":20842,"journal":{"name":"Radiologia Brasileira","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11235066/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologia Brasileira","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1590/0100-3984.2023.0096-en","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: To develop a natural language processing application capable of automatically identifying benign gallbladder diseases that require surgery, from radiology reports.

Materials and methods: We developed a text classifier to classify reports as describing benign diseases of the gallbladder that do or do not require surgery. We randomly selected 1,200 reports describing the gallbladder from our database, including different modalities. Four radiologists classified the reports as describing benign disease that should or should not be treated surgically. Two deep learning architectures were trained for classification: a convolutional neural network (CNN) and a bidirectional long short-term memory (BiLSTM) network. In order to represent words in vector form, the models included a Word2Vec representation, with dimensions of 300 or 1,000. The models were trained and evaluated by dividing the dataset into training, validation, and subsets (80/10/10).

Results: The CNN and BiLSTM performed well in both dimensional spaces. For the 300- and 1,000-dimensional spaces, respectively, the F1-scores were 0.95945 and 0.95302 for the CNN model, compared with 0.96732 and 0.96732 for the BiLSTM model.

Conclusion: Our models achieved high performance, regardless of the architecture and dimensional space employed.

良性胆囊疾病放射报告分类中的自然语言处理。
目的开发一种自然语言处理应用程序,能够从放射学报告中自动识别需要手术的良性胆囊疾病:我们开发了一种文本分类器,可将报告分为需要或不需要手术的胆囊良性疾病。我们从数据库中随机抽取了 1200 份描述胆囊疾病的报告,其中包括不同模式的报告。四位放射科医生将这些报告分类为应该或不应该进行手术治疗的良性疾病。为进行分类,我们训练了两种深度学习架构:卷积神经网络(CNN)和双向长短期记忆(BiLSTM)网络。为了以向量形式表示单词,这些模型包括维度为 300 或 1000 的 Word2Vec 表示法。通过将数据集分为训练集、验证集和子集(80/10/10),对模型进行了训练和评估:结果:CNN 和 BiLSTM 在两个维度空间中均表现良好。对于 300 维和 1000 维空间,CNN 模型的 F1 分数分别为 0.95945 和 0.95302,而 BiLSTM 模型的 F1 分数分别为 0.96732 和 0.96732:结论:无论采用何种架构和维度空间,我们的模型都取得了很高的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Radiologia Brasileira
Radiologia Brasileira Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
2.60
自引率
0.00%
发文量
75
审稿时长
28 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信