A Systematic Review of Natural Language Processing Methods and Applications in Thyroidology

Ricardo Loor-Torres MD , Mayra Duran MD , David Toro-Tobon MD , Maria Mateo Chavez MD , Oscar Ponce MD , Cristian Soto Jacome MD , Danny Segura Torres MD , Sandra Algarin Perneth MD , Victor Montori BA , Elizabeth Golembiewski PhD, MPH , Mariana Borras Osorio MD , Jungwei W. Fan PhD , Naykky Singh Ospina MD , Yonghui Wu PhD , Juan P. Brito MD, MS
{"title":"A Systematic Review of Natural Language Processing Methods and Applications in Thyroidology","authors":"Ricardo Loor-Torres MD ,&nbsp;Mayra Duran MD ,&nbsp;David Toro-Tobon MD ,&nbsp;Maria Mateo Chavez MD ,&nbsp;Oscar Ponce MD ,&nbsp;Cristian Soto Jacome MD ,&nbsp;Danny Segura Torres MD ,&nbsp;Sandra Algarin Perneth MD ,&nbsp;Victor Montori BA ,&nbsp;Elizabeth Golembiewski PhD, MPH ,&nbsp;Mariana Borras Osorio MD ,&nbsp;Jungwei W. Fan PhD ,&nbsp;Naykky Singh Ospina MD ,&nbsp;Yonghui Wu PhD ,&nbsp;Juan P. Brito MD, MS","doi":"10.1016/j.mcpdig.2024.03.007","DOIUrl":null,"url":null,"abstract":"<div><p>This study aimed to review the application of natural language processing (NLP) in thyroid-related conditions and to summarize current challenges and potential future directions. We performed a systematic search of databases for studies describing NLP applications in thyroid conditions published in English between January 1, 2012 and November 4, 2022. In addition, we used a snowballing technique to identify studies missed in the initial search or published after our search timeline until April 1, 2023. For included studies, we extracted the NLP method (eg, rule-based, machine learning, deep learning, or hybrid), NLP application (eg, identification, classification, and automation), thyroid condition (eg, thyroid cancer, thyroid nodule, and functional or autoimmune disease), data source (eg, electronic health records, health forums, medical literature databases, or genomic databases), performance metrics, and stages of development. We identified 24 eligible NLP studies focusing on thyroid-related conditions. Deep learning-based methods were the most common (38%), followed by rule-based (21%), and traditional machine learning (21%) methods. Thyroid nodules (54%) and thyroid cancer (29%) were the primary conditions under investigation. Electronic health records were the dominant data source (17/24, 71%), with imaging reports being the most frequently used (15/17, 88%). There is increasing interest in NLP applications for thyroid-related studies, mostly addressing thyroid nodules and using deep learning-based methodologies with limited external validation. However, none of the reviewed NLP applications have reached clinical practice. Several limitations, including inconsistent clinical documentation and model portability, need to be addressed to promote the evaluation and implementation of NLP applications to support patient care in thyroidology.</p></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"2 2","pages":"Pages 270-279"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949761224000270/pdfft?md5=a263fb1467469ab6d8333b257365a8ec&pid=1-s2.0-S2949761224000270-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mayo Clinic Proceedings. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949761224000270","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study aimed to review the application of natural language processing (NLP) in thyroid-related conditions and to summarize current challenges and potential future directions. We performed a systematic search of databases for studies describing NLP applications in thyroid conditions published in English between January 1, 2012 and November 4, 2022. In addition, we used a snowballing technique to identify studies missed in the initial search or published after our search timeline until April 1, 2023. For included studies, we extracted the NLP method (eg, rule-based, machine learning, deep learning, or hybrid), NLP application (eg, identification, classification, and automation), thyroid condition (eg, thyroid cancer, thyroid nodule, and functional or autoimmune disease), data source (eg, electronic health records, health forums, medical literature databases, or genomic databases), performance metrics, and stages of development. We identified 24 eligible NLP studies focusing on thyroid-related conditions. Deep learning-based methods were the most common (38%), followed by rule-based (21%), and traditional machine learning (21%) methods. Thyroid nodules (54%) and thyroid cancer (29%) were the primary conditions under investigation. Electronic health records were the dominant data source (17/24, 71%), with imaging reports being the most frequently used (15/17, 88%). There is increasing interest in NLP applications for thyroid-related studies, mostly addressing thyroid nodules and using deep learning-based methodologies with limited external validation. However, none of the reviewed NLP applications have reached clinical practice. Several limitations, including inconsistent clinical documentation and model portability, need to be addressed to promote the evaluation and implementation of NLP applications to support patient care in thyroidology.

自然语言处理方法及在甲状腺学中的应用系统综述
本研究旨在回顾自然语言处理(NLP)在甲状腺相关疾病中的应用,并总结当前面临的挑战和未来可能的发展方向。我们在数据库中系统检索了2012年1月1日至2022年11月4日期间发表的有关NLP在甲状腺疾病中应用的英文研究。此外,我们还使用了滚雪球技术,以确定初始搜索中遗漏的研究,或在搜索时间截止到 2023 年 4 月 1 日之后发表的研究。对于纳入的研究,我们提取了 NLP 方法(例如,基于规则、机器学习、深度学习或混合)、NLP 应用(例如,识别、分类和自动化)、甲状腺疾病(例如,甲状腺癌、甲状腺结节、功能性或自身免疫性疾病)、数据来源(例如,电子健康记录、健康论坛、医学文献数据库或基因组数据库)、性能指标和开发阶段。我们确定了 24 项符合条件的 NLP 研究,重点关注甲状腺相关疾病。基于深度学习的方法最常见(38%),其次是基于规则的方法(21%)和传统机器学习方法(21%)。甲状腺结节(54%)和甲状腺癌(29%)是研究的主要病症。电子健康记录是最主要的数据来源(17/24,71%),而成像报告是最常用的数据来源(15/17,88%)。人们对甲状腺相关研究中的 NLP 应用越来越感兴趣,这些应用主要针对甲状腺结节,使用基于深度学习的方法,但外部验证有限。然而,在已审查的 NLP 应用程序中,没有一个已应用于临床实践。要促进 NLP 应用的评估和实施,为甲状腺病学中的患者护理提供支持,还需要解决一些局限性问题,包括临床记录不一致和模型可移植性问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Mayo Clinic Proceedings. Digital health
Mayo Clinic Proceedings. Digital health Medicine and Dentistry (General), Health Informatics, Public Health and Health Policy
自引率
0.00%
发文量
0
审稿时长
47 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信