Exploring Deep Learning-based Approaches for Predicting Concept Names in SNOMED CT

Fengbo Zheng, Licong Cui
{"title":"Exploring Deep Learning-based Approaches for Predicting Concept Names in SNOMED CT","authors":"Fengbo Zheng, Licong Cui","doi":"10.1109/BIBM.2018.8621076","DOIUrl":null,"url":null,"abstract":"Ontologies or terminologies have been widely used as formal representation of biomedical knowledge. New concepts are constantly added to biomedical ontologies due to the evolving nature of biomedical knowledge. Much progress has been made to identify new concepts in SNOMED CT, the largest clinical healthcare terminology. However, proper naming of new concepts remains challenging and relies on the ontology curators’ manual effort. In this paper, we explore three deep learning-based approaches, given bags of words, to automatically predict concept names that comply with the naming convention of SNOMED CT. These deep learning models are simple neural network, Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) combined with LSTM. Our experiments showed that LSTM-based approach achieved the best performance: a precision of 65.98%, a recall of 61.04%, and an F1 score of 63.41% for predicting concept names for newly added concepts in the March 2018 Edition of SNOMED CT. It also achieved a precision of 74.58%, a recall of 73.33%, and an F1 score of 73.95% for naming missing concepts identified by our previous work. Further examination of results revealed inconsistencies within SNOMED CT which may be leveraged for quality assurance purpose.","PeriodicalId":108667,"journal":{"name":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2018.8621076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Ontologies or terminologies have been widely used as formal representation of biomedical knowledge. New concepts are constantly added to biomedical ontologies due to the evolving nature of biomedical knowledge. Much progress has been made to identify new concepts in SNOMED CT, the largest clinical healthcare terminology. However, proper naming of new concepts remains challenging and relies on the ontology curators’ manual effort. In this paper, we explore three deep learning-based approaches, given bags of words, to automatically predict concept names that comply with the naming convention of SNOMED CT. These deep learning models are simple neural network, Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) combined with LSTM. Our experiments showed that LSTM-based approach achieved the best performance: a precision of 65.98%, a recall of 61.04%, and an F1 score of 63.41% for predicting concept names for newly added concepts in the March 2018 Edition of SNOMED CT. It also achieved a precision of 74.58%, a recall of 73.33%, and an F1 score of 73.95% for naming missing concepts identified by our previous work. Further examination of results revealed inconsistencies within SNOMED CT which may be leveraged for quality assurance purpose.
探索基于深度学习的SNOMED CT概念名称预测方法
本体论或术语已被广泛用作生物医学知识的正式表示。由于生物医学知识的不断发展,新的概念不断被添加到生物医学本体中。在最大的临床医疗保健术语SNOMED CT中,在确定新概念方面取得了很大进展。然而,新概念的正确命名仍然具有挑战性,并且依赖于本体管理员的手工工作。在本文中,我们探索了三种基于深度学习的方法,在给定单词包的情况下,自动预测符合SNOMED CT命名约定的概念名称。这些深度学习模型有简单神经网络(simple neural network)、长短期记忆(Long - Short-Term Memory, LSTM)和结合LSTM的卷积神经网络(Convolutional neural network, CNN)。我们的实验表明,基于lstm的方法在预测2018年3月版SNOMED CT中新增概念名称方面取得了最好的性能:准确率为65.98%,召回率为61.04%,F1分数为63.41%。它还实现了74.58%的准确率,73.33%的召回率,以及73.95%的F1分数,用于命名我们之前工作中识别的缺失概念。进一步检查结果发现了SNOMED CT的不一致,这可能是为了保证质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信