{"title":"Exploring Deep Learning-based Approaches for Predicting Concept Names in SNOMED CT","authors":"Fengbo Zheng, Licong Cui","doi":"10.1109/BIBM.2018.8621076","DOIUrl":null,"url":null,"abstract":"Ontologies or terminologies have been widely used as formal representation of biomedical knowledge. New concepts are constantly added to biomedical ontologies due to the evolving nature of biomedical knowledge. Much progress has been made to identify new concepts in SNOMED CT, the largest clinical healthcare terminology. However, proper naming of new concepts remains challenging and relies on the ontology curators’ manual effort. In this paper, we explore three deep learning-based approaches, given bags of words, to automatically predict concept names that comply with the naming convention of SNOMED CT. These deep learning models are simple neural network, Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) combined with LSTM. Our experiments showed that LSTM-based approach achieved the best performance: a precision of 65.98%, a recall of 61.04%, and an F1 score of 63.41% for predicting concept names for newly added concepts in the March 2018 Edition of SNOMED CT. It also achieved a precision of 74.58%, a recall of 73.33%, and an F1 score of 73.95% for naming missing concepts identified by our previous work. Further examination of results revealed inconsistencies within SNOMED CT which may be leveraged for quality assurance purpose.","PeriodicalId":108667,"journal":{"name":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2018.8621076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Ontologies or terminologies have been widely used as formal representation of biomedical knowledge. New concepts are constantly added to biomedical ontologies due to the evolving nature of biomedical knowledge. Much progress has been made to identify new concepts in SNOMED CT, the largest clinical healthcare terminology. However, proper naming of new concepts remains challenging and relies on the ontology curators’ manual effort. In this paper, we explore three deep learning-based approaches, given bags of words, to automatically predict concept names that comply with the naming convention of SNOMED CT. These deep learning models are simple neural network, Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) combined with LSTM. Our experiments showed that LSTM-based approach achieved the best performance: a precision of 65.98%, a recall of 61.04%, and an F1 score of 63.41% for predicting concept names for newly added concepts in the March 2018 Edition of SNOMED CT. It also achieved a precision of 74.58%, a recall of 73.33%, and an F1 score of 73.95% for naming missing concepts identified by our previous work. Further examination of results revealed inconsistencies within SNOMED CT which may be leveraged for quality assurance purpose.