{"title":"Exploring the effectiveness of medical entity recognition for clinical information retrieval","authors":"J. Cogley, N. Stokes, J. Carthy","doi":"10.1145/2512089.2512091","DOIUrl":null,"url":null,"abstract":"The growth of medical and clinical textual datasets has fostered research interests in methods for storing, retrieving and extracting of pertinent data. In more recent years, shared tasks and more comprehensive data sharing agreements have seen a further growth in the research area spanning Natural Language Processing (NLP) and Information Retrieval (IR) to aid the world of healthcare. Frequently NLP applications such as Medical Entity Recognition (MER), are motivated within the context of improving IR system performance. In this paper, we investigate the application of MER to a clinical retrieval system in the context of shared tasks in the respective areas. Namely, we aim to add structure to previously unstructured clinical reports and query sets. We evaluate the performance of MER on the query set, highlighting issues in constructing queries in a clinical setting. Further to this, we evaluate the performance of structuring queries on a retrieval dataset. We find that while structuring queries improves performance on complex queries that contain many term dependencies, there is a larger issue of linguistic variation found in clinical texts that must also be addressed.","PeriodicalId":143937,"journal":{"name":"Data and Text Mining in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and Text Mining in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2512089.2512091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The growth of medical and clinical textual datasets has fostered research interests in methods for storing, retrieving and extracting of pertinent data. In more recent years, shared tasks and more comprehensive data sharing agreements have seen a further growth in the research area spanning Natural Language Processing (NLP) and Information Retrieval (IR) to aid the world of healthcare. Frequently NLP applications such as Medical Entity Recognition (MER), are motivated within the context of improving IR system performance. In this paper, we investigate the application of MER to a clinical retrieval system in the context of shared tasks in the respective areas. Namely, we aim to add structure to previously unstructured clinical reports and query sets. We evaluate the performance of MER on the query set, highlighting issues in constructing queries in a clinical setting. Further to this, we evaluate the performance of structuring queries on a retrieval dataset. We find that while structuring queries improves performance on complex queries that contain many term dependencies, there is a larger issue of linguistic variation found in clinical texts that must also be addressed.