Mercedes Argüello Casteleiro, J. Des, Maria Jesus Fernandez Prieto, Rogelio Perez, Stavros Lekkas
{"title":"An Ontology-Based Approach to Natural Language Generation from Coded Data in Electronic Health Records","authors":"Mercedes Argüello Casteleiro, J. Des, Maria Jesus Fernandez Prieto, Rogelio Perez, Stavros Lekkas","doi":"10.1109/EMS.2011.47","DOIUrl":null,"url":null,"abstract":"The worldwide adoption of the HL7 Clinical Document Architecture (CDA) is promoting the availability of coded data (CDA entries) within sections of clinical documents. At the moment, an increasing number of studies are investigating ways to transform the narratives of CDA documents into machine process able CDA entries. This paper addresses the reverse problem, i.e. obtaining linguistic representations (sentences) from CDA entries. The approach presented employs Natural Language Generation (NLG) techniques and deals with two major tasks: content selection and content expression. The current research proposes a formal semantic representation of CDA entries and investigates how expressive domain ontologies in OWL and SPARQL SELECT queries can contribute to NLG. To validate the proposal, the study has focused on CDA entries from the History of Present Illness sections of CDA consultation notes. The results obtained are encouraging, as the clinical narratives automatically generated from these CDA entries fulfil the clinicians' expectations.","PeriodicalId":131364,"journal":{"name":"2011 UKSim 5th European Symposium on Computer Modeling and Simulation","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 UKSim 5th European Symposium on Computer Modeling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMS.2011.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The worldwide adoption of the HL7 Clinical Document Architecture (CDA) is promoting the availability of coded data (CDA entries) within sections of clinical documents. At the moment, an increasing number of studies are investigating ways to transform the narratives of CDA documents into machine process able CDA entries. This paper addresses the reverse problem, i.e. obtaining linguistic representations (sentences) from CDA entries. The approach presented employs Natural Language Generation (NLG) techniques and deals with two major tasks: content selection and content expression. The current research proposes a formal semantic representation of CDA entries and investigates how expressive domain ontologies in OWL and SPARQL SELECT queries can contribute to NLG. To validate the proposal, the study has focused on CDA entries from the History of Present Illness sections of CDA consultation notes. The results obtained are encouraging, as the clinical narratives automatically generated from these CDA entries fulfil the clinicians' expectations.