从登革热患者病例表生成基于成分与依赖关系解析的RDF模型

IF 1 Q3 INFORMATION SCIENCE & LIBRARY SCIENCE

Journal of Information & Knowledge Management Pub Date : 2022-01-04 DOI:10.1142/s0219649222500137

Runumi Devi, D. Mehrotra, Sana Ben Abdallah Ben Lamine

{"title":"从登革热患者病例表生成基于成分与依赖关系解析的RDF模型","authors":"Runumi Devi, D. Mehrotra, Sana Ben Abdallah Ben Lamine","doi":"10.1142/s0219649222500137","DOIUrl":null,"url":null,"abstract":"Electronic Health Record (EHR) systems in healthcare organisations are primarily maintained in isolation from each other that makes interoperability of unstructured(text) data stored in these EHR systems challenging in the healthcare domain. Similar information may be described using different terminologies by different applications that can be evaded by transforming the content into the Resource Description Framework (RDF) model that is interoperable amongst organisations. RDF requires a document’s contents to be translated into a repository of triplets (subject, predicate, object) known as RDF statements. Natural Language Processing (NLP) techniques can help get actionable insights from these text data and create triplets for RDF model generation. This paper discusses two NLP-based approaches to generate the RDF models from unstructured patients’ documents, namely dependency structure-based and constituent(phrase) structure-based parser. Models generated by both approaches are evaluated in two aspects: exhaustiveness of the represented knowledge and the model generation time. The precision measure is used to compute the models’ exhaustiveness in terms of the number of facts that are transformed into RDF representations.","PeriodicalId":45460,"journal":{"name":"Journal of Information & Knowledge Management","volume":"1 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2022-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Constituent vs Dependency Parsing-Based RDF Model Generation from Dengue Patients’ Case Sheets\",\"authors\":\"Runumi Devi, D. Mehrotra, Sana Ben Abdallah Ben Lamine\",\"doi\":\"10.1142/s0219649222500137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Electronic Health Record (EHR) systems in healthcare organisations are primarily maintained in isolation from each other that makes interoperability of unstructured(text) data stored in these EHR systems challenging in the healthcare domain. Similar information may be described using different terminologies by different applications that can be evaded by transforming the content into the Resource Description Framework (RDF) model that is interoperable amongst organisations. RDF requires a document’s contents to be translated into a repository of triplets (subject, predicate, object) known as RDF statements. Natural Language Processing (NLP) techniques can help get actionable insights from these text data and create triplets for RDF model generation. This paper discusses two NLP-based approaches to generate the RDF models from unstructured patients’ documents, namely dependency structure-based and constituent(phrase) structure-based parser. Models generated by both approaches are evaluated in two aspects: exhaustiveness of the represented knowledge and the model generation time. The precision measure is used to compute the models’ exhaustiveness in terms of the number of facts that are transformed into RDF representations.\",\"PeriodicalId\":45460,\"journal\":{\"name\":\"Journal of Information & Knowledge Management\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2022-01-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Information & Knowledge Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219649222500137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information & Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219649222500137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}

引用次数: 0

摘要

医疗保健组织中的电子健康记录（EHR）系统主要是在相互隔离的情况下进行维护的，这使得存储在这些EHR系统中的非结构化（文本）数据的互操作性在医疗保健领域具有挑战性。类似的信息可以由不同的应用程序使用不同的术语来描述，这些应用程序可以通过将内容转换为组织之间可互操作的资源描述框架（RDF）模型来规避。RDF要求将文档的内容翻译成三元组（主语、谓语、宾语）的存储库，称为RDF语句。自然语言处理（NLP）技术可以帮助从这些文本数据中获得可操作的见解，并为RDF模型生成创建三元组。本文讨论了从非结构化患者文档中生成RDF模型的两种基于NLP的方法，即基于依赖结构的解析器和基于成分（短语）结构的解析器。两种方法生成的模型从两个方面进行评估：表示知识的穷尽性和模型生成时间。精度度量用于根据转换为RDF表示的事实数量来计算模型的穷尽性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Constituent vs Dependency Parsing-Based RDF Model Generation from Dengue Patients’ Case Sheets

Electronic Health Record (EHR) systems in healthcare organisations are primarily maintained in isolation from each other that makes interoperability of unstructured(text) data stored in these EHR systems challenging in the healthcare domain. Similar information may be described using different terminologies by different applications that can be evaded by transforming the content into the Resource Description Framework (RDF) model that is interoperable amongst organisations. RDF requires a document’s contents to be translated into a repository of triplets (subject, predicate, object) known as RDF statements. Natural Language Processing (NLP) techniques can help get actionable insights from these text data and create triplets for RDF model generation. This paper discusses two NLP-based approaches to generate the RDF models from unstructured patients’ documents, namely dependency structure-based and constituent(phrase) structure-based parser. Models generated by both approaches are evaluated in two aspects: exhaustiveness of the represented knowledge and the model generation time. The precision measure is used to compute the models’ exhaustiveness in terms of the number of facts that are transformed into RDF representations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Information & Knowledge Management INFORMATION SCIENCE & LIBRARY SCIENCE-

CiteScore

2.40

自引率

25.00%

发文量

期刊介绍： JIKM is a refereed journal published quarterly by World Scientific and dedicated to the exchange of the latest research and practical information in the field of information processing and knowledge management. The journal publishes original research and case studies by academic, business and government contributors on all aspects of information processing, information management, knowledge management, tools, techniques and technologies, knowledge creation and sharing, best practices, policies and guidelines. JIKM is an international journal aimed at providing quality information to subscribers around the world. Managed by an international editorial board, JIKM positions itself as one of the leading scholarly journals in the field of information processing and knowledge management. It is a good reference for both information and knowledge management professionals. The journal covers key areas in the field of information and knowledge management. Research papers, practical applications, working papers, and case studies are invited in the following areas: -Business intelligence and competitive intelligence -Communication and organizational culture -e-Learning and life long learning -Electronic records and document management -Information processing and information management -Information organization, taxonomies and ontology -Intellectual capital -Knowledge creation, retention, sharing and transfer -Knowledge discovery, data and text mining -Knowledge management and innovations -Knowledge management education -Knowledge management tools and technologies -Knowledge management measurements -Knowledge professionals and leadership -Learning organization and organizational learning -Practical implementations of knowledge management