发现生物医学文献中命名实体之间的信息句法关系

2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications Pub Date : 2010-04-11 DOI:10.1109/DBKDA.2010.14

A. Appice, Michelangelo Ceci, Corrado Loglisci

{"title":"发现生物医学文献中命名实体之间的信息句法关系","authors":"A. Appice, Michelangelo Ceci, Corrado Loglisci","doi":"10.1109/DBKDA.2010.14","DOIUrl":null,"url":null,"abstract":"The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in biomedical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Discovering Informative Syntactic Relationships between Named Entities in Biomedical Literature\",\"authors\":\"A. Appice, Michelangelo Ceci, Corrado Loglisci\",\"doi\":\"10.1109/DBKDA.2010.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in biomedical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.\",\"PeriodicalId\":273177,\"journal\":{\"name\":\"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DBKDA.2010.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DBKDA.2010.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在生物医学文献中发现命名实体之间新的和潜在的有意义的关系可以从文本挖掘中的多关系数据挖掘方法的应用中得到很大的好处。这是由于多关系数据挖掘的特性能够表达和操纵实体之间的关系。我们研究了这种方法的应用，以解决识别生物医学抽象语料库中常见的信息句法结构的任务。最初，命名实体在文本语料库中根据一些生物医学词典(如MeSH分类法)进行注释。然后将标记的实体与相应动词的主语和/或宾语的角色集成到句法结构中。这些结构用一阶语言表示。频繁模式发现的多关系方法允许识别语料库中频繁出现的命名实体之间基于动词的关系。本文报道了通过查询Medline获得的一组关于特定疾病的摘要的初步实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Discovering Informative Syntactic Relationships between Named Entities in Biomedical Literature

The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in biomedical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications

自引率

0.00%

发文量