从文献中提取和描述基因与药物的关系。

Pharmacogenetics Pub Date : 2004-09-01 DOI:10.1097/00008571-200409000-00002

Jeffrey T Chang, Russ B Altman

{"title":"从文献中提取和描述基因与药物的关系。","authors":"Jeffrey T Chang, Russ B Altman","doi":"10.1097/00008571-200409000-00002","DOIUrl":null,"url":null,"abstract":"A fundamental task of pharmacogenetics is to collect and classify relationships between genes and drugs. Currently, this useful information has not been comprehensively aggregated in any database and remains scattered throughout the published literature. Although there are efforts to collect this information manually, they are limited by the size of the published literature on gene-drug relationships. Therefore, we investigated computational methods to extract and characterize pharmacogenetic relationships between genes and drugs from the literature. We first evaluated the effectiveness of the co-occurrence method in identifying related genes and drugs. We then used supervised machine learning algorithms to classify the relationships between genes and drugs from the Pharmacogenetics and Pharmacogenomics Knowledge Base (PharmGKB) into five categories that have been defined by active pharmacogenetic researchers as relevant to their work. The final co-occurrence algorithm was able to extract 78% of the related genes and drugs that were published in a review article from the literature. Our algorithm subsequently classified the relationships between genes and drugs from the PharmGKB into five categories with 74% accuracy. We have made the data available on a supplementary website at http://bionlp.stanford.edu/genedrug/ Gene-drug relationships can be accurately extracted from text and classified into categories. Although the relationships that we have identified do not capture the details and fine distinctions often made in the literature, these methods will help scientists to track the ever-growing literature and create information resources to support future discoveries.","PeriodicalId":19917,"journal":{"name":"Pharmacogenetics","volume":"14 9","pages":"577-86"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1097/00008571-200409000-00002","citationCount":"59","resultStr":"{\"title\":\"Extracting and characterizing gene-drug relationships from the literature.\",\"authors\":\"Jeffrey T Chang, Russ B Altman\",\"doi\":\"10.1097/00008571-200409000-00002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A fundamental task of pharmacogenetics is to collect and classify relationships between genes and drugs. Currently, this useful information has not been comprehensively aggregated in any database and remains scattered throughout the published literature. Although there are efforts to collect this information manually, they are limited by the size of the published literature on gene-drug relationships. Therefore, we investigated computational methods to extract and characterize pharmacogenetic relationships between genes and drugs from the literature. We first evaluated the effectiveness of the co-occurrence method in identifying related genes and drugs. We then used supervised machine learning algorithms to classify the relationships between genes and drugs from the Pharmacogenetics and Pharmacogenomics Knowledge Base (PharmGKB) into five categories that have been defined by active pharmacogenetic researchers as relevant to their work. The final co-occurrence algorithm was able to extract 78% of the related genes and drugs that were published in a review article from the literature. Our algorithm subsequently classified the relationships between genes and drugs from the PharmGKB into five categories with 74% accuracy. We have made the data available on a supplementary website at http://bionlp.stanford.edu/genedrug/ Gene-drug relationships can be accurately extracted from text and classified into categories. Although the relationships that we have identified do not capture the details and fine distinctions often made in the literature, these methods will help scientists to track the ever-growing literature and create information resources to support future discoveries.\",\"PeriodicalId\":19917,\"journal\":{\"name\":\"Pharmacogenetics\",\"volume\":\"14 9\",\"pages\":\"577-86\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1097/00008571-200409000-00002\",\"citationCount\":\"59\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pharmacogenetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1097/00008571-200409000-00002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmacogenetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/00008571-200409000-00002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 59

摘要

药物遗传学的一项基本任务是收集和分类基因与药物之间的关系。目前，这些有用的资料还没有在任何数据库中全面汇总，在已发表的文献中仍然分散。尽管有人努力人工收集这些信息，但它们受到有关基因-药物关系的已发表文献数量的限制。因此，我们研究了从文献中提取和表征基因与药物之间的药物遗传关系的计算方法。我们首先评估了共现法在识别相关基因和药物方面的有效性。然后，我们使用监督机器学习算法将药物遗传学和药物基因组学知识库(PharmGKB)中的基因和药物之间的关系分类为五类，这些类别是由活跃的药物遗传学研究人员根据他们的工作定义的。最终的共现算法能够从文献综述文章中提取出78%的相关基因和药物。我们的算法随后将PharmGKB中的基因和药物之间的关系分为五类，准确率为74%。我们已经在一个补充网站http://bionlp.stanford.edu/genedrug/上提供了数据，基因-药物关系可以准确地从文本中提取并分类。虽然我们已经确定的关系并没有捕捉到文献中经常出现的细节和细微的区别，但这些方法将帮助科学家追踪不断增长的文献并创建信息资源，以支持未来的发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Extracting and characterizing gene-drug relationships from the literature.

A fundamental task of pharmacogenetics is to collect and classify relationships between genes and drugs. Currently, this useful information has not been comprehensively aggregated in any database and remains scattered throughout the published literature. Although there are efforts to collect this information manually, they are limited by the size of the published literature on gene-drug relationships. Therefore, we investigated computational methods to extract and characterize pharmacogenetic relationships between genes and drugs from the literature. We first evaluated the effectiveness of the co-occurrence method in identifying related genes and drugs. We then used supervised machine learning algorithms to classify the relationships between genes and drugs from the Pharmacogenetics and Pharmacogenomics Knowledge Base (PharmGKB) into five categories that have been defined by active pharmacogenetic researchers as relevant to their work. The final co-occurrence algorithm was able to extract 78% of the related genes and drugs that were published in a review article from the literature. Our algorithm subsequently classified the relationships between genes and drugs from the PharmGKB into five categories with 74% accuracy. We have made the data available on a supplementary website at http://bionlp.stanford.edu/genedrug/ Gene-drug relationships can be accurately extracted from text and classified into categories. Although the relationships that we have identified do not capture the details and fine distinctions often made in the literature, these methods will help scientists to track the ever-growing literature and create information resources to support future discoveries.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Pharmacogenetics

自引率

0.00%

发文量