Mining the biomedical literature using semantic analysis and natural language processing techniques

Ronen Feldman, Yizhar Regev, Eyal Hurvitz, Michal Finkelstein-Landau
Biosilico 1 (2), pages 69-80. Journal article, published 2003-05-02.
DOI: 10.1016/S1478-5382(03)02330-8
https://www.sciencedirect.com/science/article/pii/S1478538203023308
Citations: 47

Abstract

The information age has made the electronic storage of large amounts of data effortless. The proliferation of documents available on the Internet, corporate intranets, news wires and elsewhere is overwhelming. Search engines only exacerbate this overload by making ever more documents available in only a few keystrokes. This information overload also exists in the biomedical field, where scientific publications and other forms of text-based data are produced at an unprecedented rate. Text mining is the combined, automated process of analyzing unstructured, natural language text to discover information and knowledge that are typically difficult to retrieve. Here, we focus on text mining as applied to the biomedical literature, and in particular on finding relationships among genes, proteins, drugs and diseases, to facilitate an understanding and prediction of complex biological processes. The LitMiner™ system, developed specifically for this purpose, is described in relation to the Knowledge Discovery and Data Mining Cup 2002, which serves as a formal evaluation of the system.
