Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy.

Tanja Bekhuis
{"title":"Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy.","authors":"Tanja Bekhuis","doi":"10.1186/1742-5581-3-2","DOIUrl":null,"url":null,"abstract":"<p><p>Innovative biomedical librarians and information specialists who want to expand their roles as expert searchers need to know about profound changes in biology and parallel trends in text mining. In recent years, conceptual biology has emerged as a complement to empirical biology. This is partly in response to the availability of massive digital resources such as the network of databases for molecular biologists at the National Center for Biotechnology Information. Developments in text mining and hypothesis discovery systems based on the early work of Swanson, a mathematician and information scientist, are coincident with the emergence of conceptual biology. Very little has been written to introduce biomedical digital librarians to these new trends. In this paper, background for data and text mining, as well as for knowledge discovery in databases (KDD) and in text (KDT) is presented, then a brief review of Swanson's ideas, followed by a discussion of recent approaches to hypothesis discovery and testing. 'Testing' in the context of text mining involves partially automated methods for finding evidence in the literature to support hypothetical relationships. Concluding remarks follow regarding (a) the limits of current strategies for evaluation of hypothesis discovery systems and (b) the role of literature-based discovery in concert with empirical research. Report of an informatics-driven literature review for biomarkers of systemic lupus erythematosus is mentioned. Swanson's vision of the hidden value in the literature of science and, by extension, in biomedical digital databases, is still remarkably generative for information scientists, biologists, and physicians.</p>","PeriodicalId":87058,"journal":{"name":"Biomedical digital libraries","volume":"3 ","pages":"2"},"PeriodicalIF":0.0000,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5581-3-2","citationCount":"181","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical digital libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1742-5581-3-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 181

Abstract

Innovative biomedical librarians and information specialists who want to expand their roles as expert searchers need to know about profound changes in biology and parallel trends in text mining. In recent years, conceptual biology has emerged as a complement to empirical biology. This is partly in response to the availability of massive digital resources such as the network of databases for molecular biologists at the National Center for Biotechnology Information. Developments in text mining and hypothesis discovery systems based on the early work of Swanson, a mathematician and information scientist, are coincident with the emergence of conceptual biology. Very little has been written to introduce biomedical digital librarians to these new trends. In this paper, background for data and text mining, as well as for knowledge discovery in databases (KDD) and in text (KDT) is presented, then a brief review of Swanson's ideas, followed by a discussion of recent approaches to hypothesis discovery and testing. 'Testing' in the context of text mining involves partially automated methods for finding evidence in the literature to support hypothetical relationships. Concluding remarks follow regarding (a) the limits of current strategies for evaluation of hypothesis discovery systems and (b) the role of literature-based discovery in concert with empirical research. Report of an informatics-driven literature review for biomarkers of systemic lupus erythematosus is mentioned. Swanson's vision of the hidden value in the literature of science and, by extension, in biomedical digital databases, is still remarkably generative for information scientists, biologists, and physicians.

概念生物学、假设发现和文本挖掘:斯旺森的遗产。
创新的生物医学图书馆员和信息专家想要扩大他们作为专家搜索者的角色,需要了解生物学的深刻变化和文本挖掘的平行趋势。近年来,概念生物学作为经验生物学的补充而出现。这在一定程度上是对大量数字资源可用性的回应,例如国家生物技术信息中心的分子生物学家数据库网络。基于数学家和信息科学家Swanson早期工作的文本挖掘和假设发现系统的发展与概念生物学的出现是一致的。很少有文章向生物医学数字图书管理员介绍这些新趋势。在本文中,介绍了数据和文本挖掘的背景,以及数据库(KDD)和文本(KDT)中的知识发现,然后简要回顾了Swanson的思想,然后讨论了最近的假设发现和测试方法。在文本挖掘的背景下,“测试”涉及到在文献中寻找证据以支持假设关系的部分自动化方法。结束语以下是关于(a)评估假设发现系统的当前策略的局限性和(b)基于文献的发现与实证研究相一致的作用。报告信息学驱动的文献综述生物标志物的系统性红斑狼疮被提及。斯旺森对科学文献,乃至生物医学数字数据库中隐藏价值的看法,对信息科学家、生物学家和医生来说,仍然具有显著的生成性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信