Resource capability discovery and description management system for bioinformatics Data and service Integration - an experiment with gene regulatory networks

2008 11th International Conference on Computer and Information Technology Pub Date : 2008-12-01 DOI:10.1109/ICCITECHN.2008.4802991

E. Ahmed


Citations: 8

Abstract

Legacy HTML-based web sites and pages can be thought of as web services, because dynamic web pages accept user input through web forms and respond to user queries. The ability of agents and services to automatically locate and interact with unknown partners is a goal of Web-based data integration systems. This "serendipitous interoperability" is hindered by the lack of an explicit means of describing the capabilities of web pages: what they are able to do, what input they take, and what output they produce [1]. The tremendous success of the WWW is offset by the effort needed to search for and find relevant information. For tabular structures embedded in HTML documents, typical keyword-based or link-analysis-based search fails. The next phase envisioned for the WWW is automatic ad hoc interaction between intelligent agents, web services, databases, and semantic-web-enabled applications. A large amount of the information available on the Web is formatted in HTML tables, which are mainly presentation oriented and not suited to database applications. As a result, capturing the information in HTML tables semantically and integrating the relevant information is a challenge. We envision another layer of web abstraction in which users can query table-like structures inside web documents. Our prototype application is based on WebFusion and the ad hoc query language BioFlow [2], [3], [4], [5], [6], a software agent that can simulate a person interacting with web search forms and extract information from the resulting pages by means of an API. We need to develop a framework that can query web search forms and web page tables in an SQL-like way. In this context we also report a Java-based implementation for integrating the FlyBase and AlignACE sites.
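
The idea of querying a web page table "in a SQL way" can be illustrated with a minimal sketch. It is not the WebFusion or BioFlow implementation, which the abstract does not detail. It assumes the page contains a single, non-nested table whose first row is the header, and it uses Python's standard `html.parser` and `sqlite3` modules as stand-ins. The names `TableExtractor`, `table_to_sqlite`, `page_table`, and the sample gene and motif values are hypothetical.

```python
import sqlite3
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect table rows from an HTML page as lists of cell text."""

    def __init__(self):
        super().__init__()
        self.rows = []     # completed rows
        self._row = None   # row being built, or None outside <tr>
        self._cell = None  # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (e.g. whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def table_to_sqlite(html, table_name="page_table"):
    """Load the extracted rows into an in-memory SQLite table (assumed layout:
    first row is the header, every row has the same number of cells)."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    header, *body = parser.rows
    conn = sqlite3.connect(":memory:")
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(f'INSERT INTO "{table_name}" VALUES ({placeholders})', body)
    return conn


# Hypothetical page content; real sites would need form submission first.
SAMPLE_HTML = """
<table>
  <tr><th>gene</th><th>motif</th><th>score</th></tr>
  <tr><td>geneA</td><td>ACGTGA</td><td>12.5</td></tr>
  <tr><td>geneB</td><td>TTGACA</td><td>7.1</td></tr>
  <tr><td>geneC</td><td>ACGTGA</td><td>9.8</td></tr>
</table>
"""

conn = table_to_sqlite(SAMPLE_HTML)
rows = conn.execute(
    "SELECT gene FROM page_table "
    "WHERE motif = ? AND CAST(score AS REAL) > ? ORDER BY gene",
    ("ACGTGA", 9.0),
).fetchall()
print(rows)
```

Loading every cell as text and casting inside the query keeps the sketch independent of any schema knowledge, which mirrors the presentation-oriented nature of HTML tables noted in the abstract; a capability description for each source could supply proper column types instead.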

