A Self-Healing Approach for a Domain-Specific Deep Web Search Tool

2010 IEEE International Conference on BioInformatics and BioEngineering Pub Date : 2010-05-31 DOI:10.1109/BIBE.2010.13

Fan Wang, G. Agrawal

{"title":"A Self-Healing Approach for a Domain-Specific Deep Web Search Tool","authors":"Fan Wang, G. Agrawal","doi":"10.1109/BIBE.2010.13","DOIUrl":null,"url":null,"abstract":"Nowadays, a large part of the online biological data resides in the deep web. Lately, there have been several efforts focusing on integrating and providing search functionality for biological deep web data sources. Such systems often require data access involving a large number of remote data sources and the use of various communication links. Both the servers and networking links are vulnerable to congestion and failures. This can lead to an unpredictable unavailability or inaccessibility, which can disrupt access to the information. In this paper, we propose a solution to maintain query processing capability of an integrated biological deep web search system in the presence of unavailable or inaccessible data sources. Our solution involves dynamically adapting query processing when unexpected data source unavailability or inaccessibility is detected. We exploit the data redundancy that is found across biological deep web data sources. We incrementally generate a partial new query plan by bringing in new data sources that were not in the original query plan to replace the subplan that became inaccessible.","PeriodicalId":330904,"journal":{"name":"2010 IEEE International Conference on BioInformatics and BioEngineering","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on BioInformatics and BioEngineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2010.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Nowadays, a large part of the online biological data resides in the deep web. Lately, there have been several efforts focusing on integrating and providing search functionality for biological deep web data sources. Such systems often require data access involving a large number of remote data sources and the use of various communication links. Both the servers and networking links are vulnerable to congestion and failures. This can lead to an unpredictable unavailability or inaccessibility, which can disrupt access to the information. In this paper, we propose a solution to maintain query processing capability of an integrated biological deep web search system in the presence of unavailable or inaccessible data sources. Our solution involves dynamically adapting query processing when unexpected data source unavailability or inaccessibility is detected. We exploit the data redundancy that is found across biological deep web data sources. We incrementally generate a partial new query plan by bringing in new data sources that were not in the original query plan to replace the subplan that became inaccessible.

查看原文本刊更多论文

特定领域深度网络搜索工具的自修复方法

如今，很大一部分在线生物数据驻留在深层网络中。最近，已经有一些努力集中在集成和提供生物深度网络数据源的搜索功能上。这种系统通常需要涉及大量远程数据源和使用各种通信链路的数据访问。服务器和网络链路都容易出现拥塞和故障。这可能导致不可预测的不可用性或不可访问性，从而中断对信息的访问。在本文中，我们提出了一种在数据源不可用或不可访问的情况下保持集成生物深度网络搜索系统查询处理能力的解决方案。我们的解决方案涉及在检测到意外数据源不可用或不可访问时动态调整查询处理。我们利用了在生物深层网络数据源中发现的数据冗余。我们通过引入不在原始查询计划中的新数据源，以增量方式生成部分新查询计划，以替换无法访问的子计划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE International Conference on BioInformatics and BioEngineering

自引率

0.00%

发文量