Enhancing automatic extraction of top-k list from web

International Conference for Convergence for Technology-2014 Pub Date : 2014-04-06 DOI:10.1109/I2CT.2014.7092331

D. Patil, N. Dhawas

引用次数: 0

Abstract

Now a day's World Wide Web is considered as biggest resource of information. This large database which contains information in all area but finding particular information or extracting accurate data from web is difficult. The strong reason behind this sentence is that the data available on this huge database is not in same format. When data is in particular format you can extract information without any difficulty when extract data from HTML pages, we select data easily with the help of tags. This paper is extracting top-k list from all available web database which contain data either in structured or unstructured format. An algorithm is implemented for this reason which provides an accurate and faster generation of top-k list.

查看原文本刊更多论文

增强从web中自动提取top-k列表

现在每天的万维网被认为是最大的信息资源。这个庞大的数据库包含了所有领域的信息，但从网络中找到特定的信息或提取准确的数据是困难的。这句话背后的有力理由是，这个庞大的数据库中可用的数据格式不同。当数据是特定格式时，您可以毫不费力地提取信息;当从HTML页面提取数据时，我们可以借助标记轻松地选择数据。本文从所有可用的web数据库中提取top-k列表，其中包含结构化和非结构化格式的数据。为此实现了一种算法，该算法提供了准确且快速的top-k列表生成。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference for Convergence for Technology-2014

自引率

0.00%

发文量