KnowItNow

Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05
Pub Date: 1900-01-01
DOI: 10.3115/1220575.1220646

Michael J. Cafarella, Doug Downey, S. Soderland, Oren Etzioni

Citations: 142

Abstract

Numerous NLP applications rely on search-engine queries, both to extract information from and to compute statistics over the Web corpus. But search engines often limit the number of available queries. As a result, query-intensive NLP applications such as Information Extraction (IE) distribute their query load over several days, making IE a slow, offline process. This paper introduces a novel architecture for IE that obviates queries to commercial search engines. The architecture is embodied in a system called KnowItNow that performs high-precision IE in minutes instead of days. We compare KnowItNow experimentally with the previously published KnowItAll system, and quantify the tradeoff between recall and speed. KnowItNow's extraction rate is two to three orders of magnitude higher than KnowItAll's.
