Ontology Learning Through Focused Crawling and Information Extraction

2009 International Conference on Knowledge and Systems Engineering Pub Date : 2009-10-13 DOI:10.1109/KSE.2009.28

H. Luong, Susan Gauch, Qiang Wang

引用次数: 13

Abstract

Ontology learning aims to facilitate the construction of ontologies by decreasing the amount of effort required to produce an ontology for a new domain. However, there are few studies that attempt to automate the entire ontology learning process from the collection of domain-specific literature, to text mining to build new ontologies or enrich existing ones. In this paper, we present a complete framework for ontology learning that enables us to retrieve documents from the Web using focused crawling, and then use a SVM (Support Vector Machine) classifier to identify domain-specific documents and perform text mining in order to extract useful information for the ontology enrichment process. We have carried out several experiments on components of this framework in a biological domain, amphibian morphology. This paper reports on the overall system architecture and our initial experiments on information extraction using text mining techniques to enrich the domain ontology.

查看原文本刊更多论文

聚焦爬行和信息提取的本体学习

本体学习旨在通过减少为新领域生成本体所需的工作量来促进本体的构建。然而，很少有研究试图自动化整个本体学习过程，从特定领域文献的收集，到文本挖掘，以建立新的本体或丰富现有的本体。在本文中，我们提出了一个完整的本体学习框架，使我们能够使用集中爬行从Web检索文档，然后使用SVM(支持向量机)分类器识别特定领域的文档并进行文本挖掘，以便为本体丰富过程提取有用的信息。我们在生物领域两栖动物形态学中对这个框架的组成部分进行了几次实验。本文报告了整个系统架构和我们使用文本挖掘技术来丰富领域本体的信息提取的初步实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 International Conference on Knowledge and Systems Engineering

自引率

0.00%

发文量