A comparison of supervised learning classifiers for link discovery

Joint Conference on Lexical and Computational Semantics Pub Date : 2014-09-04 DOI:10.1145/2660517.2660532

Tommaso Soru, A. N. Ngomo

引用次数: 26

Abstract

The detection of links between resources is intrinsic to the vision of the Linked Data Web. Due to the mere size of current knowledge bases, this task is commonly addressed by using tools. In particular, manifold link discovery frameworks have been developed. These frameworks implement several different machine-learning approaches to discovering links. In this paper, we investigate which of the commonly used supervised machine-learning classifiers performs best on the link discovery task. To this end, we first present our evaluation pipeline. Then, we compare ten different approaches on three artificial and three real-world benchmark data sets. The classification outcomes are subsequently compared with several state-of-the-art frameworks. Our results suggest that while several algorithms perform well, multilayer perceptrons perform best on average. Moreover, logistic regression seems best suited for noisy data.

查看原文本刊更多论文

链接发现的监督学习分类器的比较

资源之间链接的检测是关联数据Web的内在愿景。由于当前知识库的大小，这个任务通常是通过使用工具来解决的。特别是，已经开发了流形链接发现框架。这些框架实现了几种不同的机器学习方法来发现链接。在本文中，我们研究了哪种常用的监督机器学习分类器在链接发现任务上表现最好。为此，我们首先提出我们的评估管道。然后，我们在三个人工和三个现实世界的基准数据集上比较了十种不同的方法。分类结果随后与几个最先进的框架进行比较。我们的结果表明，虽然有几种算法表现良好，但多层感知器的平均表现最好。此外，逻辑回归似乎最适合于有噪声的数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Joint Conference on Lexical and Computational Semantics

自引率

0.00%

发文量