How to improve the statistical power of the 10-fold cross validation scheme in recommender systems

RepSys '13 Pub Date : 2013-10-12 DOI:10.1145/2532508.2532510

A. Košir, Ante Odic, M. Tkalcic

引用次数: 12

Abstract

At this stage development of recommender systems (RS), an evaluation of competing approaches (methods) yielding similar performances in terms of experiment reproduction is of crucial importance in order to direct the further development toward the most promising direction. These comparisons are usually based on the 10-fold cross validation scheme. Since the compared performances are often similar to each other, the application of statistical significance testing is inevitable in order to not to get misled by randomly caused differences of achieved performances. For the same reason, to reproduce experiments on a different set of experimental data, the most powerful significance testing should be applied. In this work we provide guidelines on how to achieve the highest power in the comparison of RS and we demonstrate them on a comparison of RS performances when different variables are contextualized.

查看原文本刊更多论文

如何提高推荐系统中10倍交叉验证方案的统计能力

在推荐系统(RS)发展的这个阶段，对在实验再现方面产生相似性能的竞争方法(方法)进行评估是至关重要的，以便指导进一步发展朝着最有希望的方向发展。这些比较通常基于10倍交叉验证方案。由于比较的性能往往是相似的，为了不被随机产生的性能差异所误导，应用统计显著性检验是不可避免的。出于同样的原因，为了在不同的实验数据集上重现实验，应该使用最强大的显著性检验。在这项工作中，我们提供了关于如何在RS比较中实现最高功率的指导方针，并在不同变量上下文化时对RS性能进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

RepSys '13

自引率

0.00%

发文量