How to improve the statistical power of the 10-fold cross validation scheme in recommender systems

RepSys '13 Pub Date : 2013-10-12 DOI:10.1145/2532508.2532510
A. Košir, Ante Odic, M. Tkalcic
{"title":"How to improve the statistical power of the 10-fold cross validation scheme in recommender systems","authors":"A. Košir, Ante Odic, M. Tkalcic","doi":"10.1145/2532508.2532510","DOIUrl":null,"url":null,"abstract":"At this stage development of recommender systems (RS), an evaluation of competing approaches (methods) yielding similar performances in terms of experiment reproduction is of crucial importance in order to direct the further development toward the most promising direction. These comparisons are usually based on the 10-fold cross validation scheme. Since the compared performances are often similar to each other, the application of statistical significance testing is inevitable in order to not to get misled by randomly caused differences of achieved performances. For the same reason, to reproduce experiments on a different set of experimental data, the most powerful significance testing should be applied. In this work we provide guidelines on how to achieve the highest power in the comparison of RS and we demonstrate them on a comparison of RS performances when different variables are contextualized.","PeriodicalId":398648,"journal":{"name":"RepSys '13","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RepSys '13","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2532508.2532510","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

At this stage development of recommender systems (RS), an evaluation of competing approaches (methods) yielding similar performances in terms of experiment reproduction is of crucial importance in order to direct the further development toward the most promising direction. These comparisons are usually based on the 10-fold cross validation scheme. Since the compared performances are often similar to each other, the application of statistical significance testing is inevitable in order to not to get misled by randomly caused differences of achieved performances. For the same reason, to reproduce experiments on a different set of experimental data, the most powerful significance testing should be applied. In this work we provide guidelines on how to achieve the highest power in the comparison of RS and we demonstrate them on a comparison of RS performances when different variables are contextualized.
如何提高推荐系统中10倍交叉验证方案的统计能力
在推荐系统(RS)发展的这个阶段,对在实验再现方面产生相似性能的竞争方法(方法)进行评估是至关重要的,以便指导进一步发展朝着最有希望的方向发展。这些比较通常基于10倍交叉验证方案。由于比较的性能往往是相似的,为了不被随机产生的性能差异所误导,应用统计显著性检验是不可避免的。出于同样的原因,为了在不同的实验数据集上重现实验,应该使用最强大的显著性检验。在这项工作中,我们提供了关于如何在RS比较中实现最高功率的指导方针,并在不同变量上下文化时对RS性能进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信