Detecting concept drift in fully distributed environments

István Hegedüs, L. Nyers, Róbert Ormándi
{"title":"Detecting concept drift in fully distributed environments","authors":"István Hegedüs, L. Nyers, Róbert Ormándi","doi":"10.1109/SISY.2012.6339511","DOIUrl":null,"url":null,"abstract":"Applying sophisticated machine learning techniques on fully distributed data is increasingly important in many applications like distributed recommender systems or spam filters. In this type of networked environment the data model can change dynamically over time (concept drift). Identifying when concept drift occurred is a key for several drift handling techniques and important in numerous scenarios. However drift handling approaches exist, no efficient solution for detecting the drift is known in very large scale networks. Here, we propose an approach that can detect the concept drift in large scale and fully distributed networks. In our approach, the learning is performed by applying online learners that take random walks in the network while updating themselves using the samples available at the nodes. The drift detection is based on an adaptive mechanism which uses the historical performances of the models. Through empirical evaluations we demonstrate that our approach handles the drifting concept while additionally detects the occurrence of the concept drift with high accuracy.","PeriodicalId":207630,"journal":{"name":"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SISY.2012.6339511","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Applying sophisticated machine learning techniques on fully distributed data is increasingly important in many applications like distributed recommender systems or spam filters. In this type of networked environment the data model can change dynamically over time (concept drift). Identifying when concept drift occurred is a key for several drift handling techniques and important in numerous scenarios. However drift handling approaches exist, no efficient solution for detecting the drift is known in very large scale networks. Here, we propose an approach that can detect the concept drift in large scale and fully distributed networks. In our approach, the learning is performed by applying online learners that take random walks in the network while updating themselves using the samples available at the nodes. The drift detection is based on an adaptive mechanism which uses the historical performances of the models. Through empirical evaluations we demonstrate that our approach handles the drifting concept while additionally detects the occurrence of the concept drift with high accuracy.
在完全分布的环境中检测概念漂移
在分布式推荐系统或垃圾邮件过滤器等许多应用中,在完全分布式数据上应用复杂的机器学习技术变得越来越重要。在这种类型的网络环境中,数据模型可以随时间动态变化(概念漂移)。确定何时发生概念漂移是几种漂移处理技术的关键,在许多场景中都很重要。尽管存在漂移处理方法,但在超大规模网络中还没有有效的漂移检测方法。在这里,我们提出了一种可以在大规模和全分布式网络中检测概念漂移的方法。在我们的方法中,学习是通过应用在线学习者来执行的,这些学习者在网络中随机行走,同时使用节点上可用的样本更新自己。漂移检测是基于一种利用模型历史性能的自适应机制。通过实证评估,我们证明了我们的方法在处理漂移概念的同时,还能高精度地检测概念漂移的发生。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信