Combined Active and Semi-supervised Learning Using Particle Walking Temporal Dynamics

2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence Pub Date : 2013-09-08 DOI:10.1109/BRICS-CCI-CBIC.2013.14

Fabricio A. Breve

{"title":"Combined Active and Semi-supervised Learning Using Particle Walking Temporal Dynamics","authors":"Fabricio A. Breve","doi":"10.1109/BRICS-CCI-CBIC.2013.14","DOIUrl":null,"url":null,"abstract":"Both Semi-Supervised Leaning and Active Learning are techniques used when unlabeled data is abundant, but the process of labeling them is expensive and/or time consuming. In this paper, those two machine learning techniques are combined into a single nature-inspired method. It features particles walking on a network built from the data set, using a unique random-greedy rule to select neighbors to visit. The particles, which have both competitive and cooperative behavior, are created on the network as the result of label queries. They may be created as the algorithm executes and only nodes affected by the new particles have to be updated. Therefore, it saves execution time compared to traditional active learning frameworks, in which the learning algorithm has to be executed several times. The data items to be queried are select based on information extracted from the nodes and particles temporal dynamics. Two different rules for queries are explored in this paper, one of them is based on querying by uncertainty approaches and the other is based on data and labeled nodes distribution. Each of them may perform better than the other according to some data sets peculiarities. Experimental results on some real-world data sets are provided, and the proposed method outperforms the semi-supervised learning method, from which it is derived, in all of them.","PeriodicalId":306195,"journal":{"name":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRICS-CCI-CBIC.2013.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Both Semi-Supervised Leaning and Active Learning are techniques used when unlabeled data is abundant, but the process of labeling them is expensive and/or time consuming. In this paper, those two machine learning techniques are combined into a single nature-inspired method. It features particles walking on a network built from the data set, using a unique random-greedy rule to select neighbors to visit. The particles, which have both competitive and cooperative behavior, are created on the network as the result of label queries. They may be created as the algorithm executes and only nodes affected by the new particles have to be updated. Therefore, it saves execution time compared to traditional active learning frameworks, in which the learning algorithm has to be executed several times. The data items to be queried are select based on information extracted from the nodes and particles temporal dynamics. Two different rules for queries are explored in this paper, one of them is based on querying by uncertainty approaches and the other is based on data and labeled nodes distribution. Each of them may perform better than the other according to some data sets peculiarities. Experimental results on some real-world data sets are provided, and the proposed method outperforms the semi-supervised learning method, from which it is derived, in all of them.

查看原文本刊更多论文

结合主动和半监督学习的粒子行走时间动力学

半监督学习和主动学习都是在未标记数据丰富时使用的技术，但标记它们的过程昂贵且/或耗时。在本文中，这两种机器学习技术被结合成一个单一的自然启发方法。它的特点是粒子在一个由数据集构建的网络上行走，使用一个独特的随机贪婪规则来选择要访问的邻居。同时具有竞争行为和合作行为的粒子作为标签查询的结果在网络上产生。它们可以在算法执行时创建，只有受新粒子影响的节点需要更新。因此，与传统的主动学习框架相比，它节省了执行时间，在传统的主动学习框架中，学习算法必须执行多次。根据从节点和粒子的时间动态中提取的信息选择要查询的数据项。本文探讨了两种不同的查询规则，一种是基于不确定性方法的查询，另一种是基于数据和标记节点分布的查询。根据某些数据集的特性，它们中的每一个都可能比另一个表现得更好。在一些实际数据集上的实验结果表明，所提出的方法在所有这些数据集上都优于其衍生的半监督学习方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence

自引率

0.00%

发文量