Combining Active Learning with Self-train algorithm for classification of multimodal problems

Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis
{"title":"Combining Active Learning with Self-train algorithm for classification of multimodal problems","authors":"Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis","doi":"10.1109/IISA.2019.8900724","DOIUrl":null,"url":null,"abstract":"In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2019.8900724","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.
结合主动学习与自训练算法的多模态问题分类
在现实世界中,处理标记和未标记数据引起了一些数据科学家和机器学习工程师的兴趣,导致了一些应用数据增强方法来实现有效学习行为的演示。尽管他们中的大多数人单独提出了半监督或主动学习方法的利用,但他们的组合并没有被广泛使用。该策略的目标是在一个共同的框架下,依靠自动化方法驱动的决策,有效地利用可用的人类知识。因此,我们对这种组合方法进行了实证评估,涉及到在基于池的场景下运行的多模态数据的三个问题:性别识别、攻击性语言识别和情绪检测。在我们提出的学习框架中,我们的结果证明了采用这种半自动方法在预测正确性和减少时间和资源消耗方面的好处,以及学习收敛的平滑性,主要使用集成分类器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信