Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis
{"title":"Combining Active Learning with Self-train algorithm for classification of multimodal problems","authors":"Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis","doi":"10.1109/IISA.2019.8900724","DOIUrl":null,"url":null,"abstract":"In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2019.8900724","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.