{"title":"Speeding up the SUCCESS Approach for Massive Industrial Datasets","authors":"Krisztián Búza, Aleksandra Revina","doi":"10.1109/INISTA49547.2020.9194656","DOIUrl":null,"url":null,"abstract":"In many applications, it may be expensive, difficult or even impossible to obtain class labels for a large amount of the instances, therefore, the labeled data may not be representative. Semi-supervised learning aims to alleviate this problem by using both labeled and unlabeled data. Recently, we introduced the SUCCESS approach for semi-supervised classification of time series. Although SUCCESS achieved promising results, its ability to classify massive, industrial datasets has not been studied yet. In this paper, we aim to fill this gap: we propose a simple but effective method to speed up SUCCESS without loss of its accuracy. We evaluate the resulting approach on the classification of both publicly available and industrial datasets. Hence, we expect the increase of interest in the algorithm both in industry and the research community.","PeriodicalId":124632,"journal":{"name":"2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INISTA49547.2020.9194656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In many applications, it may be expensive, difficult or even impossible to obtain class labels for a large amount of the instances, therefore, the labeled data may not be representative. Semi-supervised learning aims to alleviate this problem by using both labeled and unlabeled data. Recently, we introduced the SUCCESS approach for semi-supervised classification of time series. Although SUCCESS achieved promising results, its ability to classify massive, industrial datasets has not been studied yet. In this paper, we aim to fill this gap: we propose a simple but effective method to speed up SUCCESS without loss of its accuracy. We evaluate the resulting approach on the classification of both publicly available and industrial datasets. Hence, we expect the increase of interest in the algorithm both in industry and the research community.