Early classification of time series based on uncertainty measure

2019 IEEE Conference on Information and Communication Technology Pub Date : 2019-12-01 DOI:10.1109/CICT48419.2019.9066213

Anshul Sharma, S. Singh

{"title":"Early classification of time series based on uncertainty measure","authors":"Anshul Sharma, S. Singh","doi":"10.1109/CICT48419.2019.9066213","DOIUrl":null,"url":null,"abstract":"The early classification of time series data is a critical problem in many time-sensitive applications such as health informatics. Where the prediction of class value, as early as possible, is highly valuable while preserving the accuracy as on full-length sequence data. For example, early diagnosis can provide better treatment to the patient or even save their lives. The aim of early classification is to analyse the sequence data at each time point continuously and predict the class label when a sufficient amount of data is available. Thus, the decision of early classification is a challenging task that needs to be addressed. Therefore, in this work, we propose an early classification model which relies on a set of probabilistic classifier and a confidence threshold that is measured in term of uncertainty. Formally, our model is divided into two parts. i) Learning phase, define the safeguard point for each class so that it makes sense to predict the label of any sequence with some acceptable accuracy. These safeguard points are identified based on user-defined accuracy. ii) Prediction phase, classify the time series only if the uncertainty of probabilistic output lie under the confidence threshold, that is obtained in the learning phase. We have evaluated our proposed model for 15 UCR datasets and compared with baseline state-of-art methods. Results clearly show that our proposed model is sianificantlv better in term of early classification.","PeriodicalId":234540,"journal":{"name":"2019 IEEE Conference on Information and Communication Technology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Conference on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICT48419.2019.9066213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The early classification of time series data is a critical problem in many time-sensitive applications such as health informatics. Where the prediction of class value, as early as possible, is highly valuable while preserving the accuracy as on full-length sequence data. For example, early diagnosis can provide better treatment to the patient or even save their lives. The aim of early classification is to analyse the sequence data at each time point continuously and predict the class label when a sufficient amount of data is available. Thus, the decision of early classification is a challenging task that needs to be addressed. Therefore, in this work, we propose an early classification model which relies on a set of probabilistic classifier and a confidence threshold that is measured in term of uncertainty. Formally, our model is divided into two parts. i) Learning phase, define the safeguard point for each class so that it makes sense to predict the label of any sequence with some acceptable accuracy. These safeguard points are identified based on user-defined accuracy. ii) Prediction phase, classify the time series only if the uncertainty of probabilistic output lie under the confidence threshold, that is obtained in the learning phase. We have evaluated our proposed model for 15 UCR datasets and compared with baseline state-of-art methods. Results clearly show that our proposed model is sianificantlv better in term of early classification.

查看原文本刊更多论文

基于不确定性测度的时间序列早期分类

时间序列数据的早期分类是许多时间敏感应用(如卫生信息学)中的一个关键问题。其中，在保持全长序列数据精度的同时，尽早预测类值是非常有价值的。例如，早期诊断可以为患者提供更好的治疗，甚至挽救他们的生命。早期分类的目的是连续分析每个时间点的序列数据，并在数据量足够时预测类别标签。因此，早期分类的决策是一项需要解决的具有挑战性的任务。因此，在这项工作中，我们提出了一个早期分类模型，该模型依赖于一组概率分类器和一个根据不确定性测量的置信阈值。形式上，我们的模型分为两部分。i)学习阶段，定义每个类的保障点，以便能够以可接受的精度预测任何序列的标签。这些保障点是根据用户定义的准确性来确定的。ii)预测阶段，只有在学习阶段获得的概率输出的不确定性在置信度阈值以下时，才对时间序列进行分类。我们已经在15个UCR数据集上评估了我们提出的模型，并与最先进的基线方法进行了比较。结果清楚地表明，我们提出的模型在早期分类方面明显更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE Conference on Information and Communication Technology

自引率

0.00%

发文量