Early classification of time series based on uncertainty measure

Anshul Sharma, S. Singh
{"title":"Early classification of time series based on uncertainty measure","authors":"Anshul Sharma, S. Singh","doi":"10.1109/CICT48419.2019.9066213","DOIUrl":null,"url":null,"abstract":"The early classification of time series data is a critical problem in many time-sensitive applications such as health informatics. Where the prediction of class value, as early as possible, is highly valuable while preserving the accuracy as on full-length sequence data. For example, early diagnosis can provide better treatment to the patient or even save their lives. The aim of early classification is to analyse the sequence data at each time point continuously and predict the class label when a sufficient amount of data is available. Thus, the decision of early classification is a challenging task that needs to be addressed. Therefore, in this work, we propose an early classification model which relies on a set of probabilistic classifier and a confidence threshold that is measured in term of uncertainty. Formally, our model is divided into two parts. i) Learning phase, define the safeguard point for each class so that it makes sense to predict the label of any sequence with some acceptable accuracy. These safeguard points are identified based on user-defined accuracy. ii) Prediction phase, classify the time series only if the uncertainty of probabilistic output lie under the confidence threshold, that is obtained in the learning phase. We have evaluated our proposed model for 15 UCR datasets and compared with baseline state-of-art methods. Results clearly show that our proposed model is sianificantlv better in term of early classification.","PeriodicalId":234540,"journal":{"name":"2019 IEEE Conference on Information and Communication Technology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Conference on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICT48419.2019.9066213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The early classification of time series data is a critical problem in many time-sensitive applications such as health informatics. Where the prediction of class value, as early as possible, is highly valuable while preserving the accuracy as on full-length sequence data. For example, early diagnosis can provide better treatment to the patient or even save their lives. The aim of early classification is to analyse the sequence data at each time point continuously and predict the class label when a sufficient amount of data is available. Thus, the decision of early classification is a challenging task that needs to be addressed. Therefore, in this work, we propose an early classification model which relies on a set of probabilistic classifier and a confidence threshold that is measured in term of uncertainty. Formally, our model is divided into two parts. i) Learning phase, define the safeguard point for each class so that it makes sense to predict the label of any sequence with some acceptable accuracy. These safeguard points are identified based on user-defined accuracy. ii) Prediction phase, classify the time series only if the uncertainty of probabilistic output lie under the confidence threshold, that is obtained in the learning phase. We have evaluated our proposed model for 15 UCR datasets and compared with baseline state-of-art methods. Results clearly show that our proposed model is sianificantlv better in term of early classification.
基于不确定性测度的时间序列早期分类
时间序列数据的早期分类是许多时间敏感应用(如卫生信息学)中的一个关键问题。其中,在保持全长序列数据精度的同时,尽早预测类值是非常有价值的。例如,早期诊断可以为患者提供更好的治疗,甚至挽救他们的生命。早期分类的目的是连续分析每个时间点的序列数据,并在数据量足够时预测类别标签。因此,早期分类的决策是一项需要解决的具有挑战性的任务。因此,在这项工作中,我们提出了一个早期分类模型,该模型依赖于一组概率分类器和一个根据不确定性测量的置信阈值。形式上,我们的模型分为两部分。i)学习阶段,定义每个类的保障点,以便能够以可接受的精度预测任何序列的标签。这些保障点是根据用户定义的准确性来确定的。ii)预测阶段,只有在学习阶段获得的概率输出的不确定性在置信度阈值以下时,才对时间序列进行分类。我们已经在15个UCR数据集上评估了我们提出的模型,并与最先进的基线方法进行了比较。结果清楚地表明,我们提出的模型在早期分类方面明显更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信