Bake off redux: a review and experimental evaluation of recent time series classification algorithms

IF 4.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Data Mining and Knowledge Discovery Pub Date : 2024-04-19 DOI:10.1007/s10618-024-01022-1

Matthew Middlehurst, Patrick Schäfer, Anthony Bagnall

{"title":"Bake off redux: a review and experimental evaluation of recent time series classification algorithms","authors":"Matthew Middlehurst, Patrick Schäfer, Anthony Bagnall","doi":"10.1007/s10618-024-01022-1","DOIUrl":null,"url":null,"abstract":"<p>In 2017, a research paper (Bagnall et al. Data Mining and Knowledge Discovery 31(3):606-660. 2017) compared 18 Time Series Classification (TSC) algorithms on 85 datasets from the University of California, Riverside (UCR) archive. This study, commonly referred to as a ‘bake off’, identified that only nine algorithms performed significantly better than the Dynamic Time Warping (DTW) and Rotation Forest benchmarks that were used. The study categorised each algorithm by the type of feature they extract from time series data, forming a taxonomy of five main algorithm types. This categorisation of algorithms alongside the provision of code and accessible results for reproducibility has helped fuel an increase in popularity of the TSC field. Over six years have passed since this bake off, the UCR archive has expanded to 112 datasets and there have been a large number of new algorithms proposed. We revisit the bake off, seeing how each of the proposed categories have advanced since the original publication, and evaluate the performance of newer algorithms against the previous best-of-category using an expanded UCR archive. We extend the taxonomy to include three new categories to reflect recent developments. Alongside the originally proposed distance, interval, shapelet, dictionary and hybrid based algorithms, we compare newer convolution and feature based algorithms as well as deep learning approaches. We introduce 30 classification datasets either recently donated to the archive or reformatted to the TSC format, and use these to further evaluate the best performing algorithm from each category. Overall, we find that two recently proposed algorithms, MultiROCKET+Hydra (Dempster et al. 2022) and HIVE-COTEv2 (Middlehurst et al. Mach Learn 110:3211-3243. 2021), perform significantly better than other approaches on both the current and new TSC problems.</p>","PeriodicalId":55183,"journal":{"name":"Data Mining and Knowledge Discovery","volume":"54 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Mining and Knowledge Discovery","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10618-024-01022-1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In 2017, a research paper (Bagnall et al. Data Mining and Knowledge Discovery 31(3):606-660. 2017) compared 18 Time Series Classification (TSC) algorithms on 85 datasets from the University of California, Riverside (UCR) archive. This study, commonly referred to as a ‘bake off’, identified that only nine algorithms performed significantly better than the Dynamic Time Warping (DTW) and Rotation Forest benchmarks that were used. The study categorised each algorithm by the type of feature they extract from time series data, forming a taxonomy of five main algorithm types. This categorisation of algorithms alongside the provision of code and accessible results for reproducibility has helped fuel an increase in popularity of the TSC field. Over six years have passed since this bake off, the UCR archive has expanded to 112 datasets and there have been a large number of new algorithms proposed. We revisit the bake off, seeing how each of the proposed categories have advanced since the original publication, and evaluate the performance of newer algorithms against the previous best-of-category using an expanded UCR archive. We extend the taxonomy to include three new categories to reflect recent developments. Alongside the originally proposed distance, interval, shapelet, dictionary and hybrid based algorithms, we compare newer convolution and feature based algorithms as well as deep learning approaches. We introduce 30 classification datasets either recently donated to the archive or reformatted to the TSC format, and use these to further evaluate the best performing algorithm from each category. Overall, we find that two recently proposed algorithms, MultiROCKET+Hydra (Dempster et al. 2022) and HIVE-COTEv2 (Middlehurst et al. Mach Learn 110:3211-3243. 2021), perform significantly better than other approaches on both the current and new TSC problems.

Abstract Image

查看原文本刊更多论文

烘焙大赛再现：近期时间序列分类算法回顾与实验评估

2017 年，一篇研究论文（Bagnall et al. Data Mining and Knowledge Discovery 31(3):606-660.2017）在加州大学河滨分校（UCR）档案馆的 85 个数据集上比较了 18 种时间序列分类（TSC）算法。这项通常被称为 "烘烤 "的研究发现，只有九种算法的性能明显优于所使用的动态时间扭曲（DTW）和旋转森林基准。该研究按照从时间序列数据中提取特征的类型对每种算法进行了分类，形成了五种主要算法类型的分类法。对算法进行分类，同时提供代码和可访问的结果，以实现可重复性，这有助于提高 TSC 领域的受欢迎程度。自这次竞赛以来，六年多过去了，UCR 档案已扩展到 112 个数据集，并提出了大量新算法。我们重温了这次评选活动，看看自最初发表以来，所提出的每个分类是如何发展的，并利用扩充的 UCR 档案，对照以前的最佳分类，评估新算法的性能。我们扩展了分类法，增加了三个新类别，以反映最新发展。除了最初提出的基于距离、区间、小形、字典和混合的算法外，我们还比较了较新的基于卷积和特征的算法以及深度学习方法。我们引入了 30 个分类数据集，这些数据集要么是最近捐赠给档案馆的，要么是重新格式化为 TSC 格式的，我们将利用这些数据集进一步评估每个类别中性能最佳的算法。总体而言，我们发现最近提出的两种算法--MultiROCKET+Hydra (Dempster et al. 2022) 和 HIVE-COTEv2 (Middlehurst et al. Mach Learn 110:3211-3243. 2021)，在当前和新的 TSC 问题上的表现都明显优于其他方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Data Mining and Knowledge Discovery 工程技术-计算机：人工智能

CiteScore

10.40

自引率

4.20%

发文量

审稿时长

10 months

期刊介绍： Advances in data gathering, storage, and distribution have created a need for computational tools and techniques to aid in data analysis. Data Mining and Knowledge Discovery in Databases (KDD) is a rapidly growing area of research and application that builds on techniques and theories from many fields, including statistics, databases, pattern recognition and learning, data visualization, uncertainty modelling, data warehousing and OLAP, optimization, and high performance computing.