Sungmin Lee , Kichang Lee , JaeYeon Park , JeongGil Ko
{"title":"GMT: Gzip-based Memory-efficient Time-series classification","authors":"Sungmin Lee , Kichang Lee , JaeYeon Park , JeongGil Ko","doi":"10.1016/j.icte.2024.12.003","DOIUrl":null,"url":null,"abstract":"<div><div>The deployment of embedded time-series sensing devices enabled better understanding of user environments and contexts. However, classifying them solely on extremely limited devices under data-scarce conditions is still a remaining challenge. We introduce <em>GMT</em>, a memory-efficient parameter-free classifier that uses <em>gzip</em> compressor and <em>k</em>-nearest neighbors (<em>k</em>NN) for classifying multi-channel time-series data. <em>GMT</em> tackles issues due to high data fidelity, multi-channel characteristics, and numerical properties of sensor data using techniques such as <em>floating point quantization</em>, <em>channel-wise compression</em>, and <em>hybrid distance</em>. Experiments show that <em>GMT</em> provides superior accuracy and memory efficiency compared to other classifiers across various tasks and applications.</div></div>","PeriodicalId":48526,"journal":{"name":"ICT Express","volume":"11 2","pages":"Pages 270-274"},"PeriodicalIF":4.1000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICT Express","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405959524001528","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The deployment of embedded time-series sensing devices enabled better understanding of user environments and contexts. However, classifying them solely on extremely limited devices under data-scarce conditions is still a remaining challenge. We introduce GMT, a memory-efficient parameter-free classifier that uses gzip compressor and k-nearest neighbors (kNN) for classifying multi-channel time-series data. GMT tackles issues due to high data fidelity, multi-channel characteristics, and numerical properties of sensor data using techniques such as floating point quantization, channel-wise compression, and hybrid distance. Experiments show that GMT provides superior accuracy and memory efficiency compared to other classifiers across various tasks and applications.
期刊介绍:
The ICT Express journal published by the Korean Institute of Communications and Information Sciences (KICS) is an international, peer-reviewed research publication covering all aspects of information and communication technology. The journal aims to publish research that helps advance the theoretical and practical understanding of ICT convergence, platform technologies, communication networks, and device technologies. The technology advancement in information and communication technology (ICT) sector enables portable devices to be always connected while supporting high data rate, resulting in the recent popularity of smartphones that have a considerable impact in economic and social development.