Impact of Data Model on Performance of Time Series Database for Internet of Things Applications

S. Rinaldi, Federico Bonafini, P. Ferrari, A. Flammini, E. Sisinni, D. Bianchini
{"title":"Impact of Data Model on Performance of Time Series Database for Internet of Things Applications","authors":"S. Rinaldi, Federico Bonafini, P. Ferrari, A. Flammini, E. Sisinni, D. Bianchini","doi":"10.1109/I2MTC.2019.8827164","DOIUrl":null,"url":null,"abstract":"The Internet of Things (IoT) paradigm is gaining interest in several application fields, from medical devices to smart building and industrial automation. Such a success is due to the flexibility and interoperability between different application domains: the possibility to vertically share data among applications is the winning point of this technology. IoT sensors installed on the field generate a large amount of data, which have to be stored somewhere for subsequent analysis. Database technologies are experiencing a deep transformation to be able to handle these data streams. The recent trend is a transition from relational to non-relational databases. Among the latter, the Time Series Databases (TSDBs) seem to be the solution for storing large amount of time series data generated by IoT applications. Although these solutions are optimized to handle thousands of parallel data streams from IoT sensors, the performance of data extraction could not be compatible with some applications. The target of the paper is to investigate the impact that different metadata could have over the data extraction performance in TSDBs. A dedicated testing procedure has been configured for evaluating InfluxDB, one of the most effective and widespread TSDBs. The performance analysis, carried out on a specific use case, demonstrated that the database write and read performance can be significantly affected by the used data model, with queries executed on the same data requiring times from hundreds of ms to seconds in the worst cases.","PeriodicalId":132588,"journal":{"name":"2019 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2MTC.2019.8827164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

The Internet of Things (IoT) paradigm is gaining interest in several application fields, from medical devices to smart building and industrial automation. Such a success is due to the flexibility and interoperability between different application domains: the possibility to vertically share data among applications is the winning point of this technology. IoT sensors installed on the field generate a large amount of data, which have to be stored somewhere for subsequent analysis. Database technologies are experiencing a deep transformation to be able to handle these data streams. The recent trend is a transition from relational to non-relational databases. Among the latter, the Time Series Databases (TSDBs) seem to be the solution for storing large amount of time series data generated by IoT applications. Although these solutions are optimized to handle thousands of parallel data streams from IoT sensors, the performance of data extraction could not be compatible with some applications. The target of the paper is to investigate the impact that different metadata could have over the data extraction performance in TSDBs. A dedicated testing procedure has been configured for evaluating InfluxDB, one of the most effective and widespread TSDBs. The performance analysis, carried out on a specific use case, demonstrated that the database write and read performance can be significantly affected by the used data model, with queries executed on the same data requiring times from hundreds of ms to seconds in the worst cases.
物联网应用中数据模型对时间序列数据库性能的影响
物联网(IoT)范式正在从医疗设备到智能建筑和工业自动化等多个应用领域引起人们的兴趣。这样的成功是由于不同应用程序域之间的灵活性和互操作性:在应用程序之间垂直共享数据的可能性是该技术的制胜点。安装在现场的物联网传感器会产生大量数据,这些数据必须存储在某个地方以供后续分析。数据库技术正在经历一场深刻的变革,以便能够处理这些数据流。最近的趋势是从关系数据库到非关系数据库的过渡。在后者中,时间序列数据库(tsdb)似乎是存储物联网应用产生的大量时间序列数据的解决方案。尽管这些解决方案经过优化,可以处理来自物联网传感器的数千个并行数据流,但数据提取的性能可能与某些应用程序不兼容。本文的目标是研究不同的元数据对tsdb中数据提取性能的影响。已经配置了一个专门的测试过程来评估InfluxDB,它是最有效和最广泛的tsdb之一。对一个特定用例进行的性能分析表明,数据库的写和读性能会受到所使用的数据模型的显著影响,在最坏的情况下,对相同数据执行查询需要数百毫秒到几秒钟的时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信