SynTiSeD – Synthetic Time Series Data Generator

Michael Meiser, Benjamin Duppe, I. Zinnikus
{"title":"SynTiSeD – Synthetic Time Series Data Generator","authors":"Michael Meiser, Benjamin Duppe, I. Zinnikus","doi":"10.1109/MSCPES58582.2023.10123429","DOIUrl":null,"url":null,"abstract":"Recently, an increasing number of Artificial Intelligence services have been developed for a variety of domains. Machine Learning and especially Deep Learning services require a large amount of data to provide their functionality. Since data collection is typically complex and difficult, there is often not enough data available. Machine learning services such as anomaly detection or disaggregation algorithms are also being developed in the smart living domain. In practice, however, only a few energy datasets are publicly available, as the collection of such data is expensive and time-consuming due to the equipment required. One way to generate more smart meter data is to use a simulation. Developing such a simulation that is capable of generating meaningful data is a complex task. Therefore, in this paper, we present the Synthetic Time Series Data Generator (SynTiSeD), a multi-agent-based simulation tool that generates meaningful synthetic energy data based on real-world data. Furthermore, SynTiSeD allows generating data of critical situations, which are important for the development of such services, but which cannot be provoked in the real world. For transferability, we demonstrate that Nonintrusive Load Monitoring algorithms trained on synthetic data generated by SynTiSeD provide meaningful results that are even better than those of models trained on real data.","PeriodicalId":162383,"journal":{"name":"2023 11th Workshop on Modelling and Simulation of Cyber-Physical Energy Systems (MSCPES)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 11th Workshop on Modelling and Simulation of Cyber-Physical Energy Systems (MSCPES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSCPES58582.2023.10123429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Recently, an increasing number of Artificial Intelligence services have been developed for a variety of domains. Machine Learning and especially Deep Learning services require a large amount of data to provide their functionality. Since data collection is typically complex and difficult, there is often not enough data available. Machine learning services such as anomaly detection or disaggregation algorithms are also being developed in the smart living domain. In practice, however, only a few energy datasets are publicly available, as the collection of such data is expensive and time-consuming due to the equipment required. One way to generate more smart meter data is to use a simulation. Developing such a simulation that is capable of generating meaningful data is a complex task. Therefore, in this paper, we present the Synthetic Time Series Data Generator (SynTiSeD), a multi-agent-based simulation tool that generates meaningful synthetic energy data based on real-world data. Furthermore, SynTiSeD allows generating data of critical situations, which are important for the development of such services, but which cannot be provoked in the real world. For transferability, we demonstrate that Nonintrusive Load Monitoring algorithms trained on synthetic data generated by SynTiSeD provide meaningful results that are even better than those of models trained on real data.
SynTiSeD -合成时间序列数据生成器
近年来,越来越多的人工智能服务被开发用于各个领域。机器学习,尤其是深度学习服务需要大量的数据来提供它们的功能。由于数据收集通常是复杂和困难的,因此通常没有足够的数据可用。异常检测或分解算法等机器学习服务也正在智能生活领域得到开发。然而,在实践中,只有少数能源数据集是公开可用的,因为这些数据的收集是昂贵的和耗时的,由于所需的设备。生成更多智能电表数据的一种方法是使用模拟。开发这样一个能够生成有意义数据的模拟是一项复杂的任务。因此,在本文中,我们提出了合成时间序列数据生成器(SynTiSeD),这是一种基于多代理的仿真工具,可以根据现实世界的数据生成有意义的合成能源数据。此外,SynTiSeD允许生成关键情况的数据,这对此类服务的开发很重要,但在现实世界中无法激发。对于可移植性,我们证明了在SynTiSeD生成的合成数据上训练的非侵入式负载监控算法提供了有意义的结果,甚至比在真实数据上训练的模型更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信