基于递归神经网络的自编码器解决电力设施时间序列自动分析问题

IF 0.3 Q4 ENERGY & FUELS

Problemele Energeticii Regionale Pub Date : 2023-05-01 DOI:10.52254/1857-0070.2023.2-58-06

P. Matrenin, A. Khalyasmaa, Y. V. Potachits

{"title":"基于递归神经网络的自编码器解决电力设施时间序列自动分析问题","authors":"P. Matrenin, A. Khalyasmaa, Y. V. Potachits","doi":"10.52254/1857-0070.2023.2-58-06","DOIUrl":null,"url":null,"abstract":"Digitalization of the energy sector leads to an increase in the volume and rate of data collection. A primary barrier to the proper management of the technological data is the lack of data labeling corresponding to emergency modes, power equipment technical state, etc. Thus, despite the large amount of data, there is a shortage of labeled data suitable for training, validating and testing the machine learning models. Labeling by an expert takes too much time, so there is an actual task to automatically identify data fragments that are potentially of interest. The aim of the work is to develop an algorithm for prioritizing the fragments of the time series using the compact recurrent autoencoder. To achieve the goal, a neural network architecture was developed based on recurrent encoding and decoding cells, capable of unsupervised learning. The model was tested on two data sets: a synthetic sinusoidal signal with missing values and electric current measurements with thermal limit deviations. The substantial results of the work are the compact architecture of the autocoding model and the high interpretability of the output. The most significant achievements of the study are both the autocoding neural network model, which does not require initial assumption about the type of deviations, and the proposed algorithm for prioritizing the data fragments. The significance of the results is prooved by the reduction of the time for analyzing and labeling large data arrays with technological parameters of the electrical networks, which allows using these data for training, validating and testing.","PeriodicalId":41974,"journal":{"name":"Problemele Energeticii Regionale","volume":" ","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Recurrent Neural Network-Based Autoencoder for Problems of Automatic Time Series Analysis at Power Facilities\",\"authors\":\"P. Matrenin, A. Khalyasmaa, Y. V. Potachits\",\"doi\":\"10.52254/1857-0070.2023.2-58-06\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digitalization of the energy sector leads to an increase in the volume and rate of data collection. A primary barrier to the proper management of the technological data is the lack of data labeling corresponding to emergency modes, power equipment technical state, etc. Thus, despite the large amount of data, there is a shortage of labeled data suitable for training, validating and testing the machine learning models. Labeling by an expert takes too much time, so there is an actual task to automatically identify data fragments that are potentially of interest. The aim of the work is to develop an algorithm for prioritizing the fragments of the time series using the compact recurrent autoencoder. To achieve the goal, a neural network architecture was developed based on recurrent encoding and decoding cells, capable of unsupervised learning. The model was tested on two data sets: a synthetic sinusoidal signal with missing values and electric current measurements with thermal limit deviations. The substantial results of the work are the compact architecture of the autocoding model and the high interpretability of the output. The most significant achievements of the study are both the autocoding neural network model, which does not require initial assumption about the type of deviations, and the proposed algorithm for prioritizing the data fragments. The significance of the results is prooved by the reduction of the time for analyzing and labeling large data arrays with technological parameters of the electrical networks, which allows using these data for training, validating and testing.\",\"PeriodicalId\":41974,\"journal\":{\"name\":\"Problemele Energeticii Regionale\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2023-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Problemele Energeticii Regionale\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52254/1857-0070.2023.2-58-06\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Problemele Energeticii Regionale","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52254/1857-0070.2023.2-58-06","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

能源部门的数字化导致数据收集量和速度的增加。正确管理技术数据的主要障碍是缺乏与应急模式、电力设备技术状态等相对应的数据标签。因此，尽管数据量很大，但缺乏适合训练、验证和测试机器学习模型的标签数据。由专家进行标记需要花费太多时间，因此实际任务是自动识别可能感兴趣的数据片段。这项工作的目的是开发一种使用紧凑递归自动编码器对时间序列片段进行优先级排序的算法。为了实现这一目标，开发了一种基于递归编码和解码单元的神经网络架构，能够进行无监督学习。该模型在两个数据集上进行了测试：具有缺失值的合成正弦信号和具有热极限偏差的电流测量。这项工作的实质性结果是自动编码模型的紧凑架构和输出的高可解释性。该研究最重要的成就是不需要对偏差类型进行初始假设的自动编码神经网络模型，以及所提出的对数据片段进行优先级排序的算法。减少了用电网技术参数分析和标记大型数据阵列的时间，从而可以将这些数据用于训练、验证和测试，从而证明了结果的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Recurrent Neural Network-Based Autoencoder for Problems of Automatic Time Series Analysis at Power Facilities

Digitalization of the energy sector leads to an increase in the volume and rate of data collection. A primary barrier to the proper management of the technological data is the lack of data labeling corresponding to emergency modes, power equipment technical state, etc. Thus, despite the large amount of data, there is a shortage of labeled data suitable for training, validating and testing the machine learning models. Labeling by an expert takes too much time, so there is an actual task to automatically identify data fragments that are potentially of interest. The aim of the work is to develop an algorithm for prioritizing the fragments of the time series using the compact recurrent autoencoder. To achieve the goal, a neural network architecture was developed based on recurrent encoding and decoding cells, capable of unsupervised learning. The model was tested on two data sets: a synthetic sinusoidal signal with missing values and electric current measurements with thermal limit deviations. The substantial results of the work are the compact architecture of the autocoding model and the high interpretability of the output. The most significant achievements of the study are both the autocoding neural network model, which does not require initial assumption about the type of deviations, and the proposed algorithm for prioritizing the data fragments. The significance of the results is prooved by the reduction of the time for analyzing and labeling large data arrays with technological parameters of the electrical networks, which allows using these data for training, validating and testing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Problemele Energeticii Regionale ENERGY & FUELS-

CiteScore

0.70

自引率

33.30%

发文量