Accelerating the learning process of a neural network by predicting the weight coefficient

V. Speranskyy, Mihail O. Domanciuc
{"title":"Accelerating the learning process of a neural network by predicting the weight coefficient","authors":"V. Speranskyy, Mihail O. Domanciuc","doi":"10.15276/hait.04.2021.1","DOIUrl":null,"url":null,"abstract":"The purpose of this study is to analyze and implement the acceleration of the neural network learning process by predicting the weight coefficients. The relevance of accelerating the learning of neural networks is touched upon, as well as the possibility of using prediction models in a wide range of tasks where it is necessary to build fast classifiers. When data is received from the array of sensors of a chemical unit in real time, it is necessary to be able to predict changes and change the operating parameters. After assessment, this should be done as quickly as possible in order to promptly change the current structure and state of the resulting substances.. Work on speeding up classifiers usually focuses on speeding up the applied classifier. The calculation of the predicted values of the weight coefficients is carried out using the calculation of the value using the known prediction models. The possibility of the combined use of prediction models and optimization models was tested to accelerate the learning process of a neural network. The scientific novelty of the study lies in the effectiveness analysis of prediction models use in training neural networks. For the experimental evaluation of the effectiveness of prediction models use, the classification problem was chosen. To solve the experimental problem, the type of neural network “multilayer perceptron” was chosen. The experiment is divided into several stages: initial training of the neural network without a model, and then using prediction models; initial training of a neural network without an optimization method, and then using optimization methods; initial training of the neural network using combinations of prediction models and optimization methods; measuring the relative error of using prediction models, optimization methods and combined use. Models such as “Seasonal Linear Regression”, “Simple Moving Average”, and “Jump” were used in the experiment. The “Jump” model was proposed and developed based on the results of observing the dependence of changes in the values of the weighting coefficient on the epoch. Methods such as “Adagrad”, “Adadelta”, “Adam” were chosen for training neural and subsequent verification of the combined use of prediction models with optimization methods. As a result of the study, the effectiveness of the use of prediction models in predicting the weight coefficients of a neural network has been revealed. The idea is proposed and models are used that can significantly reduce the training time of a neural network. The idea of using prediction models is that the model of the change in the weight coefficient from the epoch is a time series, which in turn tends to a certain value. As a result of the study, it was found that it is possible to combine prediction models and optimization models. Also, prediction models do not interfere with optimization models, since they do not affect the formula of the training itself, as a result of which it is possible to achieve rapid training of the neural network. In the practical part of the work, two known prediction models and the proposed developed model were used. As a result of the experiment, operating conditions were determined using prediction models.","PeriodicalId":375628,"journal":{"name":"Herald of Advanced Information Technology","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Herald of Advanced Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15276/hait.04.2021.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The purpose of this study is to analyze and implement the acceleration of the neural network learning process by predicting the weight coefficients. The relevance of accelerating the learning of neural networks is touched upon, as well as the possibility of using prediction models in a wide range of tasks where it is necessary to build fast classifiers. When data is received from the array of sensors of a chemical unit in real time, it is necessary to be able to predict changes and change the operating parameters. After assessment, this should be done as quickly as possible in order to promptly change the current structure and state of the resulting substances.. Work on speeding up classifiers usually focuses on speeding up the applied classifier. The calculation of the predicted values of the weight coefficients is carried out using the calculation of the value using the known prediction models. The possibility of the combined use of prediction models and optimization models was tested to accelerate the learning process of a neural network. The scientific novelty of the study lies in the effectiveness analysis of prediction models use in training neural networks. For the experimental evaluation of the effectiveness of prediction models use, the classification problem was chosen. To solve the experimental problem, the type of neural network “multilayer perceptron” was chosen. The experiment is divided into several stages: initial training of the neural network without a model, and then using prediction models; initial training of a neural network without an optimization method, and then using optimization methods; initial training of the neural network using combinations of prediction models and optimization methods; measuring the relative error of using prediction models, optimization methods and combined use. Models such as “Seasonal Linear Regression”, “Simple Moving Average”, and “Jump” were used in the experiment. The “Jump” model was proposed and developed based on the results of observing the dependence of changes in the values of the weighting coefficient on the epoch. Methods such as “Adagrad”, “Adadelta”, “Adam” were chosen for training neural and subsequent verification of the combined use of prediction models with optimization methods. As a result of the study, the effectiveness of the use of prediction models in predicting the weight coefficients of a neural network has been revealed. The idea is proposed and models are used that can significantly reduce the training time of a neural network. The idea of using prediction models is that the model of the change in the weight coefficient from the epoch is a time series, which in turn tends to a certain value. As a result of the study, it was found that it is possible to combine prediction models and optimization models. Also, prediction models do not interfere with optimization models, since they do not affect the formula of the training itself, as a result of which it is possible to achieve rapid training of the neural network. In the practical part of the work, two known prediction models and the proposed developed model were used. As a result of the experiment, operating conditions were determined using prediction models.
通过预测权系数来加速神经网络的学习过程
本研究的目的是通过预测权系数来分析和实现神经网络学习过程的加速。本文讨论了加速神经网络学习的相关性,以及在需要构建快速分类器的广泛任务中使用预测模型的可能性。当从化学装置的传感器阵列实时接收数据时,必须能够预测变化并更改操作参数。评估后,应尽快完成,以便及时改变所得物质的当前结构和状态。加速分类器的工作通常集中在加速应用分类器上。利用已知的预测模型计算权重系数的预测值,进行权重系数预测值的计算。验证了预测模型和优化模型结合使用的可能性,以加速神经网络的学习过程。本研究的科学新颖之处在于对用于训练神经网络的预测模型进行有效性分析。为了对预测模型使用的有效性进行实验评价,选择了分类问题。为了解决实验问题,选择了“多层感知机”类型的神经网络。实验分为几个阶段:在没有模型的情况下对神经网络进行初始训练,然后使用预测模型;对一个没有优化方法的神经网络进行初始训练,然后使用优化方法;结合预测模型和优化方法对神经网络进行初始训练;采用预测模型、优化方法和组合使用测量相对误差。实验中使用了“季节性线性回归”、“简单移动平均”和“跳跃”等模型。“跳跃”模型是在观测加权系数值变化随时代变化的基础上提出和发展起来的。选择“Adagrad”、“Adadelta”、“Adam”等方法进行神经训练,并对预测模型与优化方法的结合使用进行后续验证。研究结果揭示了利用预测模型预测神经网络权系数的有效性。提出了这种思想,并使用了能够显著减少神经网络训练时间的模型。使用预测模型的思想是,权重系数从历元开始变化的模型是一个时间序列,而时间序列又趋向于某一值。研究结果表明,预测模型与优化模型相结合是可行的。此外,预测模型不会干扰优化模型,因为它们不会影响训练本身的公式,因此可以实现神经网络的快速训练。在工作的实际部分,使用了两个已知的预测模型和提出的模型。根据实验结果,利用预测模型确定了操作条件。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信