Building robust neural networks using different loss functions

Analysis and data processing systems Pub Date : 2021-06-18 DOI:10.17212/2782-2001-2021-2-67-82

M. Sivak, V. Timofeev

{"title":"Building robust neural networks using different loss functions","authors":"M. Sivak, V. Timofeev","doi":"10.17212/2782-2001-2021-2-67-82","DOIUrl":null,"url":null,"abstract":"The paper considers the problem of building robust neural networks using different robust loss functions. Applying such neural networks is reasonably when working with noisy data, and it can serve as an alternative to data preprocessing and to making neural network architecture more complex. In order to work adequately, the error back-propagation algorithm requires a loss function to be continuously or two-times differentiable. According to this requirement, two five robust loss functions were chosen (Andrews, Welsch, Huber, Ramsey and Fair). Using the above-mentioned functions in the error back-propagation algorithm instead of the quadratic one allows obtaining an entirely new class of neural networks. For investigating the properties of the built networks a number of computational experiments were carried out. Different values of outliers’ fraction and various numbers of epochs were considered. The first step included adjusting the obtained neural networks, which lead to choosing such values of internal loss function parameters that resulted in achieving the highest accuracy of a neural network. To determine the ranges of parameter values, a preliminary study was pursued. The results of the first stage allowed giving recommendations on choosing the best parameter values for each of the loss functions under study. The second stage dealt with comparing the investigated robust networks with each other and with the classical one. The analysis of the results shows that using the robust technique leads to a significant increase in neural network accuracy and in a learning rate.","PeriodicalId":292298,"journal":{"name":"Analysis and data processing systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analysis and data processing systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17212/2782-2001-2021-2-67-82","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The paper considers the problem of building robust neural networks using different robust loss functions. Applying such neural networks is reasonably when working with noisy data, and it can serve as an alternative to data preprocessing and to making neural network architecture more complex. In order to work adequately, the error back-propagation algorithm requires a loss function to be continuously or two-times differentiable. According to this requirement, two five robust loss functions were chosen (Andrews, Welsch, Huber, Ramsey and Fair). Using the above-mentioned functions in the error back-propagation algorithm instead of the quadratic one allows obtaining an entirely new class of neural networks. For investigating the properties of the built networks a number of computational experiments were carried out. Different values of outliers’ fraction and various numbers of epochs were considered. The first step included adjusting the obtained neural networks, which lead to choosing such values of internal loss function parameters that resulted in achieving the highest accuracy of a neural network. To determine the ranges of parameter values, a preliminary study was pursued. The results of the first stage allowed giving recommendations on choosing the best parameter values for each of the loss functions under study. The second stage dealt with comparing the investigated robust networks with each other and with the classical one. The analysis of the results shows that using the robust technique leads to a significant increase in neural network accuracy and in a learning rate.

查看原文本刊更多论文

利用不同的损失函数构建鲁棒神经网络

研究了利用不同的鲁棒损失函数构建鲁棒神经网络的问题。在处理有噪声的数据时，应用这种神经网络是合理的，它可以作为数据预处理的一种替代方法，使神经网络结构变得更加复杂。为了充分地工作，误差反向传播算法要求损失函数是连续的或两次可微的。根据这一要求，我们选择了两个五个鲁棒损失函数(Andrews, Welsch, Huber, Ramsey和Fair)。在误差反向传播算法中使用上述函数代替二次型算法，可以得到一类全新的神经网络。为了研究所建网络的性质，进行了大量的计算实验。考虑了异常值分数的不同值和不同的历元数。第一步包括调整得到的神经网络，从而选择内部损失函数参数的值，从而使神经网络的精度达到最高。为了确定参数值的范围，进行了初步研究。第一阶段的结果允许对所研究的每个损失函数选择最佳参数值提出建议。第二阶段是将所研究的鲁棒网络相互比较，并与经典鲁棒网络进行比较。结果分析表明，采用鲁棒技术可以显著提高神经网络的准确率和学习率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Analysis and data processing systems

自引率

0.00%

发文量