Robust Learning Algorithm with LTS Error Function

A. Rusiecki
{"title":"Robust Learning Algorithm with LTS Error Function","authors":"A. Rusiecki","doi":"10.4018/978-1-59904-849-9.CH204","DOIUrl":null,"url":null,"abstract":"Feedforward neural networks (FFNs) are often considered as universal tools and find their applications in areas such as function approximation, pattern recognition, or signal and image processing. One of the main advantages of using FFNs is that they usually do not require, in the learning process, exact mathematical knowledge about input-output dependencies. In other words, they may be regarded as model-free approximators (Hornik, 1989). They learn by minimizing some kind of an error function to fit training data as close as possible. Such learning scheme doesn’t take into account a quality of the training data, so its performance depends strongly on the fact whether the assumption, that the data are reliable and trustable, is hold. This is why when the data are corrupted by the large noise, or when outliers and gross errors appear, the network builds a model that can be very inaccurate. In most real-world cases the assumption that errors are normal and iid, simply doesn’t hold. The data obtained from the environment are very often affected by noise of unknown form or outliers, suspected to be gross errors. The quantity of outliers in routine data ranges from 1 to 10% (Hampel, 1986). They usually appear in data sets during obtaining the information and pre-processing them when, for instance, measurement errors, long-tailed noise, or results of human mistakes may occur. Intuitively we can define an outlier as an observation that significantly deviates from the bulk of data. Nevertheless, this definition doesn’t help in classifying an outlier as a gross error or a meaningful and important observation. To deal with the problem of outliers a separate branch of statistics, called robust statistics (Hampel, 1986, Huber, 1981), was developed. Robust statistical methods are designed to act well when the true underlying model deviates from the assumed parametric model. Ideally, they should be efficient and reliable for the observations that are very close to the assumed model and simultaneously for the observations containing larger deviations and outliers. The other way is to detect and remove outliers before the beginning of the model building process. Such methods are more universal but they do not take into account the specific type of modeling philosophy (e.g. modeling by the FFNs). In this article we propose new robust FFNs learning algorithm based on the least trimmed squares estimator.","PeriodicalId":320314,"journal":{"name":"Encyclopedia of Artificial Intelligence","volume":"48 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Encyclopedia of Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/978-1-59904-849-9.CH204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Feedforward neural networks (FFNs) are often considered universal tools and find applications in areas such as function approximation, pattern recognition, and signal and image processing. One of the main advantages of using FFNs is that they usually do not require, during learning, exact mathematical knowledge about input-output dependencies. In other words, they may be regarded as model-free approximators (Hornik, 1989). They learn by minimizing some error function so as to fit the training data as closely as possible. Such a learning scheme does not take the quality of the training data into account, so its performance depends strongly on whether the assumption that the data are reliable and trustworthy holds. This is why, when the data are corrupted by large noise, or when outliers and gross errors appear, the network may build a model that is very inaccurate. In most real-world cases the assumption that errors are normal and i.i.d. simply does not hold. Data obtained from the environment are very often affected by noise of unknown form or by outliers suspected to be gross errors. The proportion of outliers in routine data ranges from 1% to 10% (Hampel, 1986). They usually enter data sets while the information is being gathered and pre-processed, when, for instance, measurement errors, long-tailed noise, or human mistakes may occur. Intuitively, we can define an outlier as an observation that deviates significantly from the bulk of the data. Nevertheless, this definition does not help in classifying an outlier as either a gross error or a meaningful and important observation. To deal with the problem of outliers, a separate branch of statistics, called robust statistics (Hampel, 1986; Huber, 1981), was developed. Robust statistical methods are designed to perform well when the true underlying model deviates from the assumed parametric model. Ideally, they should be efficient and reliable both for observations that are very close to the assumed model and for observations containing larger deviations and outliers. The other approach is to detect and remove outliers before the model-building process begins. Such methods are more universal, but they do not take into account the specific modeling philosophy (e.g., modeling by FFNs). In this article we propose a new robust FFN learning algorithm based on the least trimmed squares (LTS) estimator.
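For intuition, a minimal sketch of an LTS-style error computation follows. In the least trimmed squares estimator, the error is the sum of only the h smallest squared residuals, so the largest residuals (potential gross errors) are trimmed away and cannot dominate training. The function name `lts_error` and the `trim_fraction` parameter below are illustrative choices, not taken from the article.

```python
import numpy as np

def lts_error(residuals, trim_fraction=0.25):
    """LTS-style error: sum the h smallest squared residuals,
    discarding the largest ones as suspected outliers.
    trim_fraction = 0 recovers the ordinary sum of squared errors."""
    squared = np.sort(np.asarray(residuals) ** 2)  # ascending order
    n = len(squared)
    h = n - int(trim_fraction * n)                 # number of residuals kept
    return squared[:h].sum()

# One gross error among otherwise small residuals:
r = [0.1, -0.2, 0.05, 0.15, 10.0]                  # 10.0 is a clear outlier
print(lts_error(r, trim_fraction=0.2))             # outlier trimmed: 0.075
print(np.sum(np.square(r)))                        # ordinary SSE: 100.075
```

Because the set of trimmed residuals is recomputed at every evaluation, gradient-based training with such an error function effectively re-weights samples on the fly: the points currently fitted worst simply drop out of the gradient.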