{"title":"异常值的无分布检验","authors":"B. Candelon, N. Metiu","doi":"10.2139/ssrn.2796894","DOIUrl":null,"url":null,"abstract":"Determining whether a data set contains one or more outliers is a challenge commonly faced in applied statistics. This paper introduces a distribution-free test for multiple outliers in data drawn from an unknown data generating process. Besides, a sequential algorithm is proposed in order to identify the outlying observations in the sample. Our methodology relies on a two-stage nonparametric bootstrap procedure. Monte Carlo experiments show that the proposed test has good asymptotic properties, even for relatively small samples and heavy tailed distributions. The new outlier detection test could be instrumental in a wide range of statistical applications. The empirical performance of the test is illustrated by means of two examples in the fields of aeronautics and macroeconomics.","PeriodicalId":11744,"journal":{"name":"ERN: Nonparametric Methods (Topic)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A Distribution-Free Test for Outliers\",\"authors\":\"B. Candelon, N. Metiu\",\"doi\":\"10.2139/ssrn.2796894\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Determining whether a data set contains one or more outliers is a challenge commonly faced in applied statistics. This paper introduces a distribution-free test for multiple outliers in data drawn from an unknown data generating process. Besides, a sequential algorithm is proposed in order to identify the outlying observations in the sample. Our methodology relies on a two-stage nonparametric bootstrap procedure. Monte Carlo experiments show that the proposed test has good asymptotic properties, even for relatively small samples and heavy tailed distributions. The new outlier detection test could be instrumental in a wide range of statistical applications. The empirical performance of the test is illustrated by means of two examples in the fields of aeronautics and macroeconomics.\",\"PeriodicalId\":11744,\"journal\":{\"name\":\"ERN: Nonparametric Methods (Topic)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Nonparametric Methods (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.2796894\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Nonparametric Methods (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2796894","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Determining whether a data set contains one or more outliers is a challenge commonly faced in applied statistics. This paper introduces a distribution-free test for multiple outliers in data drawn from an unknown data generating process. Besides, a sequential algorithm is proposed in order to identify the outlying observations in the sample. Our methodology relies on a two-stage nonparametric bootstrap procedure. Monte Carlo experiments show that the proposed test has good asymptotic properties, even for relatively small samples and heavy tailed distributions. The new outlier detection test could be instrumental in a wide range of statistical applications. The empirical performance of the test is illustrated by means of two examples in the fields of aeronautics and macroeconomics.