离群值对釜山沿海环境监测数据统计度量的影响

Q4 Engineering
Hong-Yeon Cho, Ki-Seop Lee, Soon-Mo Ahn
{"title":"离群值对釜山沿海环境监测数据统计度量的影响","authors":"Hong-Yeon Cho, Ki-Seop Lee, Soon-Mo Ahn","doi":"10.4217/OPR.2016.38.2.149","DOIUrl":null,"url":null,"abstract":"The statistical measures of the coastal environmental data are used in a variety of statistical inferences, hypothesis tests, and data-driven modeling. If the measures are biased, then the statistical estimations and models may also be biased and this potential for bias is great when data contain some outliers defined as extraordinary large or small data values. This study aims to suggest more robust statistical measures as alternatives to more commonly used measures and to assess the performance these robust measures through a quantitative evaluation of more typical measures, such as in terms of locations, spreads, and shapes, with regard to environmental monitoring data in the Busan coastal sea. The detection of outliers within the data was carried out on the basis of Rosner’s test. About 5−10% of the nutrient data were found to contain outliers based on Rosner’s test. After removal (zero-weighting) of the outliers in the data sets, the relative change ratios of the mean and standard deviation between before and after outlier-removal conditions revealed the figures 13 and 33%, respectively. The variation magnitudes of skewness and kurtosis are 1.36 and 8.11 in a decreasing trend, respectively. On the other hand, the change ratios for more robust measures regarding the mean and standard deviation are 3.7−10.5%, and the variation magnitudes of robust skewness and kurtosis are about only 2−4% of the magnitude of the non-robust measures. The robust measures can be regarded as outlier-resistant statistical measures based on the relatively small changes in the scenarios before and after outlier removal conditions.","PeriodicalId":35665,"journal":{"name":"Ocean and Polar Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Impact of Outliers on the Statistical Measures of the Environmental Monitoring Data in Busan Coastal Sea\",\"authors\":\"Hong-Yeon Cho, Ki-Seop Lee, Soon-Mo Ahn\",\"doi\":\"10.4217/OPR.2016.38.2.149\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The statistical measures of the coastal environmental data are used in a variety of statistical inferences, hypothesis tests, and data-driven modeling. If the measures are biased, then the statistical estimations and models may also be biased and this potential for bias is great when data contain some outliers defined as extraordinary large or small data values. This study aims to suggest more robust statistical measures as alternatives to more commonly used measures and to assess the performance these robust measures through a quantitative evaluation of more typical measures, such as in terms of locations, spreads, and shapes, with regard to environmental monitoring data in the Busan coastal sea. The detection of outliers within the data was carried out on the basis of Rosner’s test. About 5−10% of the nutrient data were found to contain outliers based on Rosner’s test. After removal (zero-weighting) of the outliers in the data sets, the relative change ratios of the mean and standard deviation between before and after outlier-removal conditions revealed the figures 13 and 33%, respectively. The variation magnitudes of skewness and kurtosis are 1.36 and 8.11 in a decreasing trend, respectively. On the other hand, the change ratios for more robust measures regarding the mean and standard deviation are 3.7−10.5%, and the variation magnitudes of robust skewness and kurtosis are about only 2−4% of the magnitude of the non-robust measures. The robust measures can be regarded as outlier-resistant statistical measures based on the relatively small changes in the scenarios before and after outlier removal conditions.\",\"PeriodicalId\":35665,\"journal\":{\"name\":\"Ocean and Polar Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean and Polar Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4217/OPR.2016.38.2.149\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean and Polar Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4217/OPR.2016.38.2.149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 4

摘要

沿海环境数据的统计度量用于各种统计推断、假设检验和数据驱动建模。如果测量是有偏差的,那么统计估计和模型也可能是有偏差的,当数据包含一些异常值(定义为异常大或异常小的数据值)时,这种偏差的可能性很大。本研究旨在提出更可靠的统计措施,作为更常用措施的替代方案,并通过对釜山沿海环境监测数据中更典型的措施(如地点、分布和形状)的定量评估,评估这些可靠措施的性能。在Rosner检验的基础上对数据中的异常值进行检测。根据Rosner的测试,大约5 - 10%的营养数据被发现含有异常值。数据集中的离群值去除(零加权)后,离群值去除前后的均值和标准差的相对变化率分别为图13和33%。偏度和峰度的变异量分别为1.36和8.11,呈减小趋势。另一方面,更为稳健的均值和标准差的变化率为3.7 ~ 10.5%,稳健偏度和峰度的变化幅度仅为非稳健措施的2 ~ 4%。鲁棒性措施可以看作是基于异常值去除条件前后情景变化相对较小的抗异常值统计措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Impact of Outliers on the Statistical Measures of the Environmental Monitoring Data in Busan Coastal Sea
The statistical measures of the coastal environmental data are used in a variety of statistical inferences, hypothesis tests, and data-driven modeling. If the measures are biased, then the statistical estimations and models may also be biased and this potential for bias is great when data contain some outliers defined as extraordinary large or small data values. This study aims to suggest more robust statistical measures as alternatives to more commonly used measures and to assess the performance these robust measures through a quantitative evaluation of more typical measures, such as in terms of locations, spreads, and shapes, with regard to environmental monitoring data in the Busan coastal sea. The detection of outliers within the data was carried out on the basis of Rosner’s test. About 5−10% of the nutrient data were found to contain outliers based on Rosner’s test. After removal (zero-weighting) of the outliers in the data sets, the relative change ratios of the mean and standard deviation between before and after outlier-removal conditions revealed the figures 13 and 33%, respectively. The variation magnitudes of skewness and kurtosis are 1.36 and 8.11 in a decreasing trend, respectively. On the other hand, the change ratios for more robust measures regarding the mean and standard deviation are 3.7−10.5%, and the variation magnitudes of robust skewness and kurtosis are about only 2−4% of the magnitude of the non-robust measures. The robust measures can be regarded as outlier-resistant statistical measures based on the relatively small changes in the scenarios before and after outlier removal conditions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ocean and Polar Research
Ocean and Polar Research Engineering-Ocean Engineering
CiteScore
0.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信