按时间顺序计算平均值——一种不应被忽视的统计数据处理方法

Аналитика и контроль Pub Date : 2023-01-01 DOI:10.15826/analitika.2023.27.1.005

Igor G. Zenkevich

{"title":"按时间顺序计算平均值——一种不应被忽视的统计数据处理方法","authors":"Igor G. Zenkevich","doi":"10.15826/analitika.2023.27.1.005","DOIUrl":null,"url":null,"abstract":"In real analytical practice involving parallel determinations there are certain objective reasons (limited measurement time, available resources, etc.) that prevent performing sufficient number of measurements required for rigorous statistical data processing. Such parameters as standard deviations are particularly unreliable for small data sets (they are usually too high in comparison with the results obtained for more representative data sets). This problem can be minimized by changing the character of data processing, i.e. calculating so-called chronological averages instead of average arithmetical values. This, however, requires ranking the data not in the order as they were measured, but in the ascending order. The essence of calculating chronological averages is that the minimum and the maximum values of the initial data set are replaced with a single number, namely their arithmetical average, which is then averaged with other data. As a result, the total number of data values decreases by one but the changes of the averages would be negligible. At the same time, the standard deviations of the modified data sets become more characteristic and approaching the standard deviations of extended data sets. Replacing the arithmetical means by chronological average values leads to sufficiently smaller distortions of the initial data sets than, for example, in the case of excluding outliers by using the well-known “3s” criterion.","PeriodicalId":24041,"journal":{"name":"Аналитика и контроль","volume":"55 17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Calculation of average chronological values – an undeservedly neglected method of statistical data processing\",\"authors\":\"Igor G. Zenkevich\",\"doi\":\"10.15826/analitika.2023.27.1.005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real analytical practice involving parallel determinations there are certain objective reasons (limited measurement time, available resources, etc.) that prevent performing sufficient number of measurements required for rigorous statistical data processing. Such parameters as standard deviations are particularly unreliable for small data sets (they are usually too high in comparison with the results obtained for more representative data sets). This problem can be minimized by changing the character of data processing, i.e. calculating so-called chronological averages instead of average arithmetical values. This, however, requires ranking the data not in the order as they were measured, but in the ascending order. The essence of calculating chronological averages is that the minimum and the maximum values of the initial data set are replaced with a single number, namely their arithmetical average, which is then averaged with other data. As a result, the total number of data values decreases by one but the changes of the averages would be negligible. At the same time, the standard deviations of the modified data sets become more characteristic and approaching the standard deviations of extended data sets. Replacing the arithmetical means by chronological average values leads to sufficiently smaller distortions of the initial data sets than, for example, in the case of excluding outliers by using the well-known “3s” criterion.\",\"PeriodicalId\":24041,\"journal\":{\"name\":\"Аналитика и контроль\",\"volume\":\"55 17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Аналитика и контроль\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15826/analitika.2023.27.1.005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Аналитика и контроль","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15826/analitika.2023.27.1.005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在涉及平行测定的实际分析实践中，存在某些客观原因(有限的测量时间、可用资源等)，妨碍执行严格统计数据处理所需的足够数量的测量。标准偏差等参数对于小数据集尤其不可靠(与更具代表性的数据集获得的结果相比，它们通常过高)。这个问题可以通过改变数据处理的特征来最小化，即计算所谓的时间平均值而不是平均的算术值。然而，这需要将数据按升序排列，而不是按测量的顺序排列。计算时间平均的本质是将初始数据集的最小值和最大值替换为一个数字，即它们的算术平均值，然后将其与其他数据平均。因此，数据值的总数减少了一个，但平均值的变化可以忽略不计。同时，修正后的数据集的标准差变得更有特征，接近扩展数据集的标准差。用按时间顺序排列的平均值代替算术平均值，对初始数据集造成的扭曲比使用众所周知的“3s”标准排除异常值的情况要小得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Calculation of average chronological values – an undeservedly neglected method of statistical data processing

In real analytical practice involving parallel determinations there are certain objective reasons (limited measurement time, available resources, etc.) that prevent performing sufficient number of measurements required for rigorous statistical data processing. Such parameters as standard deviations are particularly unreliable for small data sets (they are usually too high in comparison with the results obtained for more representative data sets). This problem can be minimized by changing the character of data processing, i.e. calculating so-called chronological averages instead of average arithmetical values. This, however, requires ranking the data not in the order as they were measured, but in the ascending order. The essence of calculating chronological averages is that the minimum and the maximum values of the initial data set are replaced with a single number, namely their arithmetical average, which is then averaged with other data. As a result, the total number of data values decreases by one but the changes of the averages would be negligible. At the same time, the standard deviations of the modified data sets become more characteristic and approaching the standard deviations of extended data sets. Replacing the arithmetical means by chronological average values leads to sufficiently smaller distortions of the initial data sets than, for example, in the case of excluding outliers by using the well-known “3s” criterion.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Аналитика и контроль

自引率

0.00%

发文量