Robust isochron calculation

IF 2.5 Q2 GEOCHEMISTRY & GEOPHYSICS

Geochronology Pub Date : 2020-02-14 DOI:10.5194/gchron-2020-4

R. Powell, E. Green, Estephany Marillo Sialer, J. Woodhead

{"title":"Robust isochron calculation","authors":"R. Powell, E. Green, Estephany Marillo Sialer, J. Woodhead","doi":"10.5194/gchron-2020-4","DOIUrl":null,"url":null,"abstract":"Abstract. The standard classical statistics approach to isochron calculation assumes that the distribution of uncertainties on the data arising from isotopic analysis is strictly Gaussian. This effectively excludes datasets that have more scatter from consideration, even though many appear to have age significance. A new approach to isochron calculations is developed in order to circumvent this problem, requiring only that the central part of the uncertainty distribution of the data defines a “spine” in the trend of the data. This central spine can be Gaussian but this is not a requirement. This approach significantly increases the range of datasets from which age information can be extracted but also provides seamless integration with well-behaved datasets and thus all legacy age determinations. The approach is built on the robust statistics of Huber (1981) but using the data uncertainties for the scale of data scatter around the spine rather than a scale derived from the scatter itself, ignoring the data uncertainties. This robust data fitting reliably determines the position of the spine when applied to data with outliers but converges on the classical statistics approach for datasets without outliers. The spine width is determined by a robust measure, the normalised median absolute deviation of the distances of the data points to the centre of the spine, divided by the uncertainties on the distances. A test is provided to ascertain that there is a spine in the data, requiring that the spine width is consistent with the uncertainties expected for Gaussian-distributed data. An iteratively reweighted least squares algorithm is presented to calculate the position of the robust line and its uncertainty, accompanied by an implementation in Python.\n","PeriodicalId":12723,"journal":{"name":"Geochronology","volume":"34 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geochronology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/gchron-2020-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}

引用次数: 19

Abstract

Abstract. The standard classical statistics approach to isochron calculation assumes that the distribution of uncertainties on the data arising from isotopic analysis is strictly Gaussian. This effectively excludes datasets that have more scatter from consideration, even though many appear to have age significance. A new approach to isochron calculations is developed in order to circumvent this problem, requiring only that the central part of the uncertainty distribution of the data defines a “spine” in the trend of the data. This central spine can be Gaussian but this is not a requirement. This approach significantly increases the range of datasets from which age information can be extracted but also provides seamless integration with well-behaved datasets and thus all legacy age determinations. The approach is built on the robust statistics of Huber (1981) but using the data uncertainties for the scale of data scatter around the spine rather than a scale derived from the scatter itself, ignoring the data uncertainties. This robust data fitting reliably determines the position of the spine when applied to data with outliers but converges on the classical statistics approach for datasets without outliers. The spine width is determined by a robust measure, the normalised median absolute deviation of the distances of the data points to the centre of the spine, divided by the uncertainties on the distances. A test is provided to ascertain that there is a spine in the data, requiring that the spine width is consistent with the uncertainties expected for Gaussian-distributed data. An iteratively reweighted least squares algorithm is presented to calculate the position of the robust line and its uncertainty, accompanied by an implementation in Python.

查看原文本刊更多论文

稳健等时计算

摘要等时线计算的标准经典统计方法假定同位素分析数据上的不确定性分布是严格的高斯分布。这有效地排除了分散度更高的数据集，尽管许多数据集似乎具有年龄意义。为了避免这个问题，一种新的等时线计算方法被开发出来，只需要数据不确定性分布的中心部分在数据的趋势中定义一个“脊柱”。这个中央脊柱可以是高斯的，但这不是必需的。这种方法大大增加了可以从中提取年龄信息的数据集的范围，而且还提供了与行为良好的数据集的无缝集成，从而提供了所有遗留年龄确定。该方法建立在Huber(1981)的稳健统计基础上，但使用数据不确定性作为脊柱周围数据分散的尺度，而不是从分散本身得出的尺度，忽略了数据不确定性。当应用于有异常值的数据时，这种稳健的数据拟合可靠地确定了脊柱的位置，但对于没有异常值的数据集，这种方法收敛于经典统计方法。脊柱宽度由一种稳健的测量方法决定，即数据点到脊柱中心距离的归一化中位数绝对偏差，除以距离的不确定性。提供了一个测试来确定数据中是否存在脊椎，要求脊椎宽度与高斯分布数据的不确定性相一致。提出了一种迭代加权最小二乘算法来计算鲁棒线的位置及其不确定性，并给出了Python实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊