建立近邻图模型以估算数据独立性的概率

Q3 Mathematics
A. A. Kislitsyn
{"title":"建立近邻图模型以估算数据独立性的概率","authors":"A. A. Kislitsyn","doi":"10.1134/s2070048223070086","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>The proposed method is based on calculations of the statistics of the nearest neighbor graph (NNG) structures, which are presented as a benchmark of the probabilities of the distribution of graphs by the number of disconnected fragments. The deviation of the actually observed occurrence of connectivity from the calculated one will allow us to determine the probability that this sample can be considered a set of statistically independent variables. The statements about the independence of the NNG statistics from the distribution of distances and from the triangle inequality are proved, which allows the numerical modeling of such structures. Estimates of the accuracy of the calculated statistics for graphs and their comparison with estimates obtained by modeling random coordinates of points in <i>d</i>-dimensional space are carried out. It is shown that the model of the NNGs without taking into account the dimension of the space leads to fairly accurate estimates of the statistics of graph structures in spaces of dimensionality higher than five. For spaces of smaller dimensionality, the benchmark can be obtained by directly calculating the distances between points with random coordinates in a unit cube. The proposed method is applied to the problem of analyzing the level of unsteadiness of the earthquake catalog in the Kuril–Kamchatka region. The lengths of samples of time intervals between neighboring events are analyzed. It is shown that the analyzed system as a whole is interconnected with a probability of 0.91, and this dependence is fundamentally different from the lag correlation between the sample elements.</p>","PeriodicalId":38050,"journal":{"name":"Mathematical Models and Computer Simulations","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modeling the Nearest Neighbor Graphs to Estimate the Probability of the Independence of Data\",\"authors\":\"A. A. Kislitsyn\",\"doi\":\"10.1134/s2070048223070086\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>The proposed method is based on calculations of the statistics of the nearest neighbor graph (NNG) structures, which are presented as a benchmark of the probabilities of the distribution of graphs by the number of disconnected fragments. The deviation of the actually observed occurrence of connectivity from the calculated one will allow us to determine the probability that this sample can be considered a set of statistically independent variables. The statements about the independence of the NNG statistics from the distribution of distances and from the triangle inequality are proved, which allows the numerical modeling of such structures. Estimates of the accuracy of the calculated statistics for graphs and their comparison with estimates obtained by modeling random coordinates of points in <i>d</i>-dimensional space are carried out. It is shown that the model of the NNGs without taking into account the dimension of the space leads to fairly accurate estimates of the statistics of graph structures in spaces of dimensionality higher than five. For spaces of smaller dimensionality, the benchmark can be obtained by directly calculating the distances between points with random coordinates in a unit cube. The proposed method is applied to the problem of analyzing the level of unsteadiness of the earthquake catalog in the Kuril–Kamchatka region. The lengths of samples of time intervals between neighboring events are analyzed. It is shown that the analyzed system as a whole is interconnected with a probability of 0.91, and this dependence is fundamentally different from the lag correlation between the sample elements.</p>\",\"PeriodicalId\":38050,\"journal\":{\"name\":\"Mathematical Models and Computer Simulations\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Models and Computer Simulations\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1134/s2070048223070086\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Models and Computer Simulations","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1134/s2070048223070086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

摘要

摘要 本文提出的方法基于对近邻图(NNG)结构统计量的计算,并将其作为按断开片段数量计算图分布概率的基准。根据实际观测到的连通性与计算结果的偏差,我们可以确定该样本被视为一组统计上独立变量的概率。我们证明了 NNG 统计量与距离分布和三角形不等式的独立性,从而可以对此类结构进行数值建模。对图形计算统计量的准确性进行了估算,并将其与 d 维空间中点的随机坐标建模所获得的估算值进行了比较。结果表明,在不考虑空间维度的情况下,NNG 模型可以对维度大于 5 的空间中的图形结构的统计量进行相当精确的估计。对于维度较小的空间,可以通过直接计算单位立方体中随机坐标点之间的距离来获得基准。所提出的方法被应用于分析千岛-堪察加地区地震目录的不稳定性程度。对相邻事件之间的时间间隔样本长度进行了分析。结果表明,所分析的系统作为一个整体以 0.91 的概率相互关联,这种依赖性与样本元素之间的滞后相关性有着本质区别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Modeling the Nearest Neighbor Graphs to Estimate the Probability of the Independence of Data

Modeling the Nearest Neighbor Graphs to Estimate the Probability of the Independence of Data

Abstract

The proposed method is based on calculations of the statistics of the nearest neighbor graph (NNG) structures, which are presented as a benchmark of the probabilities of the distribution of graphs by the number of disconnected fragments. The deviation of the actually observed occurrence of connectivity from the calculated one will allow us to determine the probability that this sample can be considered a set of statistically independent variables. The statements about the independence of the NNG statistics from the distribution of distances and from the triangle inequality are proved, which allows the numerical modeling of such structures. Estimates of the accuracy of the calculated statistics for graphs and their comparison with estimates obtained by modeling random coordinates of points in d-dimensional space are carried out. It is shown that the model of the NNGs without taking into account the dimension of the space leads to fairly accurate estimates of the statistics of graph structures in spaces of dimensionality higher than five. For spaces of smaller dimensionality, the benchmark can be obtained by directly calculating the distances between points with random coordinates in a unit cube. The proposed method is applied to the problem of analyzing the level of unsteadiness of the earthquake catalog in the Kuril–Kamchatka region. The lengths of samples of time intervals between neighboring events are analyzed. It is shown that the analyzed system as a whole is interconnected with a probability of 0.91, and this dependence is fundamentally different from the lag correlation between the sample elements.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mathematical Models and Computer Simulations
Mathematical Models and Computer Simulations Mathematics-Computational Mathematics
CiteScore
1.20
自引率
0.00%
发文量
99
期刊介绍: Mathematical Models and Computer Simulations  is a journal that publishes high-quality and original articles at the forefront of development of mathematical models, numerical methods, computer-assisted studies in science and engineering with the potential for impact across the sciences, and construction of massively parallel codes for supercomputers. The problem-oriented papers are devoted to various problems including industrial mathematics, numerical simulation in multiscale and multiphysics, materials science, chemistry, economics, social, and life sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信