The persistent homology of genealogical networks.

IF 1.3 Q3 COMPUTER SCIENCE, THEORY & METHODS
Zachary M Boyd, Nick Callor, Taylor Gledhill, Abigail Jenkins, Robert Snellman, Benjamin Webb, Raelynn Wonnacott
{"title":"The persistent homology of genealogical networks.","authors":"Zachary M Boyd,&nbsp;Nick Callor,&nbsp;Taylor Gledhill,&nbsp;Abigail Jenkins,&nbsp;Robert Snellman,&nbsp;Benjamin Webb,&nbsp;Raelynn Wonnacott","doi":"10.1007/s41109-023-00538-7","DOIUrl":null,"url":null,"abstract":"<p><p>Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one's immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network's persistence curve, which encodes the network's set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Network Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41109-023-00538-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one's immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network's persistence curve, which encodes the network's set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand.

Abstract Image

Abstract Image

Abstract Image

宗谱网络的持久同源性。
家谱网络(即家谱)日益引起人们的兴趣,目前已知的最大数据集包括远远超过10亿个人。对家族史的兴趣也支持了一个85亿美元的产业,其规模预计将在7年内翻一番[FutureWise报告HC-1137]。然而,很少有数学关注谱系网络的复杂网络特性,特别是在大尺度上。家谱网络的结构特别令人感兴趣,因为形成联盟的实践,例如婚姻,通常是在一个人的直系亲属之外。在大多数其他网络中,包括其他社交网络,对关系形成的距离没有相应的限制。为了研究这对系谱网络的影响,我们使用持久同源性来识别和比较101个系谱网络和31个其他社会网络的结构。具体来说,我们引入了网络持久曲线的概念,它对网络的持久间隔集进行编码。我们发现,与其他社会网络相比,家谱网络的持续曲线具有明显的结构。这种结构上的差异也延伸到系谱和社会网络的子网络,这表明,即使数据不完整,持久的同源性也可以用于有意义的系谱网络分析。在这里,我们还描述了如何使用持久同源性来表示来自系谱网络的概念,例如共同祖先循环。我们预计,随着人们对祖先研究的兴趣不断扩大,持久的同源工具将在家谱探索中变得越来越重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Network Science
Applied Network Science Multidisciplinary-Multidisciplinary
CiteScore
4.60
自引率
4.50%
发文量
74
审稿时长
5 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信