Py_ped_sim:灵活的前向谱系和遗传模拟器,用于复杂的家族谱系分析。

IF 2.9 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
Miguel Guardado, Cynthia Perez, Sthen Campana, Berenice Chavez Rojas, Joaquín Magaña, Shalom Jackson, Emily Samperio, Selena Hernandez, Kaela Syas, Ryan D Hernandez, Elena I Zavala, Rori V Rohlfs
{"title":"Py_ped_sim:灵活的前向谱系和遗传模拟器,用于复杂的家族谱系分析。","authors":"Miguel Guardado, Cynthia Perez, Sthen Campana, Berenice Chavez Rojas, Joaquín Magaña, Shalom Jackson, Emily Samperio, Selena Hernandez, Kaela Syas, Ryan D Hernandez, Elena I Zavala, Rori V Rohlfs","doi":"10.1186/s12859-025-06142-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large-scale family pedigrees are commonly used across medical, evolutionary, and forensic genetics. These pedigrees are tools for identifying genetic disorders, tracking evolutionary patterns, and establishing familial relationships via forensic genetic identification. However, there is a lack of software to accurately simulate different pedigree structures along with genomes corresponding to those individuals in a family pedigree. This limits simulation-based evaluations of methods that use pedigrees.</p><p><strong>Results: </strong>We have developed a python command-line-based tool called py_ped_sim that facilitates the simulation of pedigree structures and the genomes of individuals in a pedigree. py_ped_sim represents pedigrees as directed acyclic graphs, enabling conversion between standard pedigree formats and integration with the forward population genetic simulator, SLiM. Notably, py_ped_sim allows the simulation of varying numbers of offspring for a set of parents, with the capacity to shift the distribution of sibship sizes over generations. We additionally add simulations for events of misattributed paternity, which offers a way to simulate half-sibling relationships, and simulations to extend the breadth of a family pedigree. We validated the accuracy of both our genome simulator and pedigree simulator. We show that we can simulate genomes onto family pedigrees with levels of expected kinship.</p><p><strong>Conclusions: </strong>py_ped_sim is a user-friendly and open-source solution for simulating pedigree structures and conducting pedigree genome simulations. It empowers medical, forensic, and evolutionary genetics researchers to gain deeper insights into the dynamics of genetic inheritance and relatedness within families.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"122"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12060417/pdf/","citationCount":"0","resultStr":"{\"title\":\"py_ped_sim: a flexible forward pedigree and genetic simulator for complex family pedigree analysis.\",\"authors\":\"Miguel Guardado, Cynthia Perez, Sthen Campana, Berenice Chavez Rojas, Joaquín Magaña, Shalom Jackson, Emily Samperio, Selena Hernandez, Kaela Syas, Ryan D Hernandez, Elena I Zavala, Rori V Rohlfs\",\"doi\":\"10.1186/s12859-025-06142-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Large-scale family pedigrees are commonly used across medical, evolutionary, and forensic genetics. These pedigrees are tools for identifying genetic disorders, tracking evolutionary patterns, and establishing familial relationships via forensic genetic identification. However, there is a lack of software to accurately simulate different pedigree structures along with genomes corresponding to those individuals in a family pedigree. This limits simulation-based evaluations of methods that use pedigrees.</p><p><strong>Results: </strong>We have developed a python command-line-based tool called py_ped_sim that facilitates the simulation of pedigree structures and the genomes of individuals in a pedigree. py_ped_sim represents pedigrees as directed acyclic graphs, enabling conversion between standard pedigree formats and integration with the forward population genetic simulator, SLiM. Notably, py_ped_sim allows the simulation of varying numbers of offspring for a set of parents, with the capacity to shift the distribution of sibship sizes over generations. We additionally add simulations for events of misattributed paternity, which offers a way to simulate half-sibling relationships, and simulations to extend the breadth of a family pedigree. We validated the accuracy of both our genome simulator and pedigree simulator. We show that we can simulate genomes onto family pedigrees with levels of expected kinship.</p><p><strong>Conclusions: </strong>py_ped_sim is a user-friendly and open-source solution for simulating pedigree structures and conducting pedigree genome simulations. It empowers medical, forensic, and evolutionary genetics researchers to gain deeper insights into the dynamics of genetic inheritance and relatedness within families.</p>\",\"PeriodicalId\":8958,\"journal\":{\"name\":\"BMC Bioinformatics\",\"volume\":\"26 1\",\"pages\":\"122\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12060417/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12859-025-06142-z\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06142-z","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

背景:大尺度家庭谱系通常用于医学、进化和法医遗传学。这些家谱是鉴定遗传疾病、追踪进化模式和通过法医遗传鉴定建立家族关系的工具。然而,目前还缺乏能够准确模拟不同谱系结构的软件,以及与家族谱系中这些个体相对应的基因组。这限制了对使用谱系的方法进行基于模拟的评估。结果:我们开发了一个基于python命令行的工具py_ped_sim,它有助于模拟系谱结构和系谱中个体的基因组。py_ped_sim将谱系表示为有向无环图,支持标准谱系格式之间的转换,并与向前种群遗传模拟器SLiM集成。值得注意的是,py_ped_sim允许模拟一组父母的不同数量的后代,具有在几代之间改变兄弟姐妹大小分布的能力。我们还增加了对错误归属的父亲事件的模拟,这提供了一种模拟半兄弟姐妹关系的方法,以及扩展家庭谱系广度的模拟。我们验证了我们的基因组模拟器和谱系模拟器的准确性。我们表明,我们可以在具有预期亲缘关系水平的家庭谱系上模拟基因组。结论:py_ped_sim是一个用户友好的开源解决方案,用于模拟家系结构和进行家系基因组模拟。它使医学、法医和进化遗传学研究人员能够更深入地了解基因遗传的动态和家庭内的关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
py_ped_sim: a flexible forward pedigree and genetic simulator for complex family pedigree analysis.

Background: Large-scale family pedigrees are commonly used across medical, evolutionary, and forensic genetics. These pedigrees are tools for identifying genetic disorders, tracking evolutionary patterns, and establishing familial relationships via forensic genetic identification. However, there is a lack of software to accurately simulate different pedigree structures along with genomes corresponding to those individuals in a family pedigree. This limits simulation-based evaluations of methods that use pedigrees.

Results: We have developed a python command-line-based tool called py_ped_sim that facilitates the simulation of pedigree structures and the genomes of individuals in a pedigree. py_ped_sim represents pedigrees as directed acyclic graphs, enabling conversion between standard pedigree formats and integration with the forward population genetic simulator, SLiM. Notably, py_ped_sim allows the simulation of varying numbers of offspring for a set of parents, with the capacity to shift the distribution of sibship sizes over generations. We additionally add simulations for events of misattributed paternity, which offers a way to simulate half-sibling relationships, and simulations to extend the breadth of a family pedigree. We validated the accuracy of both our genome simulator and pedigree simulator. We show that we can simulate genomes onto family pedigrees with levels of expected kinship.

Conclusions: py_ped_sim is a user-friendly and open-source solution for simulating pedigree structures and conducting pedigree genome simulations. It empowers medical, forensic, and evolutionary genetics researchers to gain deeper insights into the dynamics of genetic inheritance and relatedness within families.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Bioinformatics
BMC Bioinformatics 生物-生化研究方法
CiteScore
5.70
自引率
3.30%
发文量
506
审稿时长
4.3 months
期刊介绍: BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信