Microsatellites’ mutation modeling through the analysis of the Y-chromosomal transmission: Results of a GHEP-ISFG collaborative study

IF 3.2 2区 医学 Q2 GENETICS & HEREDITY
Sofia Antão-Sousa , Leonor Gusmão , Nidia M. Modesti , Sofía Feliziani , Marisa Faustino , Valeria Marcucci , Claudia Sarapura , Julyana Ribeiro , Elizeu Carvalho , Vania Pereira , Carmen Tomas , Marian M. de Pancorbo , Miriam Baeta , Rashed Alghafri , Reem Almheiri , Juan José Builes , Nair Gouveia , German Burgos , Maria de Lurdes Pontes , Adriana Ibarra , Nadia Pinto
{"title":"Microsatellites’ mutation modeling through the analysis of the Y-chromosomal transmission: Results of a GHEP-ISFG collaborative study","authors":"Sofia Antão-Sousa ,&nbsp;Leonor Gusmão ,&nbsp;Nidia M. Modesti ,&nbsp;Sofía Feliziani ,&nbsp;Marisa Faustino ,&nbsp;Valeria Marcucci ,&nbsp;Claudia Sarapura ,&nbsp;Julyana Ribeiro ,&nbsp;Elizeu Carvalho ,&nbsp;Vania Pereira ,&nbsp;Carmen Tomas ,&nbsp;Marian M. de Pancorbo ,&nbsp;Miriam Baeta ,&nbsp;Rashed Alghafri ,&nbsp;Reem Almheiri ,&nbsp;Juan José Builes ,&nbsp;Nair Gouveia ,&nbsp;German Burgos ,&nbsp;Maria de Lurdes Pontes ,&nbsp;Adriana Ibarra ,&nbsp;Nadia Pinto","doi":"10.1016/j.fsigen.2023.102999","DOIUrl":null,"url":null,"abstract":"<div><p>The Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) organized a collaborative study on mutations of Y-chromosomal short tandem repeats (Y-STRs). New data from 2225 father-son duos and data from 44 previously published reports, corresponding to 25,729 duos, were collected and analyzed. Marker-specific mutation rates were estimated for 33 Y-STRs. Although highly dependent on the analyzed marker, mutations compatible with the gain or loss of a single repeat were 23.2 times more likely than those involving a greater number of repeats. Longer alleles (relatively to the modal one) showed to be nearly twice more mutable than the shorter ones. Within the subset of longer alleles, the loss of repeats showed to be nearly twice more likely than the gain. Conversely, shorter alleles showed a symmetrical trend, with repeat gains being twofold more frequent than reductions. A positive correlation between the paternal age and the mutation rate was observed, strengthening previous findings. The results of a machine learning approach, via logistic regression analyses, allowed the establishment of algebraic formulas for estimating the probability of mutation depending on paternal age and allele length for DYS389I, DYS393 and DYS627. Algebraic formulas could also be established considering only the allele length as predictor for DYS19, DYS389I, DYS389II-I, DYS390, DYS391, DYS393, DYS437, DYS439, DYS449, DYS456, DYS458, DYS460, DYS481, DYS518, DYS533, DYS576, DYS626 and DYS627 loci. For the remaining Y-STRs, a lack of statistical significance was observed, probably as a consequence of the small effective size of the subsets available, a common difficulty in the modeling of rare events as is the case of mutations. The amount of data used in the different analyses varied widely, depending on how the data were reported in the publications analyzed. This shows a regrettable waste of produced data, due to inadequate communication of the results, supporting an urgent need of publication guidelines for mutation studies.</p></div>","PeriodicalId":50435,"journal":{"name":"Forensic Science International-Genetics","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1872497323001746/pdfft?md5=0ebe9ee8d99e769626843f0524f7260b&pid=1-s2.0-S1872497323001746-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic Science International-Genetics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1872497323001746","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

The Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) organized a collaborative study on mutations of Y-chromosomal short tandem repeats (Y-STRs). New data from 2225 father-son duos and data from 44 previously published reports, corresponding to 25,729 duos, were collected and analyzed. Marker-specific mutation rates were estimated for 33 Y-STRs. Although highly dependent on the analyzed marker, mutations compatible with the gain or loss of a single repeat were 23.2 times more likely than those involving a greater number of repeats. Longer alleles (relatively to the modal one) showed to be nearly twice more mutable than the shorter ones. Within the subset of longer alleles, the loss of repeats showed to be nearly twice more likely than the gain. Conversely, shorter alleles showed a symmetrical trend, with repeat gains being twofold more frequent than reductions. A positive correlation between the paternal age and the mutation rate was observed, strengthening previous findings. The results of a machine learning approach, via logistic regression analyses, allowed the establishment of algebraic formulas for estimating the probability of mutation depending on paternal age and allele length for DYS389I, DYS393 and DYS627. Algebraic formulas could also be established considering only the allele length as predictor for DYS19, DYS389I, DYS389II-I, DYS390, DYS391, DYS393, DYS437, DYS439, DYS449, DYS456, DYS458, DYS460, DYS481, DYS518, DYS533, DYS576, DYS626 and DYS627 loci. For the remaining Y-STRs, a lack of statistical significance was observed, probably as a consequence of the small effective size of the subsets available, a common difficulty in the modeling of rare events as is the case of mutations. The amount of data used in the different analyses varied widely, depending on how the data were reported in the publications analyzed. This shows a regrettable waste of produced data, due to inadequate communication of the results, supporting an urgent need of publication guidelines for mutation studies.

通过分析 Y 染色体传播建立微卫星突变模型:GHEP-ISFG 合作研究的结果
国际法医遗传学协会西班牙语和葡萄牙语工作组(GHEP-ISFG)组织了一项关于 Y 染色体短串联重复序列(Y-STR)突变的合作研究。该研究收集并分析了来自 2,225 对父子的新数据,以及 44 份以前发表的报告中的数据,这些数据对应于 25,729 对父子。估计了 33 个 Y-STR 的标记特异性突变率。尽管突变与所分析的标记有很大关系,但涉及单个重复序列增益或缺失的突变几率是涉及更多重复序列的突变几率的 23.2 倍。较长等位基因(相对于模态等位基因)的突变率几乎是较短等位基因的两倍。在长等位基因子集中,重复序列丢失的可能性比重复序列增加的可能性高出近两倍。相反,较短的等位基因呈现出对称趋势,重复序列增加的频率是减少的两倍。父系年龄与突变率之间呈正相关,这加强了之前的研究结果。通过逻辑回归分析的机器学习方法的结果,可以建立代数公式来估计 DYS389I、DYS393 和 DYS627 的突变概率,该概率取决于父系年龄和等位基因长度。对于 DYS19、DYS389I、DYS389II-I、DYS390、DYS391、DYS393、DYS437、DYS439、DYS449、DYS456、DYS458、DYS460、DYS481、DYS518、DYS533、DYS576、DYS626 和 DYS627 基因位点,也可以建立只考虑等位基因长度作为预测因子的代数公式。在其余的 Y-STR 中,我们观察到缺乏统计意义,这可能是由于可用子集的有效规模较小,而这是建立罕见事件模型(如突变)的常见困难。不同分析所使用的数据量差别很大,这取决于所分析的出版物是如何报告数据的。这表明,由于结果交流不充分,造成了令人遗憾的数据浪费。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.50
自引率
32.30%
发文量
132
审稿时长
11.3 weeks
期刊介绍: Forensic Science International: Genetics is the premier journal in the field of Forensic Genetics. This branch of Forensic Science can be defined as the application of genetics to human and non-human material (in the sense of a science with the purpose of studying inherited characteristics for the analysis of inter- and intra-specific variations in populations) for the resolution of legal conflicts. The scope of the journal includes: Forensic applications of human polymorphism. Testing of paternity and other family relationships, immigration cases, typing of biological stains and tissues from criminal casework, identification of human remains by DNA testing methodologies. Description of human polymorphisms of forensic interest, with special interest in DNA polymorphisms. Autosomal DNA polymorphisms, mini- and microsatellites (or short tandem repeats, STRs), single nucleotide polymorphisms (SNPs), X and Y chromosome polymorphisms, mtDNA polymorphisms, and any other type of DNA variation with potential forensic applications. Non-human DNA polymorphisms for crime scene investigation. Population genetics of human polymorphisms of forensic interest. Population data, especially from DNA polymorphisms of interest for the solution of forensic problems. DNA typing methodologies and strategies. Biostatistical methods in forensic genetics. Evaluation of DNA evidence in forensic problems (such as paternity or immigration cases, criminal casework, identification), classical and new statistical approaches. Standards in forensic genetics. Recommendations of regulatory bodies concerning methods, markers, interpretation or strategies or proposals for procedural or technical standards. Quality control. Quality control and quality assurance strategies, proficiency testing for DNA typing methodologies. Criminal DNA databases. Technical, legal and statistical issues. General ethical and legal issues related to forensic genetics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信