利用病例对照母子基因型数据有效推断原生父母效应

IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY
Yuang Tian , Hong Zhang , Alexandre Bureau , Hagit Hochner , Jinbo Chen
{"title":"利用病例对照母子基因型数据有效推断原生父母效应","authors":"Yuang Tian ,&nbsp;Hong Zhang ,&nbsp;Alexandre Bureau ,&nbsp;Hagit Hochner ,&nbsp;Jinbo Chen","doi":"10.1016/j.jspi.2024.106190","DOIUrl":null,"url":null,"abstract":"<div><p>Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"233 ","pages":"Article 106190"},"PeriodicalIF":0.8000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient inference of parent-of-origin effect using case-control mother–child genotype data\",\"authors\":\"Yuang Tian ,&nbsp;Hong Zhang ,&nbsp;Alexandre Bureau ,&nbsp;Hagit Hochner ,&nbsp;Jinbo Chen\",\"doi\":\"10.1016/j.jspi.2024.106190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.</p></div>\",\"PeriodicalId\":50039,\"journal\":{\"name\":\"Journal of Statistical Planning and Inference\",\"volume\":\"233 \",\"pages\":\"Article 106190\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2024-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Statistical Planning and Inference\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378375824000478\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375824000478","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

亲本效应在哺乳动物的发育和疾病中发挥着重要作用。病例对照母子配对基因型数据可用于检测亲本效应,而且在实践中通常很容易收集。大多数现有的评估亲本效应的方法都没有纳入任何协变量,而控制混杂因素可能需要协变量。我们建议通过逻辑回归模型来模拟父母-原籍效应,预测因子包括母子基因型、父母原籍和协变量。根据目标遗传标记的基因型可能无法完全推断出父母的来源,因此我们建议使用与目标标记紧密相连的标记的基因型来提高推断效率。我们开发了一种稳健的统计推断程序,该程序以追溯的方式基于修正的轮廓对数概率。设计了一种计算上可行的期望最大化算法来估计修正的剖面对数似然所涉及的所有未知参数。该算法与传统的期望最大化算法不同,它是基于修正后的轮廓对数似然函数,而不是原始的轮廓对数似然函数。该算法的收敛性是在一些温和的正则条件下确定的。这种期望最大化算法还能方便地处理缺失的子基因型。在一些温和的正则性条件下,为所提出的估计器建立了大样本特性,包括弱一致性、渐近正则性和渐近效率。通过广泛的模拟研究和对真实数据集的应用,对有限样本特性进行了评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficient inference of parent-of-origin effect using case-control mother–child genotype data

Parent-of-origin effect plays an important role in mammal development and disorder. Case-control mother–child pair genotype data can be used to detect parent-of-origin effect and is often convenient to collect in practice. Most existing methods for assessing parent-of-origin effect do not incorporate any covariates, which may be required to control for confounding factors. We propose to model the parent-of-origin effect through a logistic regression model, with predictors including maternal and child genotypes, parental origins, and covariates. The parental origins may not be fully inferred from genotypes of a target genetic marker, so we propose to use genotypes of markers tightly linked to the target marker to increase inference efficiency. A robust statistical inference procedure is developed based on a modified profile log-likelihood in a retrospective way. A computationally feasible expectation–maximization algorithm is devised to estimate all unknown parameters involved in the modified profile log-likelihood. This algorithm differs from the conventional expectation–maximization algorithm in the sense that it is based on a modified instead of the original profile log-likelihood function. The convergence of the algorithm is established under some mild regularity conditions. This expectation–maximization algorithm also allows convenient handling of missing child genotypes. Large sample properties, including weak consistency, asymptotic normality, and asymptotic efficiency, are established for the proposed estimator under some mild regularity conditions. Finite sample properties are evaluated through extensive simulation studies and the application to a real dataset.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Statistical Planning and Inference
Journal of Statistical Planning and Inference 数学-统计学与概率论
CiteScore
2.10
自引率
11.10%
发文量
78
审稿时长
3-6 weeks
期刊介绍: The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists. We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信