Identification of significant SNPs and the quantification of correlation using genomic informational field theory (GIFT)

IF 1.8 4区 数学 Q2 BIOLOGY
Mathematical Biosciences Pub Date : 2026-03-01 Epub Date: 2026-01-10 DOI:10.1016/j.mbs.2025.109606
Scott Gadsby , Cyril Rauch , Jonathan A D Wattis
{"title":"Identification of significant SNPs and the quantification of correlation using genomic informational field theory (GIFT)","authors":"Scott Gadsby ,&nbsp;Cyril Rauch ,&nbsp;Jonathan A D Wattis","doi":"10.1016/j.mbs.2025.109606","DOIUrl":null,"url":null,"abstract":"<div><div>Given data on genotypes and phenotypes from a sample population, we show how ordering the data by phenotype and analysing the information contained in the corresponding list of genotypes can identify those SNPs which have a significant correlation with phenotype. We derive formulae for <em>p</em>-values to quantify the significance of each SNP, and show how to analyse the correlations <em>between</em> different SNPs. As well as using classical covariance and correlations, we introduce an information-theoretic measure of correlation which is based on Shannon’s informational entropy. This variational formulation also gives rise to other ways of determining the strength of a SNP’s influence on phenotype in a biallelic population using ‘field’ functions which account for the relationship between phenotype and genotype. By computing this field for each SNP, we are able to quantify the correlations between SNPs. The results are shown to depend on the number of each genostate (aa, Aa and AA) in the population in a predictable manner. The methods are illustrated using data on horse height.</div></div>","PeriodicalId":51119,"journal":{"name":"Mathematical Biosciences","volume":"393 ","pages":"Article 109606"},"PeriodicalIF":1.8000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Biosciences","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0025556425002329","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Given data on genotypes and phenotypes from a sample population, we show how ordering the data by phenotype and analysing the information contained in the corresponding list of genotypes can identify those SNPs which have a significant correlation with phenotype. We derive formulae for p-values to quantify the significance of each SNP, and show how to analyse the correlations between different SNPs. As well as using classical covariance and correlations, we introduce an information-theoretic measure of correlation which is based on Shannon’s informational entropy. This variational formulation also gives rise to other ways of determining the strength of a SNP’s influence on phenotype in a biallelic population using ‘field’ functions which account for the relationship between phenotype and genotype. By computing this field for each SNP, we are able to quantify the correlations between SNPs. The results are shown to depend on the number of each genostate (aa, Aa and AA) in the population in a predictable manner. The methods are illustrated using data on horse height.
利用基因组信息场理论(GIFT)鉴定显著snp和量化相关性。
给定来自样本群体的基因型和表型数据,我们展示了如何按表型排序数据并分析相应基因型列表中包含的信息,从而识别出与表型有显著相关性的snp。我们推导了p值公式来量化每个SNP的重要性,并展示了如何分析不同SNP之间的相关性。在使用经典协方差和相关性的基础上,引入了一种基于香农信息熵的相关度信息度量。这种变分公式还产生了其他方法来确定SNP对双等位基因群体中表型的影响强度,使用“场”函数来解释表型和基因型之间的关系。通过计算每个SNP的这个字段,我们能够量化SNP之间的相关性。结果显示,以可预测的方式依赖于群体中每种基因状态(aa, aa和aa)的数量。用马的身高数据说明了这些方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Mathematical Biosciences
Mathematical Biosciences 生物-生物学
CiteScore
7.50
自引率
2.30%
发文量
67
审稿时长
18 days
期刊介绍: Mathematical Biosciences publishes work providing new concepts or new understanding of biological systems using mathematical models, or methodological articles likely to find application to multiple biological systems. Papers are expected to present a major research finding of broad significance for the biological sciences, or mathematical biology. Mathematical Biosciences welcomes original research articles, letters, reviews and perspectives.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书