Modification of the Method for Calculating Polygenic Risks With Variation Graph

Trudy Instituta sistemnogo programmirovaniia RAN Pub Date : 2022-01-01 DOI:10.15514/ispras-2022-34(2)-15

O. Kondrateva, E. Karpulevich

引用次数: 0

Abstract

Representation of the DNA sequence is possible in various ways. The variation graph is one of the most accurate methods that allows you to work with atypical areas and take into account all their diversity. Based on this data structure and the polygenic risk assessment method, a DNA interpretation system was built. As a result, a correlation coefficient was obtained between the path in the column responsible for a specific DNA sequence and the feature. We then compared it with a coefficient obtained by a similar method but using sequence representation using a reference genome. Such a comparison helped to evaluate the effectiveness of the representation in the form of a graph. After that, a modified method for calculating the polygenic score on the alignment data of the vg tool was built, which was also compared with existing methods. The modified method showed an improvement in the prediction of the trait.

查看原文本刊更多论文

用变异图计算多基因风险方法的改进

DNA序列的表示可能有多种方式。变异图是最准确的方法之一，它允许您处理非典型区域并考虑其所有多样性。基于该数据结构和多基因风险评估方法，构建了DNA判读系统。结果，在负责特定DNA序列的列中的路径与特征之间获得了相关系数。然后，我们将其与使用参考基因组序列表示的类似方法获得的系数进行比较。这样的比较有助于评估以图表形式表示的有效性。在此基础上，建立了一种基于vg工具比对数据的多基因得分计算方法，并与现有方法进行了比较。改进后的方法在性状预测方面有一定的提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Trudy Instituta sistemnogo programmirovaniia RAN

自引率

0.00%

发文量

审稿时长

4 weeks