利用全基因组测序和机器学习寻找与家族性高胆固醇血症表型相关的新基因。

IF 0.9 Q3 AGRICULTURE, MULTIDISCIPLINARY
D E Ivanoshchuk, A B Kolker, O V Timoshchenko, S E Semaev, E V Shakhtshneider
{"title":"利用全基因组测序和机器学习寻找与家族性高胆固醇血症表型相关的新基因。","authors":"D E Ivanoshchuk,&nbsp;A B Kolker,&nbsp;O V Timoshchenko,&nbsp;S E Semaev,&nbsp;E V Shakhtshneider","doi":"10.18699/VJGB-23-63","DOIUrl":null,"url":null,"abstract":"<p><p>One of the most common congenital metabolic disorders is familial hypercholesterolemia. Familial hypercholesterolemia is a condition caused by a type of genetic defect leading to a decreased rate of removal of low-density lipoproteins from the bloodstream and a pronounced increase in the blood level of total cholesterol. This disease leads to the early development of cardiovascular diseases of atherosclerotic etiology. Familial hypercholesterolemia is a monogenic disease that is predominantly autosomal dominant. Rare pathogenic variants in the LDLR gene are present in 75-85 % of cases with an identified molecular genetic cause of the disease, and variants in other genes (APOB, PCSK9, LDLRAP1, ABCG5, ABCG8, and others) occur at a frequency of < 5 % in this group of patients. A negative result of genetic screening for pathogenic variants in genes of the low-density lipoprotein receptor and its ligands does not rule out a diagnosis of familial hypercholesterolemia. In 20-40 % of cases, molecular genetic testing fails to detect changes in the above genes. The aim of this work was to search for new genes associated with the familial hypercholesterolemia phenotype by modern high-tech methods of sequencing and machine learning. On the basis of a group of patients with familial hypercholesterolemia (enrolled according to the Dutch Lipid Clinic Network Criteria and including cases confirmed by molecular genetic analysis), decision trees were constructed, which made it possible to identify cases in the study population that require additional molecular genetic analysis. Five probands were identified as having the severest familial hypercholesterolemia without pathogenic variants in the studied genes and were analyzed by whole-genome sequencing on the HiSeq 1500 platform (Illumina). The whole-genome sequencing revealed rare variants in three out of five analyzed patients: a heterozygous variant (rs760657350) located in a splicing acceptor site in the PLD1 gene (c.2430-1G>A), a previously undescribed single-nucleotide deletion in the SIDT1 gene [c.2426del (p.Leu809CysfsTer2)], new missense variant c.10313C>G (p.Pro3438Arg) in the LRP1B gene, and single-nucleotide deletion variant rs753876598 [c.165del (p.Ser56AlafsTer11)] in the CETP gene. All these variants were found for the first time in patients with a clinical diagnosis of familial hypercholesterolemia. Variants were identified that may influence the formation of the familial hypercholesterolemia phenotype.</p>","PeriodicalId":44339,"journal":{"name":"Vavilovskii Zhurnal Genetiki i Selektsii","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551936/pdf/","citationCount":"0","resultStr":"{\"title\":\"Searching for new genes associated with the familial hypercholesterolemia phenotype using whole-genome sequencing and machine learning.\",\"authors\":\"D E Ivanoshchuk,&nbsp;A B Kolker,&nbsp;O V Timoshchenko,&nbsp;S E Semaev,&nbsp;E V Shakhtshneider\",\"doi\":\"10.18699/VJGB-23-63\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>One of the most common congenital metabolic disorders is familial hypercholesterolemia. Familial hypercholesterolemia is a condition caused by a type of genetic defect leading to a decreased rate of removal of low-density lipoproteins from the bloodstream and a pronounced increase in the blood level of total cholesterol. This disease leads to the early development of cardiovascular diseases of atherosclerotic etiology. Familial hypercholesterolemia is a monogenic disease that is predominantly autosomal dominant. Rare pathogenic variants in the LDLR gene are present in 75-85 % of cases with an identified molecular genetic cause of the disease, and variants in other genes (APOB, PCSK9, LDLRAP1, ABCG5, ABCG8, and others) occur at a frequency of < 5 % in this group of patients. A negative result of genetic screening for pathogenic variants in genes of the low-density lipoprotein receptor and its ligands does not rule out a diagnosis of familial hypercholesterolemia. In 20-40 % of cases, molecular genetic testing fails to detect changes in the above genes. The aim of this work was to search for new genes associated with the familial hypercholesterolemia phenotype by modern high-tech methods of sequencing and machine learning. On the basis of a group of patients with familial hypercholesterolemia (enrolled according to the Dutch Lipid Clinic Network Criteria and including cases confirmed by molecular genetic analysis), decision trees were constructed, which made it possible to identify cases in the study population that require additional molecular genetic analysis. Five probands were identified as having the severest familial hypercholesterolemia without pathogenic variants in the studied genes and were analyzed by whole-genome sequencing on the HiSeq 1500 platform (Illumina). The whole-genome sequencing revealed rare variants in three out of five analyzed patients: a heterozygous variant (rs760657350) located in a splicing acceptor site in the PLD1 gene (c.2430-1G>A), a previously undescribed single-nucleotide deletion in the SIDT1 gene [c.2426del (p.Leu809CysfsTer2)], new missense variant c.10313C>G (p.Pro3438Arg) in the LRP1B gene, and single-nucleotide deletion variant rs753876598 [c.165del (p.Ser56AlafsTer11)] in the CETP gene. All these variants were found for the first time in patients with a clinical diagnosis of familial hypercholesterolemia. Variants were identified that may influence the formation of the familial hypercholesterolemia phenotype.</p>\",\"PeriodicalId\":44339,\"journal\":{\"name\":\"Vavilovskii Zhurnal Genetiki i Selektsii\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551936/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Vavilovskii Zhurnal Genetiki i Selektsii\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18699/VJGB-23-63\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vavilovskii Zhurnal Genetiki i Selektsii","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18699/VJGB-23-63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

家族性高胆固醇血症是最常见的先天性代谢紊乱之一。家族性高胆固醇血症是一种由遗传缺陷引起的疾病,导致血液中低密度脂蛋白的去除率降低,血液中总胆固醇水平显著升高。这种疾病导致动脉粥样硬化病因的心血管疾病的早期发展。家族性高胆固醇血症是一种以常染色体显性遗传为主的单基因疾病。在75-85%的已确定的疾病分子遗传原因的病例中,LDLR基因中存在罕见的致病性变体,而在这组患者中,其他基因(APOB、PCSK9、LDLRAP1、ABCG5、ABCG8等)的变体发生频率<5%。低密度脂蛋白受体及其配体基因致病性变体基因筛查的阴性结果并不排除家族性高胆固醇血症的诊断。在20-40%的病例中,分子遗传学检测未能检测到上述基因的变化。这项工作的目的是通过现代高科技测序和机器学习方法寻找与家族性高胆固醇血症表型相关的新基因。在一组家族性高胆固醇血症患者(根据荷兰脂质诊所网络标准登记,包括通过分子遗传分析确认的病例)的基础上,构建了决策树,这使得在研究人群中识别需要额外分子遗传分析的病例成为可能。五名先证者被确定为患有最严重的家族性高胆固醇血症,在所研究的基因中没有致病性变体,并通过HiSeq 1500平台(Illumina)上的全基因组测序进行分析。全基因组测序显示,五分之三的分析患者存在罕见变异:位于PLD1基因剪接受体位点的杂合变异株(rs760657350)(c.2430-1G>a),SIDT1基因中先前未描述的单核苷酸缺失[c.2426del(p.Leu809CysfsTer2)],LRP1B基因中的新错义变异株c.10313C>G(p.Pro3438Arg),和CETP基因中的单核苷酸缺失变体rs753876598[c.165del(p.Ser56AlafsTer11)]。所有这些变异都是首次在临床诊断为家族性高胆固醇血症的患者中发现的。发现了可能影响家族性高胆固醇血症表型形成的变异株。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Searching for new genes associated with the familial hypercholesterolemia phenotype using whole-genome sequencing and machine learning.

Searching for new genes associated with the familial hypercholesterolemia phenotype using whole-genome sequencing and machine learning.

Searching for new genes associated with the familial hypercholesterolemia phenotype using whole-genome sequencing and machine learning.

One of the most common congenital metabolic disorders is familial hypercholesterolemia. Familial hypercholesterolemia is a condition caused by a type of genetic defect leading to a decreased rate of removal of low-density lipoproteins from the bloodstream and a pronounced increase in the blood level of total cholesterol. This disease leads to the early development of cardiovascular diseases of atherosclerotic etiology. Familial hypercholesterolemia is a monogenic disease that is predominantly autosomal dominant. Rare pathogenic variants in the LDLR gene are present in 75-85 % of cases with an identified molecular genetic cause of the disease, and variants in other genes (APOB, PCSK9, LDLRAP1, ABCG5, ABCG8, and others) occur at a frequency of < 5 % in this group of patients. A negative result of genetic screening for pathogenic variants in genes of the low-density lipoprotein receptor and its ligands does not rule out a diagnosis of familial hypercholesterolemia. In 20-40 % of cases, molecular genetic testing fails to detect changes in the above genes. The aim of this work was to search for new genes associated with the familial hypercholesterolemia phenotype by modern high-tech methods of sequencing and machine learning. On the basis of a group of patients with familial hypercholesterolemia (enrolled according to the Dutch Lipid Clinic Network Criteria and including cases confirmed by molecular genetic analysis), decision trees were constructed, which made it possible to identify cases in the study population that require additional molecular genetic analysis. Five probands were identified as having the severest familial hypercholesterolemia without pathogenic variants in the studied genes and were analyzed by whole-genome sequencing on the HiSeq 1500 platform (Illumina). The whole-genome sequencing revealed rare variants in three out of five analyzed patients: a heterozygous variant (rs760657350) located in a splicing acceptor site in the PLD1 gene (c.2430-1G>A), a previously undescribed single-nucleotide deletion in the SIDT1 gene [c.2426del (p.Leu809CysfsTer2)], new missense variant c.10313C>G (p.Pro3438Arg) in the LRP1B gene, and single-nucleotide deletion variant rs753876598 [c.165del (p.Ser56AlafsTer11)] in the CETP gene. All these variants were found for the first time in patients with a clinical diagnosis of familial hypercholesterolemia. Variants were identified that may influence the formation of the familial hypercholesterolemia phenotype.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Vavilovskii Zhurnal Genetiki i Selektsii
Vavilovskii Zhurnal Genetiki i Selektsii AGRICULTURE, MULTIDISCIPLINARY-
CiteScore
1.90
自引率
0.00%
发文量
119
审稿时长
8 weeks
期刊介绍: The "Vavilov Journal of genetics and breeding" publishes original research and review articles in all key areas of modern plant, animal and human genetics, genomics, bioinformatics and biotechnology. One of the main objectives of the journal is integration of theoretical and applied research in the field of genetics. Special attention is paid to the most topical areas in modern genetics dealing with global concerns such as food security and human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信