SOX7: Autism Associated Gene Identified by Analysis of Multi-Omics Data.

Samantha Gonzales, Jane Zizhen Zhao, Na Young Choi, Prabha Acharya, Sehoon Jeong, Xuexia Wang, Moo-Yeal Lee
{"title":"SOX7: Autism Associated Gene Identified by Analysis of Multi-Omics Data.","authors":"Samantha Gonzales, Jane Zizhen Zhao, Na Young Choi, Prabha Acharya, Sehoon Jeong, Xuexia Wang, Moo-Yeal Lee","doi":"10.1101/2023.05.26.542456","DOIUrl":null,"url":null,"abstract":"<p><p>Genome-wide association studies and next generation sequencing data analyses based on DNA information have identified thousands of mutations associated with autism spectrum disorder (ASD). However, more than 99% of identified mutations are non-coding. Thus, it is unclear which of these mutations might be functional and thus potentially causal variants. Transcriptomic profiling using total RNA-sequencing has been one of the most utilized approaches to link protein levels to genetic information at the molecular level. The transcriptome captures molecular genomic complexity that the DNA sequence solely does not. Some mutations alter a gene's DNA sequence but do not necessarily change expression and/or protein function. To date, few common variants reliably associated with the diagnosis status of ASD despite consistently high estimates of heritability. In addition, reliable biomarkers used to diagnose ASD or molecular mechanisms to define the severity of ASD do not exist. Therefore, it is necessary to integrate DNA and RNA testing together to identify true causal genes and propose useful biomarkers for ASD. We performed gene-based association studies with adaptive test using genome-wide association studies (GWAS) summary statistics with two large GWAS datasets (ASD 2019 data: 18,382 ASD cases and 27,969 controls [discovery data]; ASD 2017 data: 6,197 ASD cases and 7,377 controls [replication data]) which were obtained from the Psychiatric Genomics Consortium (PGC). In addition, we investigated differential expression between ASD cases and controls for genes identified in gene-based GWAS with two RNA-seq datasets (GSE211154: 20 cases and 19 controls; GSE30573: 3 cases and 3 controls). We identified 5 genes significantly associated with ASD in ASD 2019 data ( <i>KIZ-AS1</i> , <i>p</i> =8.67×10 <sup>-10</sup> ; <i>KIZ</i> , <i>p</i> =1.16×10 <sup>-9</sup> ; <i>XRN2</i> , <i>p</i> =7.73×10 <sup>-9</sup> ; <i>SOX7</i> , <i>p</i> =2.22×10 <sup>-7</sup> ; <i>LOC101929229</i> also known as <i>PINX1-DT</i> , <i>p</i> =2.14×10 <sup>-6</sup> ). Among these 5 genes, gene <i>SOX7</i> ( <i>p</i> =0.00087) and <i>LOC101929229</i> ( <i>p</i> =0.009) were replicated in ASD 2017 data. <i>KIZ-AS1</i> ( <i>p</i> =0.059) and <i>KIZ</i> ( <i>p</i> =0.06) were close to the boundary of replication in ASD 2017 data. Genes <i>SOX7</i> ( <i>p</i> =0.036 in all samples; <i>p</i> =0.044 in white samples) indicated significant expression differences between cases and controls in the GSE211154 RNA-seq data. Furthermore, gene <i>SOX7</i> was upregulated in cases than in controls in the GSE30573 RNA-seq data ( <i>p</i> =0.0017; Benjamini-Hochberg adjusted <i>p</i> =0.0085). <i>SOX7</i> encodes a member of the SOX (SRY-related HMG-box) family of transcription factors pivotally contributing to determining of the cell fate and identity in many lineages. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins leading to autism. Gene <i>SOX7</i> in the transcription factor family could be associated with ASD. This finding may provide new diagnostic and therapeutic strategies for ASD.</p>","PeriodicalId":72407,"journal":{"name":"bioRxiv : the preprint server for biology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4a/22/nihpp-2023.05.26.542456v1.PMC10245991.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.05.26.542456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Genome-wide association studies and next generation sequencing data analyses based on DNA information have identified thousands of mutations associated with autism spectrum disorder (ASD). However, more than 99% of identified mutations are non-coding. Thus, it is unclear which of these mutations might be functional and thus potentially causal variants. Transcriptomic profiling using total RNA-sequencing has been one of the most utilized approaches to link protein levels to genetic information at the molecular level. The transcriptome captures molecular genomic complexity that the DNA sequence solely does not. Some mutations alter a gene's DNA sequence but do not necessarily change expression and/or protein function. To date, few common variants reliably associated with the diagnosis status of ASD despite consistently high estimates of heritability. In addition, reliable biomarkers used to diagnose ASD or molecular mechanisms to define the severity of ASD do not exist. Therefore, it is necessary to integrate DNA and RNA testing together to identify true causal genes and propose useful biomarkers for ASD. We performed gene-based association studies with adaptive test using genome-wide association studies (GWAS) summary statistics with two large GWAS datasets (ASD 2019 data: 18,382 ASD cases and 27,969 controls [discovery data]; ASD 2017 data: 6,197 ASD cases and 7,377 controls [replication data]) which were obtained from the Psychiatric Genomics Consortium (PGC). In addition, we investigated differential expression between ASD cases and controls for genes identified in gene-based GWAS with two RNA-seq datasets (GSE211154: 20 cases and 19 controls; GSE30573: 3 cases and 3 controls). We identified 5 genes significantly associated with ASD in ASD 2019 data ( KIZ-AS1 , p =8.67×10 -10 ; KIZ , p =1.16×10 -9 ; XRN2 , p =7.73×10 -9 ; SOX7 , p =2.22×10 -7 ; LOC101929229 also known as PINX1-DT , p =2.14×10 -6 ). Among these 5 genes, gene SOX7 ( p =0.00087) and LOC101929229 ( p =0.009) were replicated in ASD 2017 data. KIZ-AS1 ( p =0.059) and KIZ ( p =0.06) were close to the boundary of replication in ASD 2017 data. Genes SOX7 ( p =0.036 in all samples; p =0.044 in white samples) indicated significant expression differences between cases and controls in the GSE211154 RNA-seq data. Furthermore, gene SOX7 was upregulated in cases than in controls in the GSE30573 RNA-seq data ( p =0.0017; Benjamini-Hochberg adjusted p =0.0085). SOX7 encodes a member of the SOX (SRY-related HMG-box) family of transcription factors pivotally contributing to determining of the cell fate and identity in many lineages. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins leading to autism. Gene SOX7 in the transcription factor family could be associated with ASD. This finding may provide new diagnostic and therapeutic strategies for ASD.

Abstract Image

Abstract Image

Abstract Image

SOX7:通过多组分数据分析鉴定的新型自闭症基因。
背景:基于DNA信息的全基因组关联研究和下一代测序数据分析已经确定了数千个与自闭症谱系障碍(ASD)相关的突变。然而,99%以上的已鉴定突变是非编码的。因此,目前尚不清楚这些突变中的哪一个可能是功能性的,从而可能导致变异。使用总RNA测序的转录组分析是在分子水平上将蛋白质水平与遗传信息联系起来的最常用方法之一。转录组捕获了分子基因组的复杂性,而DNA序列则没有。一些突变改变了基因的DNA序列,但不一定会改变表达和/或蛋白质功能。迄今为止,尽管遗传力的估计值一直很高,但很少有常见变异与ASD的诊断状态可靠相关。此外,用于诊断ASD的可靠生物标志物或定义ASD严重程度的分子机制尚不存在。目的:有必要将DNA和RNA检测整合在一起,以确定真正的致病基因,并提出有用的ASD生物标志物。方法:我们使用全基因组关联研究(GWAS)汇总统计数据对两个大型GWAS数据集(ASD 2019数据:18382例ASD病例和27969例对照[发现数据];ASD 2017数据:6197例ASD和7377例对照[复制数据])进行了基于基因的关联研究和适应性测试,这两个数据集来自精神基因组学联盟(PGC)。此外,我们使用DESeq2软件包,用RNA-seq数据集(GSE30573:3例病例和3例对照)研究了在基于基因的GWAS中鉴定的基因的差异表达。结果:我们在2019年ASD数据中确定了5个与ASD显著相关的基因(KIZ-AS1,p=8.67×10-10;KIZ,p=1.16×10-9;XRN2,p=7.73×10-9,SOX7,p=2.22×10-7;PINX1-DT,p=2.14×10-6)。在这5个基因中,SOX77基因(p=0.000087)、LOC101929229基因(p=0.009)和KIZ-AS11基因(p=0.059)在2017年ASD的数据中被复制。KIZ(p=0.06)接近ASD 2017数据中的复制边界。基因SOX7(p=0.0017,调整后的p=0.0085)、LOC101929229(也称为PINX1-DT,p=5.83×10-7,调整后p=1.18×10-5)和KIZ(p=0.000099,调整后p=0.0055)在RNA-seq数据中显示病例和对照组之间的显著表达差异。SOX7编码转录因子SOX(SRY相关HMG盒)家族的一个成员,在许多谱系中对细胞命运和身份的决定起着重要作用。在与其他导致自闭症的蛋白质形成蛋白质复合物后,编码的蛋白质可能充当转录调节因子。结论:转录因子家族SOX7基因可能与ASD有关。这一发现可能为ASD提供新的诊断和治疗策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信