Muscle Gene Sets: a versatile methodological aid to functional genomics in the neuromuscular field.

IF 5.3 2区 医学 Q2 CELL BIOLOGY
Apostolos Malatras, Stephanie Duguez, William Duddy
{"title":"Muscle Gene Sets: a versatile methodological aid to functional genomics in the neuromuscular field.","authors":"Apostolos Malatras,&nbsp;Stephanie Duguez,&nbsp;William Duddy","doi":"10.1186/s13395-019-0196-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The approach of building large collections of gene sets and then systematically testing hypotheses across these collections is a powerful tool in functional genomics, both in the pathway analysis of omics data and to uncover the polygenic effects associated with complex diseases in genome-wide association study. The Molecular Signatures Database includes collections of oncogenic and immunologic signatures enabling researchers to compare transcriptional datasets across hundreds of previous studies and leading to important insights in these fields, but such a resource does not currently exist for neuromuscular research. In previous work, we have shown the utility of gene set approaches to understand muscle cell physiology and pathology.</p><p><strong>Methods: </strong>Following a systematic survey of public muscle data, we passed gene expression profiles from 4305 samples through a robust pre-processing and standardized data analysis pipeline. Two hundred eighty-two samples were discarded based on a battery of rigorous global quality controls. From among the remaining studies, 578 comparisons of interest were identified by a combination of text mining and manual curation of the study meta-data. For each comparison, significantly dysregulated genes (FDR adjusted p < 0.05) were identified.</p><p><strong>Results: </strong>Lists of dysregulated genes were divided between upregulated and downregulated to give 1156 Muscle Gene Sets (MGS). This resource is available for download ( www.sys-myo.com/muscle_gene_sets ) and is accessible through three commonly used functional genomics platforms (GSEA, EnrichR, and WebGestalt). Basic guidance and recommendations are provided for the use of MGS through these platforms. In addition, consensus muscle gene sets were created to capture the overlap between the results of similar studies, and analysis of these highlighted the potential for novel disease-relevant findings.</p><p><strong>Conclusions: </strong>The MGS resource can be used to investigate the behaviour of any list of genes across previous comparisons of muscle conditions, to compare previous studies to one another, and to explore the functional relationship of muscle dysregulation to the Gene Ontology. Its major intended use is in enrichment testing for functional genomics analysis.</p>","PeriodicalId":21747,"journal":{"name":"Skeletal Muscle","volume":"9 1","pages":"10"},"PeriodicalIF":5.3000,"publicationDate":"2019-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13395-019-0196-z","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Skeletal Muscle","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13395-019-0196-z","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 8

Abstract

Background: The approach of building large collections of gene sets and then systematically testing hypotheses across these collections is a powerful tool in functional genomics, both in the pathway analysis of omics data and to uncover the polygenic effects associated with complex diseases in genome-wide association study. The Molecular Signatures Database includes collections of oncogenic and immunologic signatures enabling researchers to compare transcriptional datasets across hundreds of previous studies and leading to important insights in these fields, but such a resource does not currently exist for neuromuscular research. In previous work, we have shown the utility of gene set approaches to understand muscle cell physiology and pathology.

Methods: Following a systematic survey of public muscle data, we passed gene expression profiles from 4305 samples through a robust pre-processing and standardized data analysis pipeline. Two hundred eighty-two samples were discarded based on a battery of rigorous global quality controls. From among the remaining studies, 578 comparisons of interest were identified by a combination of text mining and manual curation of the study meta-data. For each comparison, significantly dysregulated genes (FDR adjusted p < 0.05) were identified.

Results: Lists of dysregulated genes were divided between upregulated and downregulated to give 1156 Muscle Gene Sets (MGS). This resource is available for download ( www.sys-myo.com/muscle_gene_sets ) and is accessible through three commonly used functional genomics platforms (GSEA, EnrichR, and WebGestalt). Basic guidance and recommendations are provided for the use of MGS through these platforms. In addition, consensus muscle gene sets were created to capture the overlap between the results of similar studies, and analysis of these highlighted the potential for novel disease-relevant findings.

Conclusions: The MGS resource can be used to investigate the behaviour of any list of genes across previous comparisons of muscle conditions, to compare previous studies to one another, and to explore the functional relationship of muscle dysregulation to the Gene Ontology. Its major intended use is in enrichment testing for functional genomics analysis.

Abstract Image

Abstract Image

肌肉基因集:神经肌肉领域功能基因组学的通用方法学辅助。
背景:建立大量基因集,然后在这些集合中系统地测试假设的方法是功能基因组学的一个强大工具,无论是在组学数据的通路分析中,还是在全基因组关联研究中揭示与复杂疾病相关的多基因效应中。分子特征数据库包括致癌和免疫特征的集合,使研究人员能够比较数百项先前研究的转录数据集,并在这些领域获得重要见解,但目前还不存在用于神经肌肉研究的此类资源。在之前的工作中,我们已经展示了基因集方法在理解肌肉细胞生理学和病理学方面的实用性。方法:在对公共肌肉数据进行系统调查后,我们通过强大的预处理和标准化的数据分析管道,通过4305个样本的基因表达谱。根据严格的全球质量控制,共有二百八十二个样本被丢弃。在剩下的研究中,通过文本挖掘和研究元数据的手动管理相结合,确定了578个感兴趣的比较。对于每次比较,显著失调的基因(FDR调整p 结果:将失调基因列表分为上调和下调,得到1156个肌肉基因集(MGS)。该资源可供下载(www.sys-myo.com/muscle_gene_sets),并可通过三个常用的功能基因组学平台(GSEA、EnrichR和WebGestalt)访问。为通过这些平台使用MGS提供了基本指导和建议。此外,建立了一致的肌肉基因集,以捕捉类似研究结果之间的重叠,对这些结果的分析突出了新的疾病相关发现的潜力。结论:MGS资源可用于研究任何基因列表在先前肌肉状况比较中的行为,将先前的研究相互比较,并探索肌肉失调与基因本体论的功能关系。它的主要用途是用于功能基因组学分析的富集测试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Skeletal Muscle
Skeletal Muscle CELL BIOLOGY-
CiteScore
9.10
自引率
0.00%
发文量
25
审稿时长
12 weeks
期刊介绍: The only open access journal in its field, Skeletal Muscle publishes novel, cutting-edge research and technological advancements that investigate the molecular mechanisms underlying the biology of skeletal muscle. Reflecting the breadth of research in this area, the journal welcomes manuscripts about the development, metabolism, the regulation of mass and function, aging, degeneration, dystrophy and regeneration of skeletal muscle, with an emphasis on understanding adult skeletal muscle, its maintenance, and its interactions with non-muscle cell types and regulatory modulators. Main areas of interest include: -differentiation of skeletal muscle- atrophy and hypertrophy of skeletal muscle- aging of skeletal muscle- regeneration and degeneration of skeletal muscle- biology of satellite and satellite-like cells- dystrophic degeneration of skeletal muscle- energy and glucose homeostasis in skeletal muscle- non-dystrophic genetic diseases of skeletal muscle, such as Spinal Muscular Atrophy and myopathies- maintenance of neuromuscular junctions- roles of ryanodine receptors and calcium signaling in skeletal muscle- roles of nuclear receptors in skeletal muscle- roles of GPCRs and GPCR signaling in skeletal muscle- other relevant aspects of skeletal muscle biology. In addition, articles on translational clinical studies that address molecular and cellular mechanisms of skeletal muscle will be published. Case reports are also encouraged for submission. Skeletal Muscle reflects the breadth of research on skeletal muscle and bridges gaps between diverse areas of science for example cardiac cell biology and neurobiology, which share common features with respect to cell differentiation, excitatory membranes, cell-cell communication, and maintenance. Suitable articles are model and mechanism-driven, and apply statistical principles where appropriate; purely descriptive studies are of lesser interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信