Gene–environment interaction analysis via deep learning

IF 1.7 4区 医学 Q3 GENETICS & HEREDITY
Shuni Wu, Yaqing Xu, Qingzhao Zhang, Shuangge Ma
{"title":"Gene–environment interaction analysis via deep learning","authors":"Shuni Wu,&nbsp;Yaqing Xu,&nbsp;Qingzhao Zhang,&nbsp;Shuangge Ma","doi":"10.1002/gepi.22518","DOIUrl":null,"url":null,"abstract":"<p>Gene–environment (G–E) interaction analysis plays an important role in studying complex diseases. Extensive methodological research has been conducted on G–E interaction analysis, and the existing methods are mostly based on regression techniques. In many fields including biomedicine and omics, it has been increasingly recognized that deep learning may outperform regression with its unique flexibility (e.g., in accommodating unspecified nonlinear effects) and superior prediction performance. However, there has been a lack of development in deep learning for G–E interaction analysis. In this article, we fill this important knowledge gap and develop a new analysis approach based on deep neural network in conjunction with penalization. The proposed approach can simultaneously conduct model estimation and selection (of important main G effects and G–E interactions), while uniquely respecting the “main effects, interactions” variable selection hierarchy. Simulation shows that it has superior prediction and feature selection performance. The analysis of data on lung adenocarcinoma and skin cutaneous melanoma overall survival further establishes its practical utility. Overall, this study can advance G–E interaction analysis by delivering a powerful new analysis approach based on modern deep learning.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 3","pages":"261-286"},"PeriodicalIF":1.7000,"publicationDate":"2023-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22518","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.22518","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 1

Abstract

Gene–environment (G–E) interaction analysis plays an important role in studying complex diseases. Extensive methodological research has been conducted on G–E interaction analysis, and the existing methods are mostly based on regression techniques. In many fields including biomedicine and omics, it has been increasingly recognized that deep learning may outperform regression with its unique flexibility (e.g., in accommodating unspecified nonlinear effects) and superior prediction performance. However, there has been a lack of development in deep learning for G–E interaction analysis. In this article, we fill this important knowledge gap and develop a new analysis approach based on deep neural network in conjunction with penalization. The proposed approach can simultaneously conduct model estimation and selection (of important main G effects and G–E interactions), while uniquely respecting the “main effects, interactions” variable selection hierarchy. Simulation shows that it has superior prediction and feature selection performance. The analysis of data on lung adenocarcinoma and skin cutaneous melanoma overall survival further establishes its practical utility. Overall, this study can advance G–E interaction analysis by delivering a powerful new analysis approach based on modern deep learning.

基于深度学习的基因-环境相互作用分析
基因-环境互作分析在复杂疾病研究中起着重要作用。目前国内外对G-E相互作用分析进行了大量的方法学研究,现有的方法多基于回归技术。在包括生物医学和组学在内的许多领域,人们越来越认识到深度学习可能以其独特的灵活性(例如,在适应未指明的非线性效应方面)和优越的预测性能优于回归。然而,在G-E相互作用分析的深度学习方面一直缺乏发展。在本文中,我们填补了这一重要的知识空白,并开发了一种基于深度神经网络与惩罚相结合的新的分析方法。该方法可以同时进行模型估计和选择(重要的主G效应和G - e相互作用),同时独特地尊重“主效应,相互作用”变量选择层次。仿真结果表明,该方法具有较好的预测和特征选择性能。对肺腺癌和皮肤黑色素瘤总体生存数据的分析进一步证实了其实用性。总的来说,本研究可以通过提供基于现代深度学习的强大的新分析方法来推进G-E交互分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genetic Epidemiology
Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.40
自引率
9.50%
发文量
49
审稿时长
6-12 weeks
期刊介绍: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信