大规模SNP-SNP相互作用检测的分布式进化框架

Fangting Li, Yuhai Zhao, Boxin Guan, Yuan Li
{"title":"大规模SNP-SNP相互作用检测的分布式进化框架","authors":"Fangting Li, Yuhai Zhao, Boxin Guan, Yuan Li","doi":"10.1109/BIBM55620.2022.9995483","DOIUrl":null,"url":null,"abstract":"Capturing the complex interactions of single nucleotide polymorphisms (SNPs) is considered essential for etiological analysis in genome-wide association analysis (GWAS). Evolutionary algorithms (EAs) have been extensively adopted for SNP-SNP interaction detection. Most existing EA-based methods focus on enhancing the search ability of EA itself. However, as the scale of SNP data further increases, the exponentially growing search space gradually becomes the dominant factor leading to the performance degradation of EA-based methods. To this end, a distributed evolutionary framework based on space partitioning (SP-EF) is proposed to detect SNP-SNP interactions on large-scale datasets. Distinct from the traditional population-distributed approaches, SP-EF first partitions the entire search space into several subspaces from the perspective of data. The space partitioning strategy is non-destructive since it guarantees that each feasible solution is assigned to a specific subspace. Thereafter, each subspace is explored by an EA optimizer independently and all the subspaces are optimized in parallel. Lastly, the final output is selected from the local optima in the historical search of each subspace. SP-EF can not only cope with the heavy computational burden of SNP combination evaluation but also enhance the diversity of the population to avoid local optima. Notably, SP-EF is load-balanced and scalable since it can flexibly partition the space according to the number of available computational nodes and the problem size. To show the practicability of SP-EF, we further design a discrete fireworks algorithm (DFWA) with three problem-guided operators as an EA optimizer. Experiments on artificial and real-world datasets demonstrate that our method significantly improves search speed and search accuracy.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Distributed Evolutionary Framework for Large-scale SNP-SNP Interaction Detection\",\"authors\":\"Fangting Li, Yuhai Zhao, Boxin Guan, Yuan Li\",\"doi\":\"10.1109/BIBM55620.2022.9995483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Capturing the complex interactions of single nucleotide polymorphisms (SNPs) is considered essential for etiological analysis in genome-wide association analysis (GWAS). Evolutionary algorithms (EAs) have been extensively adopted for SNP-SNP interaction detection. Most existing EA-based methods focus on enhancing the search ability of EA itself. However, as the scale of SNP data further increases, the exponentially growing search space gradually becomes the dominant factor leading to the performance degradation of EA-based methods. To this end, a distributed evolutionary framework based on space partitioning (SP-EF) is proposed to detect SNP-SNP interactions on large-scale datasets. Distinct from the traditional population-distributed approaches, SP-EF first partitions the entire search space into several subspaces from the perspective of data. The space partitioning strategy is non-destructive since it guarantees that each feasible solution is assigned to a specific subspace. Thereafter, each subspace is explored by an EA optimizer independently and all the subspaces are optimized in parallel. Lastly, the final output is selected from the local optima in the historical search of each subspace. SP-EF can not only cope with the heavy computational burden of SNP combination evaluation but also enhance the diversity of the population to avoid local optima. Notably, SP-EF is load-balanced and scalable since it can flexibly partition the space according to the number of available computational nodes and the problem size. To show the practicability of SP-EF, we further design a discrete fireworks algorithm (DFWA) with three problem-guided operators as an EA optimizer. Experiments on artificial and real-world datasets demonstrate that our method significantly improves search speed and search accuracy.\",\"PeriodicalId\":210337,\"journal\":{\"name\":\"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM55620.2022.9995483\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9995483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

捕获单核苷酸多态性(snp)的复杂相互作用被认为是全基因组关联分析(GWAS)中病因分析的必要条件。进化算法(EAs)被广泛应用于SNP-SNP相互作用检测。现有的基于EA的方法大多侧重于增强EA本身的搜索能力。然而,随着SNP数据规模的进一步增加,指数增长的搜索空间逐渐成为导致基于ea的方法性能下降的主要因素。为此,提出了一种基于空间划分的分布式进化框架(SP-EF)来检测大规模数据集上的SNP-SNP相互作用。与传统的人口分布方法不同,SP-EF首先从数据的角度将整个搜索空间划分为几个子空间。空间划分策略是非破坏性的,因为它保证了每个可行解被分配到一个特定的子空间。然后,每个子空间由EA优化器独立探索,所有子空间并行优化。最后,从每个子空间的历史搜索的局部最优中选择最终输出。SP-EF算法不仅可以应对SNP组合评估的繁重计算负担,还可以增强种群的多样性,避免局部最优。值得注意的是,SP-EF具有负载均衡和可扩展性,因为它可以根据可用计算节点的数量和问题大小灵活地划分空间。为了证明SP-EF的实用性,我们进一步设计了一个带有三个问题导向算子的离散烟花算法(DFWA)作为EA优化器。在人工数据集和真实数据集上的实验表明,我们的方法显著提高了搜索速度和搜索精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Distributed Evolutionary Framework for Large-scale SNP-SNP Interaction Detection
Capturing the complex interactions of single nucleotide polymorphisms (SNPs) is considered essential for etiological analysis in genome-wide association analysis (GWAS). Evolutionary algorithms (EAs) have been extensively adopted for SNP-SNP interaction detection. Most existing EA-based methods focus on enhancing the search ability of EA itself. However, as the scale of SNP data further increases, the exponentially growing search space gradually becomes the dominant factor leading to the performance degradation of EA-based methods. To this end, a distributed evolutionary framework based on space partitioning (SP-EF) is proposed to detect SNP-SNP interactions on large-scale datasets. Distinct from the traditional population-distributed approaches, SP-EF first partitions the entire search space into several subspaces from the perspective of data. The space partitioning strategy is non-destructive since it guarantees that each feasible solution is assigned to a specific subspace. Thereafter, each subspace is explored by an EA optimizer independently and all the subspaces are optimized in parallel. Lastly, the final output is selected from the local optima in the historical search of each subspace. SP-EF can not only cope with the heavy computational burden of SNP combination evaluation but also enhance the diversity of the population to avoid local optima. Notably, SP-EF is load-balanced and scalable since it can flexibly partition the space according to the number of available computational nodes and the problem size. To show the practicability of SP-EF, we further design a discrete fireworks algorithm (DFWA) with three problem-guided operators as an EA optimizer. Experiments on artificial and real-world datasets demonstrate that our method significantly improves search speed and search accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信