A novel SFLA based method for gene expression biclustering

Priyojit Das, Sujay Saha
Published in: 2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)
Publication date: 2017-11-01
DOI: 10.1109/ICRCICN.2017.8234506 (https://doi.org/10.1109/ICRCICN.2017.8234506)
Citations: 3

Abstract

Since its invention, microarray technology has grown continuously and has played a major role in biological research. It generates huge amounts of gene expression data for biological analysis, and parallel computation methods are required to find functional associations in data of this scale. Clustering, an unsupervised machine learning technique, groups similar genes based on their expression across all conditions. Conventional clustering methods, however, cannot separate different cellular processes in gene expression data, because a biological activity may start only in the presence of some specific conditions. Biclustering techniques are therefore used instead: biclustering identifies a set of genes that are co-expressed under a specific subset of experimental conditions. Here we introduce an approach to finding biclusters based on an improved shuffled frog leaping algorithm (SFLA). SFLA is a hybrid of an evolutionary memetic algorithm and the collective-intelligence-based particle swarm optimization algorithm, and it also has a faster convergence speed. Applied to the yeast (Saccharomyces cerevisiae) cell cycle dataset, the proposed algorithm yields a large number of biologically significant biclusters, verified against the Gene Ontology database, relative to other existing algorithms. The biclusters also have a small mean squared residue (MSR) value and a large size.
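
The MSR criterion mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard Cheng and Church definition of mean squared residue; the abstract does not say how the paper combines MSR with bicluster size in its fitness function, so that part is not shown. The function name `mean_squared_residue` and the toy matrix are hypothetical and chosen only for illustration.

```python
def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the bicluster given by `rows` x `cols`.

    Assumes the standard Cheng and Church definition: the residue of a cell
    is value - row mean - column mean + overall mean, all taken within the
    bicluster, and the MSR is the mean of the squared residues.
    """
    n_rows, n_cols = len(rows), len(cols)
    # Extract the sub-matrix covered by the selected genes and conditions.
    sub = [[matrix[i][j] for j in cols] for i in rows]
    row_means = [sum(r) / n_cols for r in sub]
    col_means = [sum(sub[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    overall = sum(row_means) / n_rows
    total = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            residue = sub[i][j] - row_means[i] - col_means[j] + overall
            total += residue * residue
    return total / (n_rows * n_cols)


# Toy expression matrix (genes x conditions); the values are made up.
expression = [
    [1.0, 2.0, 3.0, 9.0],
    [2.0, 3.0, 4.0, 1.0],
    [4.0, 5.0, 6.0, 7.0],
    [8.0, 1.0, 5.0, 2.0],
]

# Genes 0-2 differ only by a constant shift under conditions 0-2.
coherent = mean_squared_residue(expression, rows=[0, 1, 2], cols=[0, 1, 2])
# The full matrix has no such shared pattern.
whole = mean_squared_residue(expression, rows=[0, 1, 2, 3], cols=[0, 1, 2, 3])
print(coherent, whole)
```

In this toy case the three shifted genes form a bicluster whose MSR is zero up to floating-point error, while the full matrix scores higher. A search procedure such as SFLA would look for gene and condition subsets that keep this value small while covering as many genes and conditions as possible.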