Xiaolei Wang, Jiangyu Li, Yang Liu, Yu-feng Wang, Dongsheng Zhao
{"title":"构建基于Galaxy和高性能计算集群的本地化生物信息学平台","authors":"Xiaolei Wang, Jiangyu Li, Yang Liu, Yu-feng Wang, Dongsheng Zhao","doi":"10.1109/BMEI.2013.6747031","DOIUrl":null,"url":null,"abstract":"With the rapid development of high-throughput sequencing technology, biomedical research has entered into the era of big data. It causes problems about storage and analysis of massive biological data which need to be solved by high-performance computing. Therefore, we build the localized high-performance one-stop data analysis platform to provide convenient and efficient computational analysis services for biomedical researchers. We deploy Galaxy and integrate software tools and datasets into Galaxy in computing cluster, build stable web service, FTP service and management database in order to optimize and improve the performance of Galaxy, and use distributed resource management application interface to collaborate Galaxy with Sun Grid Engine for automatically scheduling and assigning computing resources. Currently the platform has been put into trial operation. The peak performance is 10 Teraflops and the capacity of storage is 40TB. The platform provides many functions such as sequence alignment, short sequence mapping, gene annotation, transcriptome analysis, metagenomic analysis and phylogenetic analysis, and approximately 700GB reference databases including human genome, viruses, bacteria, fungi, etc.","PeriodicalId":163211,"journal":{"name":"2013 6th International Conference on Biomedical Engineering and Informatics","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Building localized bioinformatics platform based on Galaxy and high performance computing cluster\",\"authors\":\"Xiaolei Wang, Jiangyu Li, Yang Liu, Yu-feng Wang, Dongsheng Zhao\",\"doi\":\"10.1109/BMEI.2013.6747031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of high-throughput sequencing technology, biomedical research has entered into the era of big data. It causes problems about storage and analysis of massive biological data which need to be solved by high-performance computing. Therefore, we build the localized high-performance one-stop data analysis platform to provide convenient and efficient computational analysis services for biomedical researchers. We deploy Galaxy and integrate software tools and datasets into Galaxy in computing cluster, build stable web service, FTP service and management database in order to optimize and improve the performance of Galaxy, and use distributed resource management application interface to collaborate Galaxy with Sun Grid Engine for automatically scheduling and assigning computing resources. Currently the platform has been put into trial operation. The peak performance is 10 Teraflops and the capacity of storage is 40TB. The platform provides many functions such as sequence alignment, short sequence mapping, gene annotation, transcriptome analysis, metagenomic analysis and phylogenetic analysis, and approximately 700GB reference databases including human genome, viruses, bacteria, fungi, etc.\",\"PeriodicalId\":163211,\"journal\":{\"name\":\"2013 6th International Conference on Biomedical Engineering and Informatics\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 6th International Conference on Biomedical Engineering and Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BMEI.2013.6747031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th International Conference on Biomedical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEI.2013.6747031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Building localized bioinformatics platform based on Galaxy and high performance computing cluster
With the rapid development of high-throughput sequencing technology, biomedical research has entered into the era of big data. It causes problems about storage and analysis of massive biological data which need to be solved by high-performance computing. Therefore, we build the localized high-performance one-stop data analysis platform to provide convenient and efficient computational analysis services for biomedical researchers. We deploy Galaxy and integrate software tools and datasets into Galaxy in computing cluster, build stable web service, FTP service and management database in order to optimize and improve the performance of Galaxy, and use distributed resource management application interface to collaborate Galaxy with Sun Grid Engine for automatically scheduling and assigning computing resources. Currently the platform has been put into trial operation. The peak performance is 10 Teraflops and the capacity of storage is 40TB. The platform provides many functions such as sequence alignment, short sequence mapping, gene annotation, transcriptome analysis, metagenomic analysis and phylogenetic analysis, and approximately 700GB reference databases including human genome, viruses, bacteria, fungi, etc.