T. Godhandaraman, N. Pruthviraj, V. Praveenkumar, A. Banuprasad, K. Karthick
Title: Big data in genomics
Venue: 2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)
Publication date: 2017-02-01
DOI: 10.1109/ICAMMAET.2017.8186739 (https://doi.org/10.1109/ICAMMAET.2017.8186739)
Citations: 7
Abstract
Healthcare applications built on big data require both large-scale data management and intensive computation. This paper focuses on genomics in cancer testing and examines whether such healthcare applications scale well on commercial big data platforms that implement the MapReduce framework. We selected short-read gene sequence alignment and assembly workloads from genome analysis, and used Apache Hadoop for distributed, parallelized processing and analysis of data at petabyte (PB) or exabyte (EB) scale. The paper also considers the current use of Hadoop in the bioinformatics community.
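
To make the MapReduce pattern mentioned in the abstract concrete, here is a minimal single-process sketch in Python. It counts k-mers (length-k substrings) across short reads, a counting step commonly used in assembly pipelines. This is an illustration only, not the authors' workload or Hadoop's API: the function names (`map_read`, `shuffle`, `reduce_counts`, `count_kmers`), the choice of k, and the example reads are all assumptions made for this sketch. On a real cluster the shuffle step would be performed by the framework across machines.

```python
from collections import defaultdict
from itertools import chain


def map_read(read, k=3):
    """Map step (hypothetical helper): emit (k-mer, 1) for each k-length substring."""
    return [(read[i:i + k], 1) for i in range(len(read) - k + 1)]


def shuffle(pairs):
    """Shuffle step: group emitted values by key, standing in for the framework."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each k-mer."""
    return {kmer: sum(values) for kmer, values in groups.items()}


def count_kmers(reads, k=3):
    """Run map, shuffle, and reduce in sequence over a list of reads."""
    mapped = chain.from_iterable(map_read(read, k) for read in reads)
    return reduce_counts(shuffle(mapped))


if __name__ == "__main__":
    reads = ["ACGTAC", "CGTACG"]  # made-up reads, not data from the paper
    print(count_kmers(reads, k=3))
```

Because each read is mapped independently and each k-mer is reduced independently, both steps can in principle be spread across many workers, which is the property that lets this style of workload scale out.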