Hadoop based clustering system for genome sequencing

Anju Ramesh Ekre, R. Mante
{"title":"Hadoop based clustering system for genome sequencing","authors":"Anju Ramesh Ekre, R. Mante","doi":"10.1109/ICONSTEM.2016.7560916","DOIUrl":null,"url":null,"abstract":"Genomics is an interdisciplinary branch of science that is bringing vital changes in the field of medicine and agriculture. It is believed that the scientific and technological advancements in 21st century will be related to the processing, manipulation and analysis of the vast information that is generated from genome sequencing of living organisms. A scientific and big data research domain includes the problem of genome sequencing. Genome sequence is also called as read sequence. Next-Generation sequencing is playing a crucial role in the development and advancements of read alignment algorithms. Computer scientists, mathematician and physicists are together helping for this research of alignment. However, increase in the data size and faster data access requirement for the scientists and researchers are increasing which is leading advancements in genome alignment towards acceleration approach. This paper includes a MapReduce acceleration scheme for faster sequence alignment. It works on multiple commodity hardware. With the use of MapReduce programming along with the clustering algorithm for distribution of genome data on multiple nodes may reduce the time, also it can lead towards accuracy in genome sequencing.","PeriodicalId":256750,"journal":{"name":"2016 Second International Conference on Science Technology Engineering and Management (ICONSTEM)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Science Technology Engineering and Management (ICONSTEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONSTEM.2016.7560916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Genomics is an interdisciplinary branch of science that is bringing vital changes in the field of medicine and agriculture. It is believed that the scientific and technological advancements in 21st century will be related to the processing, manipulation and analysis of the vast information that is generated from genome sequencing of living organisms. A scientific and big data research domain includes the problem of genome sequencing. Genome sequence is also called as read sequence. Next-Generation sequencing is playing a crucial role in the development and advancements of read alignment algorithms. Computer scientists, mathematician and physicists are together helping for this research of alignment. However, increase in the data size and faster data access requirement for the scientists and researchers are increasing which is leading advancements in genome alignment towards acceleration approach. This paper includes a MapReduce acceleration scheme for faster sequence alignment. It works on multiple commodity hardware. With the use of MapReduce programming along with the clustering algorithm for distribution of genome data on multiple nodes may reduce the time, also it can lead towards accuracy in genome sequencing.
基于Hadoop的基因组测序集群系统
基因组学是一门跨学科的科学分支,正在给医学和农业领域带来重大变化。人们认为,21世纪的科技进步将与生物基因组测序产生的大量信息的处理、操纵和分析有关。科学和大数据研究领域包括基因组测序问题。基因组序列又称读序列。下一代测序在读取比对算法的发展和进步中起着至关重要的作用。计算机科学家、数学家和物理学家正在共同帮助这项对准研究。然而,数据量的增加和对科学家和研究人员更快的数据访问需求正在增加,这导致了基因组比对朝着加速方法的发展。本文包括一个MapReduce加速方案,用于更快的序列对齐。它适用于多种商用硬件。利用MapReduce编程和聚类算法将基因组数据分布在多个节点上,可以减少时间,也可以提高基因组测序的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信