A System for Phylogenetic Analyses over Alignments of Next Generation Sequence Data

Yuki Nishimura, T. Amagasa, Y. Inagaki, T. Hashimoto, H. Kitagawa
{"title":"A System for Phylogenetic Analyses over Alignments of Next Generation Sequence Data","authors":"Yuki Nishimura, T. Amagasa, Y. Inagaki, T. Hashimoto, H. Kitagawa","doi":"10.1109/CISIS.2016.51","DOIUrl":null,"url":null,"abstract":"A large quantity of DNA sequence data is being generated at high speed and in low cost by next generation sequencing (NGS) technology in recent years. NGS influences wide range of biology, including evolutionary biology, which is the field that infers the evolutionary relationships of genes or organisms from sequence data. In particular, the phylogenetic analyses using massive amount of data generated by NGS have been actively conducted. To infer the phylogenetic relationship, a number of alignments that comprise sets of sequences need to be maintained, i.e., they need to be updated whenever new sequence data become available. However, there have been no database that support updates of alignments, i.e., addition and/or removal of sequences from existing alignments. Instead, individual researchers independently update their alignments manually. To cope with this problem, we propose a system for phylogenetic analyses over alignments of NGS data. It takes as input NGS data, predicts the orthologue that each sequence belongs to, and updates the alignments. Moreover, by describing the related alignments in tree structure, it can maintain stored alignments in a systematic way. To prove the concept, we implement a prototype web application. We expect that our system help biological researchers carry out phylogenetic analysis on large scale data including those from NGS).","PeriodicalId":249236,"journal":{"name":"2016 10th International Conference on Complex, Intelligent, and Software Intensive Systems (CISIS)","volume":"180 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 10th International Conference on Complex, Intelligent, and Software Intensive Systems (CISIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISIS.2016.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

A large quantity of DNA sequence data is being generated at high speed and in low cost by next generation sequencing (NGS) technology in recent years. NGS influences wide range of biology, including evolutionary biology, which is the field that infers the evolutionary relationships of genes or organisms from sequence data. In particular, the phylogenetic analyses using massive amount of data generated by NGS have been actively conducted. To infer the phylogenetic relationship, a number of alignments that comprise sets of sequences need to be maintained, i.e., they need to be updated whenever new sequence data become available. However, there have been no database that support updates of alignments, i.e., addition and/or removal of sequences from existing alignments. Instead, individual researchers independently update their alignments manually. To cope with this problem, we propose a system for phylogenetic analyses over alignments of NGS data. It takes as input NGS data, predicts the orthologue that each sequence belongs to, and updates the alignments. Moreover, by describing the related alignments in tree structure, it can maintain stored alignments in a systematic way. To prove the concept, we implement a prototype web application. We expect that our system help biological researchers carry out phylogenetic analysis on large scale data including those from NGS).
下一代序列数据比对的系统发育分析系统
近年来,新一代测序技术(NGS)正在高速、低成本地生成大量的DNA序列数据。NGS影响了广泛的生物学,包括进化生物学,这是一个从序列数据推断基因或生物体进化关系的领域。特别是利用NGS产生的大量数据进行的系统发育分析正在积极进行。为了推断系统发育关系,需要维护包含序列集的许多比对,也就是说,只要有新的序列数据可用,就需要更新它们。然而,目前还没有数据库支持更新比对,即从现有比对中添加和/或删除序列。相反,个别研究人员独立地手动更新他们的排列。为了解决这个问题,我们提出了一个NGS数据比对的系统发育分析系统。它将NGS数据作为输入,预测每个序列所属的正交序列,并更新排列。此外,通过树形结构描述相关对齐,可以系统地维护存储的对齐。为了证明这个概念,我们实现了一个原型web应用程序。我们希望我们的系统能够帮助生物学研究人员对包括NGS数据在内的大规模数据进行系统发育分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信