K. Sedlář, Helena Skutková, P. Videnska, I. Rychlík, I. Provazník
{"title":"Bipartite graphs for metagenomic data analysis and visualization","authors":"K. Sedlář, Helena Skutková, P. Videnska, I. Rychlík, I. Provazník","doi":"10.1109/BIBM.2015.7359839","DOIUrl":null,"url":null,"abstract":"Metagenomics became very popular after expansion of next-generation sequencing techniques that allowed simple implementation of extensive studies. With a target gene sequencing approach, an identification of organisms in a metagenome is quite effortless since only a small reference database of the particular gene is needed. Moreover, by counting the copies of individual genes, also quantitative analysis can be applied. Unfortunately, current bioinformatics tools aim mainly on the analysis of a single metagenome. A cluster analysis, a heatmap of correlation coefficients, biclustering or other statistics techniques can only show relations inside the metagenome or the relation between the metagenome composition and other parameters. On the other hand, there is a lack of tools to provide a comparative analysis of two or more metagenomes. Suitable properties for this kind of analysis can be found in a bipartite graph. Here, we present a novel workflow for finding the suitable representation of metagenomic data to provide a comparative analysis of metagenomes. The resulting graph can take into account information about the actual composition of the metagenome as well as the environment it relates to. Thus, it can provide different view of the data to the naked eye that can complement other techniques such as principal coordinate analysis.","PeriodicalId":186217,"journal":{"name":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2015.7359839","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Metagenomics became very popular after expansion of next-generation sequencing techniques that allowed simple implementation of extensive studies. With a target gene sequencing approach, an identification of organisms in a metagenome is quite effortless since only a small reference database of the particular gene is needed. Moreover, by counting the copies of individual genes, also quantitative analysis can be applied. Unfortunately, current bioinformatics tools aim mainly on the analysis of a single metagenome. A cluster analysis, a heatmap of correlation coefficients, biclustering or other statistics techniques can only show relations inside the metagenome or the relation between the metagenome composition and other parameters. On the other hand, there is a lack of tools to provide a comparative analysis of two or more metagenomes. Suitable properties for this kind of analysis can be found in a bipartite graph. Here, we present a novel workflow for finding the suitable representation of metagenomic data to provide a comparative analysis of metagenomes. The resulting graph can take into account information about the actual composition of the metagenome as well as the environment it relates to. Thus, it can provide different view of the data to the naked eye that can complement other techniques such as principal coordinate analysis.