{"title":"On the visualization of the DNA sequence and its nucleotide content","authors":"A. Mylläri, T. Salakoski, A. Pasechnik","doi":"10.1145/1140378.1140385","DOIUrl":null,"url":null,"abstract":"Visual inspection can help reveal patterns that would be computationally rather difficult to reveal. We consider three different algorithms for visualizations of a DNA sequence and its nucleotide content: random walk, fractal and visualization based on the entropy-like parameters calculated using a sliding window. We present a program that uses these three methods and visualizes either the whole of a given sequence, or specified fragments. It also provides facilities to compare visualizations obtained for different sequences/fragments. Random walk visualization considers the sequence symbol-by-symbol; the other two methods also take into account how well nucleotides are \"mixed\" in the sequence. It allows an easy visualization of repeated patterns, segments with a high/low content of some nucleotides, such as CG-islands, etc. The program also helps to identify regions of interest for further study.","PeriodicalId":314801,"journal":{"name":"SIGSAM Bull.","volume":"292 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIGSAM Bull.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1140378.1140385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Visual inspection can help reveal patterns that would be computationally rather difficult to reveal. We consider three different algorithms for visualizations of a DNA sequence and its nucleotide content: random walk, fractal and visualization based on the entropy-like parameters calculated using a sliding window. We present a program that uses these three methods and visualizes either the whole of a given sequence, or specified fragments. It also provides facilities to compare visualizations obtained for different sequences/fragments. Random walk visualization considers the sequence symbol-by-symbol; the other two methods also take into account how well nucleotides are "mixed" in the sequence. It allows an easy visualization of repeated patterns, segments with a high/low content of some nucleotides, such as CG-islands, etc. The program also helps to identify regions of interest for further study.