Chien-Hung Huang, Sheng-Jia Weng, P. Korla, Jywe-Fei Fang, J. Tsai, Nilubon Kurubanjerdjit, K. Ng
{"title":"Identification and Analysis of Palindromes for RNA Sequences","authors":"Chien-Hung Huang, Sheng-Jia Weng, P. Korla, Jywe-Fei Fang, J. Tsai, Nilubon Kurubanjerdjit, K. Ng","doi":"10.1109/BIBE.2016.35","DOIUrl":null,"url":null,"abstract":"A palindrome is a string of the form S = A1 A2 or S = A1 aA2, where A1 and A2 are substrings of S and the reverse of A2 exactly matches with A1. DNA palindromes show direct influence on tumerogenesis due to the formation of DNA palindromes at microRNA genes, which involve in tumor development. Furthermore, RNA palindromes play a crucial role in genomic instability and gene amplification in some human cancers. Therefore, it is important to develop effective approaches to identify and characterize biological approximate palindromes. In this paper, we developed a simple algorithm to find all the exact and approximate palindromes up to k errors (k is specified by the user) for the two different types of RNA sequence data, i.e. mRNA sequences of fusion genes and human microRNAs (miRNAs). We confirmed that the palindromes in RNA sequences are A-U rich. According to the Kolmogorov-Smirnov test, it was found that miRNA has a frequency distribution of palindromes different from the fusion genes' mRNAs. The proposed algorithm is easy to implement, which provides an effective tool for investigating the relationship between palindromes and cancer biology.","PeriodicalId":377504,"journal":{"name":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2016.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A palindrome is a string of the form S = A1 A2 or S = A1 aA2, where A1 and A2 are substrings of S and the reverse of A2 exactly matches with A1. DNA palindromes show direct influence on tumerogenesis due to the formation of DNA palindromes at microRNA genes, which involve in tumor development. Furthermore, RNA palindromes play a crucial role in genomic instability and gene amplification in some human cancers. Therefore, it is important to develop effective approaches to identify and characterize biological approximate palindromes. In this paper, we developed a simple algorithm to find all the exact and approximate palindromes up to k errors (k is specified by the user) for the two different types of RNA sequence data, i.e. mRNA sequences of fusion genes and human microRNAs (miRNAs). We confirmed that the palindromes in RNA sequences are A-U rich. According to the Kolmogorov-Smirnov test, it was found that miRNA has a frequency distribution of palindromes different from the fusion genes' mRNAs. The proposed algorithm is easy to implement, which provides an effective tool for investigating the relationship between palindromes and cancer biology.