R. Tripathi, Pawan Sharma, P. Chakraborty, P. Varadwaj
{"title":"Count-based transcriptome analysis to identify differentially expressed genes for breast cancer","authors":"R. Tripathi, Pawan Sharma, P. Chakraborty, P. Varadwaj","doi":"10.1109/BSB.2016.7552147","DOIUrl":null,"url":null,"abstract":"Sequencing the coding regions or the whole cancer transcriptome can provide valuable information about the differential expression patterns of the genes. Previous researches centered on ~2% of coding human genome, assuming that the non-coding sequences were “junk” lacking significant functional information. Recent medical research show that a major percentage of the human genome (~70-90%) are non-coding, stored in the cell in the form of non-coding RNA (ncRNA) which overshadows the coding information limited only to a small percentage. These ncRNAs are composed of mostly ultraconserved elements, lacking protein-coding potential and regulating gene expression acting as enhancers whose aberrant expression may be involved in pathological process such as cancer. Here, we have described RNA-seq data analysis for the profiling of transcriptome of Breast cells and provided a generic outline of the whole pipeline from next-generation sequencing (NGS) output for quantification of differential gene expression across different conditions (e.g., control vs test). We have used tool Cufflinks-Cuffdiff to estimate transcript-level expression for gene discovery extracted from high-throughput RNA-seq data across distinct conditions that represent candidate biomarkers for future research. This study provides the survey of coding transcripts associated genes expression within a cancer system.","PeriodicalId":363820,"journal":{"name":"2016 International Conference on Bioinformatics and Systems Biology (BSB)","volume":"454 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Bioinformatics and Systems Biology (BSB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BSB.2016.7552147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Sequencing the coding regions or the whole cancer transcriptome can provide valuable information about the differential expression patterns of the genes. Previous researches centered on ~2% of coding human genome, assuming that the non-coding sequences were “junk” lacking significant functional information. Recent medical research show that a major percentage of the human genome (~70-90%) are non-coding, stored in the cell in the form of non-coding RNA (ncRNA) which overshadows the coding information limited only to a small percentage. These ncRNAs are composed of mostly ultraconserved elements, lacking protein-coding potential and regulating gene expression acting as enhancers whose aberrant expression may be involved in pathological process such as cancer. Here, we have described RNA-seq data analysis for the profiling of transcriptome of Breast cells and provided a generic outline of the whole pipeline from next-generation sequencing (NGS) output for quantification of differential gene expression across different conditions (e.g., control vs test). We have used tool Cufflinks-Cuffdiff to estimate transcript-level expression for gene discovery extracted from high-throughput RNA-seq data across distinct conditions that represent candidate biomarkers for future research. This study provides the survey of coding transcripts associated genes expression within a cancer system.