M. Pierzchała, Magdalena Ogłuszka, D. Gołuch, E. Poławska, T. Blicharski, A. Roszczyk, A. Nawrocka, P. Urbański, Kamila Stepanow, Aleksandra Ciepłoch, Mateusz Sachajko, M. Skowroński, Joanna Szczepanek, E. Juszczuk-Kubiak, Adrian Szczepański, M. Buszewska-Forajta, Xiangyang Miao, Vishnu Sharma, Sanjita Sharma, C. Pareek
{"title":"RNA-seq based SNP discovery in liver transcriptome of Polish Landrace pigs","authors":"M. Pierzchała, Magdalena Ogłuszka, D. Gołuch, E. Poławska, T. Blicharski, A. Roszczyk, A. Nawrocka, P. Urbański, Kamila Stepanow, Aleksandra Ciepłoch, Mateusz Sachajko, M. Skowroński, Joanna Szczepanek, E. Juszczuk-Kubiak, Adrian Szczepański, M. Buszewska-Forajta, Xiangyang Miao, Vishnu Sharma, Sanjita Sharma, C. Pareek","doi":"10.12775/trvs.2019.005","DOIUrl":null,"url":null,"abstract":"Background: RNA-seq technology is most commonly used in quantitative measurement of gene expression levels and identification of non-annotated transcripts. It is also used for the coding SNPs (cSNPs) discoveries in an efficient and cost-effective way. The aim of this study was to identify the putative genetic cSNPs variants in liver transcriptome of Polish Landrace pigs fed with high and low (normal) omega-6 and omega-3 polyunsaturated fatty acids (PUFAs) diets. Methods. RNA-seq based NGS experiment was performed on Polish Landrace pigs fed with high and low diets. Total RNA were isolated from liver tissues ofPUFAs dietary of Polish Landrace pigs. The RNA-seq libraries preparations were performed by mRNA enrichment, mRNA fragmentation, second strand cDNA synthesis, adaptor ligation, size selection and PCR amplification using the illumina TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego CA, USA), followed by NGS sequencing on MiSeq illumina platform. The quality control (QC) of raw RNA-seq data of liver transcriptome was performed using the Trimmomatic and FastQC tools. The paired-end mapping of the liver transcriptome RNA-seq data (n=12) was performed on the reference genome Sus scrofa v.10.2, followed by cSNPs discovery using GATK and SAMtools bioinformatics SNPs caller tools. Results: Two pooled paired-end libraries of 151bp liver transcriptome of Polish Landrace pigs were generated from MiSeq instrument and subsequent Fastq RNA-seq data were submitted to NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra). Our study identified 25.3 million paired-end reads: representing 13,509,248 paired-end reads of high PUFAs dietary group and 11,815,696 paired-end reads of low PUFAs dietary group liver transcriptome. The SNP discovery results revealed identification of 25909 homozygous and 23290 heterozygous cSNPs in the liver transcriptome of both dietary groups of Polish Landrace pigs. With regards to same or alternative SNPs alleles encoding amino acids regions, a total of 27141 synonymous cSNP and 5989 non-synonymous cSNPs were identified in liver transcriptome representing high PUFAs dietary group. However, a total of 15128 synonymous cSNPs and 3900 non-synonymous cSNPs were identified in liver transcriptome representing low PUFAs dietary groups of Polish Landrace pigs. The identification of single nucleotide variations (SNVs) representing substitutions of all four possibilities (A,T,G,C) were ranged 2872 to 6868 SNVs (high PUFAs) and 2574 to 3654 SNVs (low PUFAs) in the homozygous cSNPs and 2452 to 2678 SNVs (high PUFAs) and 2094 to 2230 SNVs (low PUFAs) in the heterozygous cSNPs of liver transcriptomes of Polish Landrace pigs, respectively.","PeriodicalId":402923,"journal":{"name":"Translational Research in Veterinary Science","volume":"122 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Research in Veterinary Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12775/trvs.2019.005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: RNA-seq technology is most commonly used in quantitative measurement of gene expression levels and identification of non-annotated transcripts. It is also used for the coding SNPs (cSNPs) discoveries in an efficient and cost-effective way. The aim of this study was to identify the putative genetic cSNPs variants in liver transcriptome of Polish Landrace pigs fed with high and low (normal) omega-6 and omega-3 polyunsaturated fatty acids (PUFAs) diets. Methods. RNA-seq based NGS experiment was performed on Polish Landrace pigs fed with high and low diets. Total RNA were isolated from liver tissues ofPUFAs dietary of Polish Landrace pigs. The RNA-seq libraries preparations were performed by mRNA enrichment, mRNA fragmentation, second strand cDNA synthesis, adaptor ligation, size selection and PCR amplification using the illumina TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego CA, USA), followed by NGS sequencing on MiSeq illumina platform. The quality control (QC) of raw RNA-seq data of liver transcriptome was performed using the Trimmomatic and FastQC tools. The paired-end mapping of the liver transcriptome RNA-seq data (n=12) was performed on the reference genome Sus scrofa v.10.2, followed by cSNPs discovery using GATK and SAMtools bioinformatics SNPs caller tools. Results: Two pooled paired-end libraries of 151bp liver transcriptome of Polish Landrace pigs were generated from MiSeq instrument and subsequent Fastq RNA-seq data were submitted to NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra). Our study identified 25.3 million paired-end reads: representing 13,509,248 paired-end reads of high PUFAs dietary group and 11,815,696 paired-end reads of low PUFAs dietary group liver transcriptome. The SNP discovery results revealed identification of 25909 homozygous and 23290 heterozygous cSNPs in the liver transcriptome of both dietary groups of Polish Landrace pigs. With regards to same or alternative SNPs alleles encoding amino acids regions, a total of 27141 synonymous cSNP and 5989 non-synonymous cSNPs were identified in liver transcriptome representing high PUFAs dietary group. However, a total of 15128 synonymous cSNPs and 3900 non-synonymous cSNPs were identified in liver transcriptome representing low PUFAs dietary groups of Polish Landrace pigs. The identification of single nucleotide variations (SNVs) representing substitutions of all four possibilities (A,T,G,C) were ranged 2872 to 6868 SNVs (high PUFAs) and 2574 to 3654 SNVs (low PUFAs) in the homozygous cSNPs and 2452 to 2678 SNVs (high PUFAs) and 2094 to 2230 SNVs (low PUFAs) in the heterozygous cSNPs of liver transcriptomes of Polish Landrace pigs, respectively.