M. Pierzchała, Magdalena Ogłuszka, D. Gołuch, E. Poławska, T. Blicharski, A. Roszczyk, A. Nawrocka, P. Urbański, Kamila Stepanow, Aleksandra Ciepłoch, Mateusz Sachajko, M. Skowroński, Joanna Szczepanek, E. Juszczuk-Kubiak, Adrian Szczepański, M. Buszewska-Forajta, Xiangyang Miao, Vishnu Sharma, Sanjita Sharma, C. Pareek
{"title":"基于RNA-seq的波兰长白猪肝脏转录组SNP发现","authors":"M. Pierzchała, Magdalena Ogłuszka, D. Gołuch, E. Poławska, T. Blicharski, A. Roszczyk, A. Nawrocka, P. Urbański, Kamila Stepanow, Aleksandra Ciepłoch, Mateusz Sachajko, M. Skowroński, Joanna Szczepanek, E. Juszczuk-Kubiak, Adrian Szczepański, M. Buszewska-Forajta, Xiangyang Miao, Vishnu Sharma, Sanjita Sharma, C. Pareek","doi":"10.12775/trvs.2019.005","DOIUrl":null,"url":null,"abstract":"Background: RNA-seq technology is most commonly used in quantitative measurement of gene expression levels and identification of non-annotated transcripts. It is also used for the coding SNPs (cSNPs) discoveries in an efficient and cost-effective way. The aim of this study was to identify the putative genetic cSNPs variants in liver transcriptome of Polish Landrace pigs fed with high and low (normal) omega-6 and omega-3 polyunsaturated fatty acids (PUFAs) diets. Methods. RNA-seq based NGS experiment was performed on Polish Landrace pigs fed with high and low diets. Total RNA were isolated from liver tissues ofPUFAs dietary of Polish Landrace pigs. The RNA-seq libraries preparations were performed by mRNA enrichment, mRNA fragmentation, second strand cDNA synthesis, adaptor ligation, size selection and PCR amplification using the illumina TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego CA, USA), followed by NGS sequencing on MiSeq illumina platform. The quality control (QC) of raw RNA-seq data of liver transcriptome was performed using the Trimmomatic and FastQC tools. The paired-end mapping of the liver transcriptome RNA-seq data (n=12) was performed on the reference genome Sus scrofa v.10.2, followed by cSNPs discovery using GATK and SAMtools bioinformatics SNPs caller tools. Results: Two pooled paired-end libraries of 151bp liver transcriptome of Polish Landrace pigs were generated from MiSeq instrument and subsequent Fastq RNA-seq data were submitted to NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra). Our study identified 25.3 million paired-end reads: representing 13,509,248 paired-end reads of high PUFAs dietary group and 11,815,696 paired-end reads of low PUFAs dietary group liver transcriptome. The SNP discovery results revealed identification of 25909 homozygous and 23290 heterozygous cSNPs in the liver transcriptome of both dietary groups of Polish Landrace pigs. With regards to same or alternative SNPs alleles encoding amino acids regions, a total of 27141 synonymous cSNP and 5989 non-synonymous cSNPs were identified in liver transcriptome representing high PUFAs dietary group. However, a total of 15128 synonymous cSNPs and 3900 non-synonymous cSNPs were identified in liver transcriptome representing low PUFAs dietary groups of Polish Landrace pigs. The identification of single nucleotide variations (SNVs) representing substitutions of all four possibilities (A,T,G,C) were ranged 2872 to 6868 SNVs (high PUFAs) and 2574 to 3654 SNVs (low PUFAs) in the homozygous cSNPs and 2452 to 2678 SNVs (high PUFAs) and 2094 to 2230 SNVs (low PUFAs) in the heterozygous cSNPs of liver transcriptomes of Polish Landrace pigs, respectively.","PeriodicalId":402923,"journal":{"name":"Translational Research in Veterinary Science","volume":"122 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RNA-seq based SNP discovery in liver transcriptome of Polish Landrace pigs\",\"authors\":\"M. Pierzchała, Magdalena Ogłuszka, D. Gołuch, E. Poławska, T. Blicharski, A. Roszczyk, A. Nawrocka, P. Urbański, Kamila Stepanow, Aleksandra Ciepłoch, Mateusz Sachajko, M. Skowroński, Joanna Szczepanek, E. Juszczuk-Kubiak, Adrian Szczepański, M. Buszewska-Forajta, Xiangyang Miao, Vishnu Sharma, Sanjita Sharma, C. Pareek\",\"doi\":\"10.12775/trvs.2019.005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: RNA-seq technology is most commonly used in quantitative measurement of gene expression levels and identification of non-annotated transcripts. It is also used for the coding SNPs (cSNPs) discoveries in an efficient and cost-effective way. The aim of this study was to identify the putative genetic cSNPs variants in liver transcriptome of Polish Landrace pigs fed with high and low (normal) omega-6 and omega-3 polyunsaturated fatty acids (PUFAs) diets. Methods. RNA-seq based NGS experiment was performed on Polish Landrace pigs fed with high and low diets. Total RNA were isolated from liver tissues ofPUFAs dietary of Polish Landrace pigs. The RNA-seq libraries preparations were performed by mRNA enrichment, mRNA fragmentation, second strand cDNA synthesis, adaptor ligation, size selection and PCR amplification using the illumina TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego CA, USA), followed by NGS sequencing on MiSeq illumina platform. The quality control (QC) of raw RNA-seq data of liver transcriptome was performed using the Trimmomatic and FastQC tools. The paired-end mapping of the liver transcriptome RNA-seq data (n=12) was performed on the reference genome Sus scrofa v.10.2, followed by cSNPs discovery using GATK and SAMtools bioinformatics SNPs caller tools. Results: Two pooled paired-end libraries of 151bp liver transcriptome of Polish Landrace pigs were generated from MiSeq instrument and subsequent Fastq RNA-seq data were submitted to NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra). Our study identified 25.3 million paired-end reads: representing 13,509,248 paired-end reads of high PUFAs dietary group and 11,815,696 paired-end reads of low PUFAs dietary group liver transcriptome. The SNP discovery results revealed identification of 25909 homozygous and 23290 heterozygous cSNPs in the liver transcriptome of both dietary groups of Polish Landrace pigs. With regards to same or alternative SNPs alleles encoding amino acids regions, a total of 27141 synonymous cSNP and 5989 non-synonymous cSNPs were identified in liver transcriptome representing high PUFAs dietary group. However, a total of 15128 synonymous cSNPs and 3900 non-synonymous cSNPs were identified in liver transcriptome representing low PUFAs dietary groups of Polish Landrace pigs. The identification of single nucleotide variations (SNVs) representing substitutions of all four possibilities (A,T,G,C) were ranged 2872 to 6868 SNVs (high PUFAs) and 2574 to 3654 SNVs (low PUFAs) in the homozygous cSNPs and 2452 to 2678 SNVs (high PUFAs) and 2094 to 2230 SNVs (low PUFAs) in the heterozygous cSNPs of liver transcriptomes of Polish Landrace pigs, respectively.\",\"PeriodicalId\":402923,\"journal\":{\"name\":\"Translational Research in Veterinary Science\",\"volume\":\"122 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational Research in Veterinary Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12775/trvs.2019.005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Research in Veterinary Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12775/trvs.2019.005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
背景:RNA-seq技术最常用于基因表达水平的定量测量和无注释转录物的鉴定。它也被用于编码snp (csnp)的发现,以一种高效和经济的方式。本研究的目的是鉴定饲喂高和低(正常)ω -6和ω -3多不饱和脂肪酸(PUFAs)日粮的波兰长白猪肝脏转录组中假定的遗传csnp变异。方法。采用RNA-seq法对饲喂高、低饲粮的波兰长白猪进行了NGS试验。从波兰长白猪pufas饲粮的肝脏组织中分离总RNA。RNA-seq文库制备采用illumina TruSeq RNA Sample Prep Kit v2 (illumina, San Diego CA, USA)进行mRNA富集、mRNA片段化、第二链cDNA合成、接头连接、大小选择和PCR扩增,然后在MiSeq illumina平台上进行NGS测序。使用Trimmomatic和FastQC工具对肝脏转录组RNA-seq原始数据进行质量控制(QC)。在参考基因组Sus scrofa v.10.2上对肝脏转录组RNA-seq数据(n=12)进行配对端定位,然后使用GATK和SAMtools生物信息学SNPs调用者工具发现csnp。结果:MiSeq仪器生成了波兰长白猪151bp肝脏转录组的两个配对端库,随后Fastq RNA-seq数据提交至NCBI SRA数据库(https://www.ncbi.nlm.nih.gov/sra)。我们的研究确定了2530万个对端reads:高PUFAs饮食组的13,509,248个对端reads和低PUFAs饮食组的11,815,696个对端reads。SNP发现结果显示,在波兰长白猪的肝脏转录组中分别鉴定出25909个纯合csnp和23290个杂合csnp。对于编码氨基酸区域的相同或替代snp等位基因,在高PUFAs饮食组的肝脏转录组中共鉴定出27141个同义性cSNP和5989个非同义性cSNP。然而,在波兰长白猪低PUFAs饲粮组的肝脏转录组中,共鉴定出15128个同义csnp和3900个非同义csnp。在波兰长白猪肝脏转录组的杂合csnp中,代表所有四种可能性(A、T、G、C)替换的单核苷酸变异(snv)分别为2872 ~ 6868个snv(高PUFAs)和2574 ~ 3654个snv(低PUFAs),以及2452 ~ 2678个snv(高PUFAs)和2094 ~ 2230个snv(低PUFAs)。
RNA-seq based SNP discovery in liver transcriptome of Polish Landrace pigs
Background: RNA-seq technology is most commonly used in quantitative measurement of gene expression levels and identification of non-annotated transcripts. It is also used for the coding SNPs (cSNPs) discoveries in an efficient and cost-effective way. The aim of this study was to identify the putative genetic cSNPs variants in liver transcriptome of Polish Landrace pigs fed with high and low (normal) omega-6 and omega-3 polyunsaturated fatty acids (PUFAs) diets. Methods. RNA-seq based NGS experiment was performed on Polish Landrace pigs fed with high and low diets. Total RNA were isolated from liver tissues ofPUFAs dietary of Polish Landrace pigs. The RNA-seq libraries preparations were performed by mRNA enrichment, mRNA fragmentation, second strand cDNA synthesis, adaptor ligation, size selection and PCR amplification using the illumina TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego CA, USA), followed by NGS sequencing on MiSeq illumina platform. The quality control (QC) of raw RNA-seq data of liver transcriptome was performed using the Trimmomatic and FastQC tools. The paired-end mapping of the liver transcriptome RNA-seq data (n=12) was performed on the reference genome Sus scrofa v.10.2, followed by cSNPs discovery using GATK and SAMtools bioinformatics SNPs caller tools. Results: Two pooled paired-end libraries of 151bp liver transcriptome of Polish Landrace pigs were generated from MiSeq instrument and subsequent Fastq RNA-seq data were submitted to NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra). Our study identified 25.3 million paired-end reads: representing 13,509,248 paired-end reads of high PUFAs dietary group and 11,815,696 paired-end reads of low PUFAs dietary group liver transcriptome. The SNP discovery results revealed identification of 25909 homozygous and 23290 heterozygous cSNPs in the liver transcriptome of both dietary groups of Polish Landrace pigs. With regards to same or alternative SNPs alleles encoding amino acids regions, a total of 27141 synonymous cSNP and 5989 non-synonymous cSNPs were identified in liver transcriptome representing high PUFAs dietary group. However, a total of 15128 synonymous cSNPs and 3900 non-synonymous cSNPs were identified in liver transcriptome representing low PUFAs dietary groups of Polish Landrace pigs. The identification of single nucleotide variations (SNVs) representing substitutions of all four possibilities (A,T,G,C) were ranged 2872 to 6868 SNVs (high PUFAs) and 2574 to 3654 SNVs (low PUFAs) in the homozygous cSNPs and 2452 to 2678 SNVs (high PUFAs) and 2094 to 2230 SNVs (low PUFAs) in the heterozygous cSNPs of liver transcriptomes of Polish Landrace pigs, respectively.