{"title":"PFusionDB: a comprehensive database of plant-specific fusion transcripts.","authors":"Ajay Arya, Simran Arora, Fiza Hamid, Shailesh Kumar","doi":"10.1007/s13205-024-04132-1","DOIUrl":null,"url":null,"abstract":"<p><p>Fusion transcripts (FTs) are well known cancer biomarkers, relatively understudied in plants. Here, we developed PFusionDB (www.nipgr.ac.in/PFusionDB), a novel plant-specific fusion-transcript database. It is a comprehensive repository of 80,170, 39,108, 83,330, and 11,500 unique fusions detected in 1280, 637, 697, and 181 RNA-Seq samples of <i>Arabidopsis thaliana</i>, <i>Oryza sativa japonica</i>, <i>Oryza sativa indica</i>, and <i>Cicer arietinum</i> respectively. Here, a total of 76,599 (<i>Arabidopsis thaliana</i>), 35,480 (<i>Oryza sativa japonica</i>), 72,099 (<i>Oryza sativa indica</i>), and 9524 (<i>Cicer arietinum</i>) fusion transcripts are non-recurrent i.e., only found in one sample. Identification of FTs was performed by using a total of five tools viz. EricScript-Plants, STAR-Fusion, TrinityFusion, SQUID, and MapSplice. At PFusionDB, available fundamental details of fusion events includes the information of parental genes, junction sequence, expression levels of fusion transcripts, breakpoint coordinates, strand information, tissue type, treatment information, fusion type, PFusionDB ID, and Sequence Read Archive (SRA) ID. Further, two search modules: 'Simple Search' and 'Advanced Search', along with a 'Browse' option to data download, are present for the ease of users. Three distinct modules viz. 'BLASTN', 'SW Align', and 'Mapping' are also available for efficient query sequence mapping and alignment to FTs. PFusionDB serves as a crucial resource for delving into the intricate world of fusion transcript in plants, providing researchers with a foundation for further exploration and analysis. Database URL: www.nipgr.ac.in/PFusionDB.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13205-024-04132-1.</p>","PeriodicalId":7067,"journal":{"name":"3 Biotech","volume":"14 11","pages":"282"},"PeriodicalIF":2.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519250/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"3 Biotech","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s13205-024-04132-1","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/28 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Fusion transcripts (FTs) are well known cancer biomarkers, relatively understudied in plants. Here, we developed PFusionDB (www.nipgr.ac.in/PFusionDB), a novel plant-specific fusion-transcript database. It is a comprehensive repository of 80,170, 39,108, 83,330, and 11,500 unique fusions detected in 1280, 637, 697, and 181 RNA-Seq samples of Arabidopsis thaliana, Oryza sativa japonica, Oryza sativa indica, and Cicer arietinum respectively. Here, a total of 76,599 (Arabidopsis thaliana), 35,480 (Oryza sativa japonica), 72,099 (Oryza sativa indica), and 9524 (Cicer arietinum) fusion transcripts are non-recurrent i.e., only found in one sample. Identification of FTs was performed by using a total of five tools viz. EricScript-Plants, STAR-Fusion, TrinityFusion, SQUID, and MapSplice. At PFusionDB, available fundamental details of fusion events includes the information of parental genes, junction sequence, expression levels of fusion transcripts, breakpoint coordinates, strand information, tissue type, treatment information, fusion type, PFusionDB ID, and Sequence Read Archive (SRA) ID. Further, two search modules: 'Simple Search' and 'Advanced Search', along with a 'Browse' option to data download, are present for the ease of users. Three distinct modules viz. 'BLASTN', 'SW Align', and 'Mapping' are also available for efficient query sequence mapping and alignment to FTs. PFusionDB serves as a crucial resource for delving into the intricate world of fusion transcript in plants, providing researchers with a foundation for further exploration and analysis. Database URL: www.nipgr.ac.in/PFusionDB.
Supplementary information: The online version contains supplementary material available at 10.1007/s13205-024-04132-1.
3 BiotechAgricultural and Biological Sciences-Agricultural and Biological Sciences (miscellaneous)
CiteScore
6.00
自引率
0.00%
发文量
314
期刊介绍:
3 Biotech publishes the results of the latest research related to the study and application of biotechnology to:
- Medicine and Biomedical Sciences
- Agriculture
- The Environment
The focus on these three technology sectors recognizes that complete Biotechnology applications often require a combination of techniques. 3 Biotech not only presents the latest developments in biotechnology but also addresses the problems and benefits of integrating a variety of techniques for a particular application. 3 Biotech will appeal to scientists and engineers in both academia and industry focused on the safe and efficient application of Biotechnology to Medicine, Agriculture and the Environment.