Alexandra M Kasianova, Anna V Klepikova, Oleg A Gusev, Guzel R Gazizova, Maria D Logacheva, Aleksey A Penin
Bioinformatics (Oxford, England). Journal Article. Published 2025-09-30. DOI: https://doi.org/10.1093/bioinformatics/btaf551
Full-length isoform constructor (FLIC) - a tool for isoform discovery based on long reads.
Motivation: Advances in high-throughput sequencing have illuminated the complexity of the transcriptome landscape in eukaryotes. An inherent part of this complexity is the presence of multiple isoforms generated by alternative splicing and by the use of alternative transcription start and polyadenylation sites. However, currently available tools have limited capacity to infer full-length isoforms.
Results: We developed a new pipeline, FLIC (Full-Length Isoform Constructor). FLIC is based on long-read transcriptome data and integrates several key features: 1) using biological replicate concordance to filter out noise and artifacts; 2) employing peak calling to precisely identify transcription start and polyadenylation sites; 3) enabling robust isoform reconstruction with minimal reliance on existing annotations. We evaluated FLIC using a dedicated set of real and simulated Arabidopsis thaliana cDNA sequencing data. The results demonstrate that FLIC accurately reconstructs known and novel isoforms, outperforming existing tools, especially in the absence of reference annotations. A direct comparison with CAGE, currently regarded as the gold standard for transcription start site identification, shows that FLIC is equally accurate while being much less time-consuming. Thus, FLIC provides a valuable tool for comprehensive transcript characterization, particularly for non-model organisms or when dealing with incomplete or inaccurate annotations.
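As a rough illustration of the first two features listed above (peak calling on read ends and replicate concordance), the following minimal Python sketch may help. It is not FLIC's implementation: the function names `call_peaks` and `concordant_peaks`, the parameters `window`, `min_count`, and `tolerance`, and the local-maximum rule are all assumptions chosen for clarity. Input is assumed to be one list of read 5' end coordinates per biological replicate, for a single strand of a single chromosome.

```python
from collections import Counter


def call_peaks(read_starts, window=10, min_count=3):
    """Return positions that are local maxima of read-start counts.

    Hypothetical rule: a position is a peak if at least `min_count` reads
    start there and no position within +/- `window` has a higher count.
    Ties are resolved in favour of the leftmost position.
    """
    counts = Counter(read_starts)
    peaks = []
    for pos in sorted(counts):
        c = counts[pos]
        if c < min_count:
            continue
        right_ok = all(counts.get(p, 0) <= c for p in range(pos + 1, pos + window + 1))
        left_ok = all(counts.get(p, 0) < c for p in range(pos - window, pos))
        if left_ok and right_ok:
            peaks.append(pos)
    return peaks


def concordant_peaks(replicates, tolerance=5, **peak_kwargs):
    """Keep peaks from the first replicate that recur in every other replicate.

    Two peaks are treated as the same site if they lie within `tolerance` bp
    of each other (an assumed matching rule).
    """
    per_replicate = [call_peaks(r, **peak_kwargs) for r in replicates]
    first, rest = per_replicate[0], per_replicate[1:]
    return [
        p for p in first
        if all(any(abs(p - q) <= tolerance for q in peaks) for peaks in rest)
    ]


# Toy data: both replicates support a start site near position 100;
# the sites at 500 and 900 each appear in only one replicate.
rep1 = [100] * 5 + [101] * 2 + [500] * 4
rep2 = [102] * 6 + [99] * 1 + [900] * 3
print(concordant_peaks([rep1, rep2]))  # -> [100]
```

In this toy example the site near position 100 survives because both replicates support it, while the replicate-specific sites are discarded as likely noise. The same idea applies to read 3' ends for polyadenylation sites.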
Availability: FLIC is available at https://github.com/albidgy/FLIC.
Supplementary information: Supplementary data are available at Bioinformatics online.