{"title":"Parsing GTF and FASTA files using the eccLib Library.","authors":"Tomasz Chady, Zuzanna Karolina Filutowska","doi":"10.1093/bioinformatics/btaf558","DOIUrl":null,"url":null,"abstract":"<p><strong>Summary: </strong>Leveraging the Python/C API, eccLib was developed as a high-performance library designed for parsing genomic files and analysing genomic contexts. To the best of the authors' knowledge, it is the fastest Python-based solution available. With eccLib, users can efficiently parse GTF/GFFv3 and FASTA files and utilise the provided methods for additional analysis.</p><p><strong>Availability and implementation: </strong>This library is implemented in C and distributed under the GPL-3.0 licence. It is compatible with any system that has the Python interpreter (CPython) installed. The use of C enables numerous optimisations at both the implementation and algorithmic levels, which are either unachievable or impractical in Python.</p><p><strong>Contact: </strong>tomcha@st.amu.edu.pl, platyna@amu.edu.pl, eccdna@eccdna.pl.</p><p><strong>Supplementary information: </strong>This library is available for installation from the Python Package Index (PyPI) under the name eccLib https://pypi.org/project/eccLib/. The source code is available at https://gitlab.platinum.edu.pl/eccdna/eccLib. The version described by this document (1.1.0) is archived as https://doi.org/10.5281/zenodo.17024282. More detailed documentation can be accessed at https://gitlab-pages.platinum.edu.pl/eccdna/eccLib/.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf558","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Summary: Leveraging the Python/C API, eccLib was developed as a high-performance library designed for parsing genomic files and analysing genomic contexts. To the best of the authors' knowledge, it is the fastest Python-based solution available. With eccLib, users can efficiently parse GTF/GFFv3 and FASTA files and utilise the provided methods for additional analysis.
Availability and implementation: This library is implemented in C and distributed under the GPL-3.0 licence. It is compatible with any system that has the Python interpreter (CPython) installed. The use of C enables numerous optimisations at both the implementation and algorithmic levels, which are either unachievable or impractical in Python.
Supplementary information: This library is available for installation from the Python Package Index (PyPI) under the name eccLib https://pypi.org/project/eccLib/. The source code is available at https://gitlab.platinum.edu.pl/eccdna/eccLib. The version described by this document (1.1.0) is archived as https://doi.org/10.5281/zenodo.17024282. More detailed documentation can be accessed at https://gitlab-pages.platinum.edu.pl/eccdna/eccLib/.