Skip to Main content Skip to Navigation
Journal articles

Explore, edit and leverage genomic annotations using Python GTF toolkit

Abstract : Motivation: While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format. Results: We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performances, the core engine of pygtftk is a C dynamic library (libgtftk) while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, TSS location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advance tools to perform gene analyses.
Document type :
Journal articles
Complete list of metadatas

Cited literature [8 references]  Display  Hide  Download

https://hal-amu.archives-ouvertes.fr/hal-02078147
Contributor : Lionel Spinelli <>
Submitted on : Monday, March 25, 2019 - 10:05:55 AM
Last modification on : Friday, February 14, 2020 - 10:13:53 AM
Long-term archiving on: : Wednesday, June 26, 2019 - 12:50:00 PM

File

pygtftk_puthier.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Fabrice Lopez, Guillaume Charbonnier, Yasmina Kermezli, Mohamed Belhocine, Quentin Ferré, et al.. Explore, edit and leverage genomic annotations using Python GTF toolkit. Bioinformatics, Oxford University Press (OUP), 2019, ⟨10.1093/bioinformatics/btz116⟩. ⟨hal-02078147⟩

Share

Metrics