OLOGRAM : Determining significance of total overlap length between genomic regions sets

Abstract : Motivation: Various bioinformatics analyses provide sets of genomic coordinates of interest. Whether two such sets possess a functional relation is a frequent question. This is often determined by interpreting the statistical significance of their overlaps. However, only few existing methods consider the lengths of the overlap, and they do not provide a resolutive p-value. Results: Here, we introduce OLOGRAM, which performs overlap statistics between sets of genomic regions described in BEDs or GTF. It uses Monte Carlo simulation, taking into account both the distributions of region and inter-region lengths, to fit a negative binomial model of the total overlap length. Exclusion of user-defined genomic areas during the shuffling is supported.
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal-amu.archives-ouvertes.fr/hal-02383808
Contributor : Lionel Spinelli <>
Submitted on : Thursday, November 28, 2019 - 8:38:40 AM
Last modification on : Thursday, January 23, 2020 - 6:22:13 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2020-05-28

Please log in to resquest access to the document

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Q Ferré, G. Charbonnier, N Sadouni, F Lopez, Y Kermezli, et al.. OLOGRAM : Determining significance of total overlap length between genomic regions sets. Bioinformatics, Oxford University Press (OUP), 2019, ⟨10.1093/bioinformatics/btz810⟩. ⟨hal-02383808⟩

Share

Metrics

Record views

30