S. A. Lambert, A. Jolma, L. F. Campitelli, P. K. Das, Y. Yin et al., The human transcription factors, Cell, vol.172, pp.650-665, 2018.

A. Mathelier, W. Shi, and W. W. Wasserman, Identification of altered cis-regulatory elements in human disease, Trends Genet, vol.31, pp.67-76, 2015.

D. S. Johnson, A. Mortazavi, R. M. Myers, and B. Wold, Genome-wide mapping of in vivo protein-DNA interactions, Science, vol.316, pp.1497-1502, 2007.

L. Teytelman, D. M. Thurtle, J. Rine, and A. van Oudenaarden, Highly expressed loci are vulnerable to misleading ChIP localization of multiple unrelated proteins, Proc. Natl. Acad. Sci. U.S.A, vol.110, pp.18602-18607, 2013.

D. Jain, S. Baldi, A. Zabel, T. Straub, and P. B. Becker, Active promoters give rise to false positive 'Phantom Peaks' in ChIP-seq experiments, Nucleic Acids Res, vol.43, pp.6959-6968, 2015.

R. Worsley Hunt and W. W. Wasserman, Non-targeted transcription factors motifs are a systemic component of ChIP-seq datasets, Genome Biol, vol.15, p.412, 2014.

G. D. Stormo, Modeling the specificity of protein-DNA interactions, Quant Biol, vol.1, pp.115-130, 2013.

M. T. Weirauch, A. Cote, R. Norel, M. Annala, Y. Zhao et al., Evaluation of methods for modeling transcription factor sequence specificity, Nat. Biotechnol, vol.31, pp.126-134, 2013.

I. Kulakovskiy, V. Levitsky, D. Oshchepkov, L. Bryzgalov, I. Vorontsov et al., From binding motifs in ChIP-Seq data to improved models of transcription factor binding sites, J. Bioinform. Comput. Biol, vol.11, p.1340004, 2013.

R. Eggeling, T. Roos, P. Myllymäki, and I. Grosse, Inferring intra-motif dependencies of DNA binding sites from ChIP-seq data, BMC Bioinformatics, vol.16, p.375, 2015.

M. Siebert and J. Söding, Bayesian Markov models consistently outperform PWMs at predicting motifs in nucleotide sequences, Nucleic Acids Res, vol.44, pp.6055-6069, 2016.

M. Slattery, T. Zhou, L. Yang, A. C. Dantas Machado, R. Gordân et al., Absence of a simple code: how transcription factors read the genome, Trends Biochem. Sci, vol.39, pp.381-399, 2014.

J. Keilwagen and J. Grau, Varying levels of complexity in transcription factor binding motifs, Nucleic Acids Res, vol.43, p.119, 2015.

L. Yang, Y. Orenstein, A. Jolma, Y. Yin, J. Taipale et al., Transcription factor family-specific DNA shape readout revealed by quantitative specificity models, Mol. Syst. Biol, vol.13, p.910, 2017.

A. Mathelier, B. Xin, T. Chiu, L. Yang, R. Rohs et al., DNA shape features improve transcription factor binding site predictions in vivo, Cell Syst, vol.3, pp.278-286, 2016.
DOI : 10.1016/j.cels.2016.07.001

URL : https://doi.org/10.1016/j.cels.2016.07.001

J. Chèneby, M. Gheorghe, M. Artufel, A. Mathelier, and B. Ballester, ReMap 2018: an updated atlas of regulatory regions from an integrative analysis of DNA-binding ChIP-seq experiments, Nucleic Acids Res, vol.46, pp.267-275, 2018.

I. Yevshin, R. Sharipov, T. Valeev, A. Kel, and F. Kolpakov, GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments, Nucleic Acids Res, vol.45, pp.61-67, 2017.

K. Zhou, S. Liu, W. Sun, L. Zheng, H. Zhou et al., ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data, Nucleic Acids Res, vol.45, pp.43-50, 2017.

S. Mei, Q. Qin, Q. Wu, H. Sun, R. Zheng et al., Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse, Nucleic Acids Res, vol.45, pp.658-662, 2017.

A. S. Hinrichs, D. Karolchik, R. Baertsch, G. P. Barber, G. Bejerano et al., The UCSC Genome Browser Database: update, Nucleic Acids Res, vol.34, pp.590-598, 2006.

S. B. Montgomery, O. L. Griffith, M. C. Sleumer, C. M. Bergman, M. Bilenky et al., ORegAnno: an open access database and curation system for literature-derived promoters, transcription factor binding sites and regulatory variation, Bioinformatics, vol.22, pp.637-640, 2006.
DOI : 10.1093/bioinformatics/btk027

URL : https://academic.oup.com/bioinformatics/article-pdf/22/5/637/538215/btk027.pdf

W. J. Kent, The human genome browser at UCSC, Genome Res, vol.12, pp.996-1006, 2002.

O. Fornes, M. Gheorghe, P. A. Richmond, D. J. Arenillas, W. W. Wasserman et al., MANTA2, update of the Mongo database for the analysis of transcription factor binding site alterations, Sci Data, vol.5, p.180141, 2018.

A. Khan, O. Fornes, A. Stigliani, M. Gheorghe, J. A. Castro-Mondragon et al., JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework, Nucleic Acids Res, vol.46, p.1284, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01646126

R. Worsley Hunt, A. Mathelier, L. del Peso, and W. W. Wasserman, Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment, BMC Genomics, vol.15, p.472, 2014.

Y. Guo, S. Mahony, and D. K. Gifford, High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints, PLoS Comput. Biol, vol.8, p.1002638, 2012.
DOI : 10.1371/journal.pcbi.1002638

URL : https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1002638&type=printable

T. L. Bailey and P. Machanick, Inferring direct DNA binding from ChIP-seq, Nucleic Acids Res, vol.40, p.128, 2012.

I. V. Kulakovskiy, V. A. Boeva, A. V. Favorov, and V. J. Makeev, Deep and wide digging for binding motifs in ChIP-Seq data, Bioinformatics, vol.26, pp.2622-2623, 2010.

R. Jothi, S. Cuddapah, A. Barski, K. Cui, and K. Zhao, Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data, Nucleic Acids Res, vol.36, pp.5221-5231, 2008.

E. G. Wilbanks and M. T. Facciotti, Evaluation of algorithm performance in ChIP-Seq peak detection, PLoS One, vol.5, p.11471, 2010.

A. Mathelier and W. W. Wasserman, The next generation of transcription factor binding site prediction, PLoS Comput. Biol, vol.9, p.1003214, 2013.

Y. Zhao, S. Ruan, M. Pandey, and G. D. Stormo, Improved models for transcription factor binding site identification using nonindependent interactions, Genetics, vol.191, pp.781-790, 2012.

M. F. Berger, A. A. Philippakis, A. M. Qureshi, F. S. He, P. W. Estep et al., Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities, Nat. Biotechnol, vol.24, pp.1429-1435, 2006.

H. S. Rhee and B. F. Pugh, Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution, Cell, vol.147, pp.1408-1419, 2011.

K. Y. Yip, C. Cheng, N. Bhardwaj, J. B. Brown, J. Leng et al., Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors, Genome Biol, vol.13, p.48, 2012.

W. W. Wasserman and A. Sandelin, Applied bioinformatics for the identification of regulatory elements, Nat. Rev. Genet, vol.5, pp.276-287, 2004.

R. Y. Patel and G. D. Stormo, Discriminative motif optimization based on perceptron training, Bioinformatics, vol.30, pp.941-948, 2014.

T. Chiu, L. Yang, T. Zhou, B. J. Main, S. C. Parker et al., GBshape: a genome browser database for DNA shape annotations, Nucleic Acids Res, vol.43, pp.103-109, 2015.

A. R. Quinlan and I. M. Hall, BEDTools: a flexible suite of utilities for comparing genomic features, Bioinformatics, vol.26, pp.841-842, 2010.

W. N. Venables and B. D. Ripley, Modern Applied Statistics with S, 2002.

J. N. Kapur, P. K. Sahoo, and A. K. Wong, A new method for gray-level picture thresholding using the entropy of the histogram, Comput. Vis. Graph. Image Process, vol.29, p.140, 1985.

C. E. Shannon, A Mathematical Theory of Communication, Bell Syst. Tech. J, vol.27, pp.623-656, 1948.

C. A. Schneider, W. S. Rasband, and K. W. Eliceiri, NIH Image to ImageJ: 25 years of image analysis, Nat. Methods, vol.9, pp.671-675, 2012.

T. L. Bailey, M. Boden, F. A. Buske, M. Frith, C. E. Grant et al., MEME SUITE: tools for motif discovery and searching, Nucleic Acids Res, vol.37, pp.202-208, 2009.

M. L. Bulyk, E. Gentalen, D. J. Lockhart, and G. M. Church, Quantifying DNA-protein interactions by double-stranded DNA arrays, Nat. Biotechnol, vol.17, pp.573-577, 1999.

M. A. Hume, L. A. Barrera, S. S. Gisselbrecht, and M. L. Bulyk, UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions, Nucleic Acids Res, vol.43, pp.117-122, 2015.

H. B. Mann and D. R. Whitney, On a test of whether one of two random variables is stochastically larger than the other, Ann. Math. Stat, vol.18, pp.50-60, 1947.

N. Yamada, W. K. Lai, N. Farrell, B. F. Pugh, and S. Mahony, Characterizing protein-DNA binding event subtypes in ChIP-exo data, Bioinformatics, 2018.

S. Heinz, C. Benner, N. Spann, E. Bertolino, Y. C. Lin et al., Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell Identities, Mol. Cell, vol.38, pp.576-589, 2010.

H. Xing, Y. Mo, W. Liao, and M. Q. Zhang, Genome-wide localization of protein-DNA binding and histone modification by a Bayesian change-point method with ChIP-seq data, PLoS Comput. Biol, vol.8, p.1002613, 2012.

Y. Zhang, T. Liu, C. A. Meyer, J. Eeckhoute, D. S. Johnson et al., Model-based analysis of ChIP-Seq (MACS), Genome Biol, vol.9, p.137, 2008.

Y. Hochberg and Y. Benjamini, More powerful procedures for multiple significance testing, Stat. Med, vol.9, pp.811-818, 1990.

E. Afgan, D. Baker, B. Batut, M. van den Beek, D. Bouvier et al., The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update, Nucleic Acids Res, vol.46, pp.537-544, 2018.

D. Warde-Farley, S. L. Donaldson, O. Comes, K. Zuberi, R. Badrawi et al., The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function, Nucleic Acids Res, vol.38, pp.214-234, 2010.

L. Chen and Z. S. Qin, traseR: an R package for performing trait-associated SNP enrichment analysis in genomic intervals, Bioinformatics, vol.32, pp.1214-1216, 2015.

M. D. Mailman, M. Feolo, Y. Jin, M. Kimura, K. Tryka et al., The NCBI dbGaP database of genotypes and phenotypes, Nat. Genet, vol.39, pp.1181-1186, 2007.

D. Welter, J. MacArthur, J. Morales, T. Burdett, P. Hall et al., The NHGRI GWAS Catalog, a curated resource of SNP-trait associations, Nucleic Acids Res, vol.42, pp.1001-1006, 2014.

A. Siepel, Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes, Genome Res, vol.15, pp.1034-1050, 2005.

S. Neph, M. S. Kuehn, A. P. Reynolds, E. Haugen, R. E. Thurman et al., BEDOPS: high-performance genomic feature operations, Bioinformatics, vol.28, pp.1919-1920, 2012.

A. Pohl and M. Beato, bwtool: a tool for bigWig files, Bioinformatics, vol.30, pp.1618-1619, 2014.

M. F. Berger and M. L. Bulyk, Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins, Methods Mol. Biol, vol.338, pp.245-260, 2006.

D. Xie, A. P. Boyle, L. Wu, J. Zhai, T. Kawli et al., Dynamic trans-Acting factor colocalization in human cells, Cell, vol.155, pp.713-724, 2013.

A. P. Boyle, C. L. Araya, C. Brdlik, P. Cayting, C. Cheng et al., Comparative analysis of regulatory information and circuits across distant species, Nature, vol.512, pp.453-456, 2014.

T. W. Whitfield, J. Wang, P. J. Collins, E. C. Partridge, S. Aldred et al., Functional analysis of transcription factor binding sites in human promoters, Genome Biol, vol.13, p.50, 2012.

D. Hnisz, B. J. Abraham, T. I. Lee, A. Lau, V. Saint-André et al., Super-enhancers in the control of cell identity and disease, Cell, vol.155, pp.934-947, 2013.

B. Wilczyński and E. E. Furlong, Dynamic CRM occupancy reflects a temporal map of developmental progression, Mol. Syst. Biol, vol.6, p.383, 2010.

W. A. Whyte, D. A. Orlando, D. Hnisz, B. J. Abraham, C. Y. Lin et al., Master transcription factors and mediator establish super-enhancers at key cell identity genes, Cell, vol.153, pp.307-319, 2013.

Q. He, A. F. Bardet, B. Patton, J. Purvis, J. Johnston et al., High conservation of transcription factor binding and evidence for combinatorial regulation across six Drosophila species, Nat. Genet, vol.43, pp.414-420, 2011.

W. W. Fisher, J. J. Li, A. S. Hammonds, J. B. Brown, B. D. Pfeiffer et al., DNA regions bound at low occupancy by transcription factors do not drive patterned reporter gene expression in Drosophila, Proc. Natl. Acad. Sci. U.S.A, vol.109, pp.21330-21335, 2012.

D. L. Longo and J. M. Drazen, Data sharing, N. Engl. J. Med, vol.374, pp.276-277, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01112977