Skip to Main content Skip to Navigation
Journal articles

Fine-Tuning Enhancer Models to Predict Transcriptional Targets across Multiple Genomes

Abstract : Networks of regulatory relations between transcription factors (TF) and their target genes (TG)- implemented through TF binding sites (TFBS)- are key features of biology. An idealized approach to solving such networks consists of starting from a consensus TFBS or a position weight matrix (PWM) to generate a high accuracy list of candidate TGs for biological validation. Developing and evaluating such approaches remains a formidable challenge in regulatory bioinformatics. We perform a benchmark study on 34 Drosophila TFs to assess existing TFBS and cis-regulatory module (CRM) detection methods, with a strong focus on the use of multiple genomes. Particularly, for CRM-modelling we investigate the addition of orthologous sites to a known PWM to construct phyloPWMs and we assess the added value of phylogenentic footprinting to predict contextual motifs around known TFBSs. For CRM-prediction, we compare motif conservation with network-level conservation approaches across multiple genomes. Choosing the optimal training and scoring strategies strongly enhances the performance of TG prediction for more than half of the tested TFs. Finally, we analyse a 35th TF, namely Eyeless, and find a significant overlap between predicted TGs and candidate TGs identified by microarray expression studies. In summary we identify several ways to optimize TF-specific TG predictions, some of which can be applied to all TFs, and others that can be applied only to particular TFs. The ability to model known TF-TG relations, together with the use of multiple genomes, results in a significant step forward in solving the architecture of gene regulatory networks.
Complete list of metadatas

Cited literature [51 references]  Display  Hide  Download

https://hal-amu.archives-ouvertes.fr/hal-01624314
Contributor : Lionel Spinelli <>
Submitted on : Thursday, September 6, 2018 - 12:36:31 PM
Last modification on : Thursday, June 4, 2020 - 10:24:08 AM
Long-term archiving on: : Friday, December 7, 2018 - 4:59:25 PM

File

journal.pone.0001115.PDF
Publication funded by an institution

Identifiers

Collections

Citation

S. Aerts, J. Helden, O. Sand, B. A. Hassan. Fine-Tuning Enhancer Models to Predict Transcriptional Targets across Multiple Genomes. PLoS ONE, Public Library of Science, 2007, 2 (11), pp.e1115. ⟨10.1371/journal.pone.0001115⟩. ⟨hal-01624314⟩

Share

Metrics

Record views

158

Files downloads

191