Skip to Main content Skip to Navigation
Journal articles

TAGOOS: genome-wide supervised learning of non-coding loci associated to complex phenotypes

Abstract : Genome-wide association studies (GWAS) associate single nucleotide polymorphisms (SNPs) to complex phenotypes. Most human SNPs fall in non-coding regions and are likely regulatory SNPs, but linkage disequilibrium (LD) blocks make it difficult to distinguish functional SNPs. Therefore, putative functional SNPs are usually annotated with molecular markers of gene regulatory regions and prioritized with dedicated prediction tools. We integrated associated SNPs, LD blocks and regulatory features into a supervised model called TAGOOS (TAG SNP bOOSting) and computed scores genome-wide. The TAGOOS scores enriched and prioritized unseen associated SNPs with an odds ratio of 4.3 and 3.5 and an area under the curve (AUC) of 0.65 and 0.6 for intronic and intergenic regions, respectively. The TAGOOS score was correlated with the maximal significance of associated SNPs and expression quantitative trait loci (eQTLs) and with the number of biological samples annotated for key regulatory features. Analysis of loci and regions associated to cleft lip and human adult height phenotypes recovered known functional loci and predicted new functional loci enriched in transcriptions factors related to the phenotypes. In conclusion, we trained a supervised model based on associated SNPs to prioritize putative functional regions. The TAGOOS scores, annotations and UCSC genome tracks are available here: https: //tagoos.readthedocs.io.
Complete list of metadatas

Cited literature [67 references]  Display  Hide  Download

https://hal-amu.archives-ouvertes.fr/hal-02119716
Contributor : Pascal Rihet <>
Submitted on : Saturday, May 4, 2019 - 8:57:59 AM
Last modification on : Saturday, November 21, 2020 - 3:09:26 AM
Long-term archiving on: : Wednesday, October 2, 2019 - 1:18:37 AM

File

gkz320.pdf
Publisher files allowed on an open archive

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Collections

Citation

Aitor Gonzalez, Marie Artufel, Pascal Rihet. TAGOOS: genome-wide supervised learning of non-coding loci associated to complex phenotypes. Nucleic Acids Research, Oxford University Press, 2019, ⟨10.1093/nar/gkz320⟩. ⟨hal-02119716⟩

Share

Metrics

Record views

142

Files downloads

306