Consistent annotation of gene expression arrays

Abstract : Background Gene expression arrays are valuable and widely used tools for biomedical research. Today's commercial arrays attempt to measure the expression level of all of the genes in the genome. Effectively translating the results from the microarray into a biological interpretation requires an accurate mapping between the probesets on the array and the genes that they are targeting. Although major array manufacturers provide annotations of their gene expression arrays, the methods used by various manufacturers are different and the annotations are difficult to keep up to date in the rapidly changing world of biological sequence databases. Results We have created a consistent microarray annotation protocol applicable to all of the major array manufacturers. We constantly keep our annotations updated with the latest Ensembl Gene predictions, and thus cross-referenced with a large number of external biomedical sequence database identifiers. We show that these annotations are accurate and address in detail reasons for the minority of probesets that cannot be annotated. Annotations are publicly accessible through the Ensembl Genome Browser and programmatically through the Ensembl Application Programming Interface. They are also seamlessly integrated into the BioMart data-mining tool and the biomaRt package of BioConductor. Conclusions Consistent, accurate and updated gene expression array annotations remain critical for biological research. Our annotations facilitate accurate biological interpretation of gene expression profiles.
Document type :
Journal articles
Complete list of metadatas

https://hal-amu.archives-ouvertes.fr/hal-01615157
Contributor : Lionel Spinelli <>
Submitted on : Wednesday, December 19, 2018 - 10:36:17 AM
Last modification on : Thursday, February 14, 2019 - 11:48:02 AM
Long-term archiving on : Wednesday, March 20, 2019 - 2:29:50 PM

File

document(6).pdf
Publication funded by an institution

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01615157, version 1

Collections

Citation

Benoit Ballester, Nathan Johnson, Glenn Proctor, Paul Flicek. Consistent annotation of gene expression arrays. BMC Genomics, BioMed Central, 2010, 11, pp.294. ⟨hal-01615157⟩

Share

Metrics

Record views

83

Files downloads

49