Ontology application and use at the ENCODE DCC.

Journal: Database : the journal of biological databases and curation
Published Date:

Abstract

The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory network and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All ENCODE experimental data, metadata and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage and distribution to community resources and the scientific community. As the volume of data increases, the organization of experimental details becomes increasingly complicated and demands careful curation to identify related experiments. Here, we describe the ENCODE DCC's use of ontologies to standardize experimental metadata. We discuss how ontologies, when used to annotate metadata, provide improved searching capabilities and facilitate the ability to find connections within a set of experiments. Additionally, we provide examples of how ontologies are used to annotate ENCODE metadata and how the annotations can be identified via ontology-driven searches at the ENCODE portal. As genomic datasets grow larger and more interconnected, standardization of metadata becomes increasingly vital to allow for exploration and comparison of data between different scientific projects.

Authors

  • Venkat S Malladi
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Drew T Erickson
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Nikhil R Podduturi
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Laurence D Rowe
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Esther T Chan
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Jean M Davidson
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Benjamin C Hitz
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Marcus Ho
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Brian T Lee
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Stuart Miyasato
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Gregory R Roe
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Matt Simison
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Cricket A Sloan
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • J Seth Strattan
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Forrest Tanaka
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • W James Kent
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • J Michael Cherry
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Eurie L Hong
    Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA and Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA euriehong@stanford.edu.