Brain Data Standards - A method for building data-driven cell-type ontologies.

Journal: Scientific data
PMID:

Abstract

Large-scale single-cell 'omics profiling is being used to define a complete catalogue of brain cell types, something that traditional methods struggle with due to the diversity and complexity of the brain. But this poses a problem: How do we organise such a catalogue - providing a standard way to refer to the cell types discovered, linking their classification and properties to supporting data? Cell ontologies provide a partial solution to these problems, but no existing ontology schemas support the definition of cell types by direct reference to supporting data, classification of cell types using classifications derived directly from data, or links from cell types to marker sets along with confidence scores. Here we describe a generally applicable schema that solves these problems and its application in a semi-automated pipeline to build a data-linked extension to the Cell Ontology representing cell types in the Primary Motor Cortex of humans, mice and marmosets. The methods and resulting ontology are designed to be scalable and applicable to similar whole-brain atlases currently in preparation.

Authors

  • Shawn Zheng Kai Tan
    Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK.
  • Huseyin Kir
    Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK.
  • Brian D Aevermann
    J. Craig Venter Institute, 4120 Capricorn Lane, La Jolla, CA, 92037, USA.
  • Tom Gillespie
    University of California San Diego, La Jolla, CA, USA.
  • Nomi Harris
    Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
  • Michael J Hawrylycz
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Nikolas L Jorstad
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Ed S Lein
    Allen Institute for Brain Science, Seattle, Washington, 98103, USA.
  • Nicolas Matentzoglu
    School of Computer Science, University of Manchester, Oxford Road, Manchester, UK. nicolas.matentzoglu@manchester.ac.uk.
  • Jeremy A Miller
    Allen Institute for Brain Science, Seattle, Washington, 98103, USA.
  • Tyler S Mollenkopf
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Christopher J Mungall
    Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
  • Patrick L Ray
    Department of Philosophy, University at Buffalo, Buffalo, NY USA.
  • Raymond E A Sanchez
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Brian Staats
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Jim Vermillion
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Ambika Yadav
    Allen Institute for Brain Science, Seattle, WA, USA.
  • Yun Zhang
    Biology Institute, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China.
  • Richard H Scheuermann
    J. Craig Venter Institute, 4120 Capricorn Lane, La Jolla, CA, 92037, USA. RScheuermann@jcvi.org.
  • David Osumi-Sutherland
    European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK.