Crowdsourcing and curation: perspectives from biology and natural language processing.

Journal: Database : the journal of biological databases and curation
Published Date:

Abstract

Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodology and its applicability to biocuration. This paper explores crowdsourcing for biocuration through several case studies that highlight different ways of leveraging 'the crowd'; these raise issues about the kind(s) of expertise needed, the motivations of participants, and questions related to feasibility, cost and quality. The paper is an outgrowth of a panel session held at BioCreative V (Seville, September 9-11, 2015). The session consisted of four short talks, followed by a discussion. In their talks, the panelists explored the role of expertise and the potential to improve crowd performance by training; the challenge of decomposing tasks to make them amenable to crowdsourcing; and the capture of biological data and metadata through community editing.Database URL: http://www.mitre.org/publications/technical-papers/crowdsourcing-and-curation-perspectives.

Authors

  • Lynette Hirschman
    The MITRE Corporation, Bedford, MA, USA lynette@mitre.org.
  • Karën Fort
    University of Paris-Sorbonne/STIH Team, Paris, France.
  • Stéphanie Boué
    Philip Morris International R&D, Philip Morris Products S.A., Neuchâtel, Switzerland.
  • Nikos Kyrpides
    Joint Genome Institute, Walnut, Creek, CA, USA.
  • Rezarta Islamaj Doğan
    National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
  • Kevin Bretonnel Cohen
    Computational Bioscience Program, University Colorado School of Medicine, Aurora, CO, USA.