Morphological traits and machine learning for genetic lineage prediction of two reef-building corals.

Journal: PloS one
Published Date:

Abstract

Integrating multiple lines of evidence that support molecular taxonomy analysis has proven to be a robust method for species delimitation in scleractinian corals. However, morphology often conflicts with genetic approaches due to high phenotypic plasticity and convergence. Understanding morphological variation among species is crucial to studying coral distribution, life history, ecology, and evolution. Here, we present an application of Random Forest models for coral species identification based on morphological annotation of the corallum and corallites. We show that the integration of molecular and morphological trait analysis can be improved using machine learning. Morphological traits were documented for Porites and Pocillopora coral species that were collected and genotyped through genome-wide, genetical hierarchical clustering, and coalescence analyses for the Tara Pacific Expedition. While Porites only included three tentative species, most Pocillopora species were accounted by included specimens from the western Indian Ocean, tropical Southwestern Pacific, and southeast Polynesia. Two Random Forest models per genus were trained on the morphological annotations using the genetic lineage labels. One model was developed for in-situ image identification and used corallum traits measured from in-situ photographs. Another model for integrative species identification combined corallum and corallite data measured on scanning electron micrographs. Random Forest models outperformed traditional dimension reduction methods like PCA and FAMD followed by k-means and hierarchical clustering by classifying the correct genetic lineage despite morphological clusters overlapping. This machine learning approach is reproducible, cost-effective, and accessible, reducing the need for taxonomic expertise. It can complement molecular and phylogenetic studies and support image identification, highlighting its potential to advance a coral integrative taxonomy workflow.

Authors

  • Guinther Mitushasi
    Shimoda Marine Research Center, University of Tsukuba, 5-10-1 Shimoda, Shizuoka, Japan.
  • Yuko F Kitano
    Japan Wildlife Research Center, Tokyo, Japan.
  • Nicolas Oury
    UMR ENTROPIE (UMR - Université de La Réunion, IRD, IFREMER, Université de Nouvelle-Calédonie, CNRS), Université de La Réunion, St Denis, La Réunion, France.
  • Hélène Magalon
    UMR ENTROPIE (UMR - Université de La Réunion, IRD, IFREMER, Université de Nouvelle-Calédonie, CNRS), Université de La Réunion, St Denis, La Réunion, France.
  • David A Paz-García
    Centro de Investigaciones Biológicas del Noroeste (CIBNOR), Laboratorio de Genética para la Conservación, Av. IPN 195, Col. Playa Palo de Santa Rita Sur, La Paz, Baja California Sur, México.
  • Eric Armstrong
    PSL Research University, EPHE, CNRS, Université de Perpignan, Perpignan, France.
  • Benjamin C C Hume
    Department of Biology, University of Konstanz, Konstanz, Germany.
  • Barbara Porro
    French National Institute for Agriculture, Food, and Environment (INRAE), Université Côte d'Azur, ISA, France.
  • Clémentine Moulin
    Fondation Tara Océan, Base Tara, 8 rue de Prague, 75 012, Paris, France.
  • Emilie Boissin
    PSL Research University: EPHE-UPVD-CNRS, USR CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, Perpignan, France.
  • Guillaume Bourdin
    School of Marine Sciences, University of Maine, Orono, Maine, United States of America.
  • Guillaume Iwankow
    Sorbonne Université, CNRS, Station Biologique de Roscoff, AD2M, UMR, ECOMAP, Roscoff, France.
  • Julie Poulain
    Research Federation for the study of Global Ocean Systems Ecology and Evolution, FR2022/Tara GOSEE, 3 rue Michel-Ange, Paris, France.
  • Sarah Romac
    Sorbonne Université, CNRS, Station Biologique de Roscoff, AD2M, UMR, ECOMAP, Roscoff, France.
  • Maggie M Reddy
    School of Biological and Chemical Sciences, Ryan Institute, University of Galway, University Road, H91, Galway, Ireland.
  • Serge Planes
    PSL Research University: EPHE-UPVD-CNRS, USR CRIOBE, Laboratoire d'Excellence CORAIL, Université de Perpignan, Perpignan, France.
  • Denis Allemand
    Centre Scientifique de Monaco, 8 Quai Antoine Ier, MC-98000, Monaco, Principality of Monaco.
  • Christian R Voolstra
    Department of Biology, University of Konstanz, Konstanz 78457, Germany.
  • Didier Forcioli
    LIA ROPSE, Laboratoire International Associé Université Côte d'Azur-Centre Scientifique de Monaco, Monaco, Principality of Monaco.
  • Sylvain Agostini
    Shimoda Marine Research Center, University of Tsukuba, 5-10-1 Shimoda, Shizuoka, Japan.