A novel random forest approach to revealing interactions and controls on chlorophyll concentration and bacterial communities during coastal phytoplankton blooms.

Journal: Scientific reports
PMID:

Abstract

Increasing occurrence of harmful algal blooms across the land-water interface poses significant risks to coastal ecosystem structure and human health. Defining significant drivers and their interactive impacts on blooms allows for more effective analysis and identification of specific conditions supporting phytoplankton growth. A novel iterative Random Forests (iRF) machine-learning model was developed and applied to two example cases along the California coast to identify key stable interactions: (1) phytoplankton abundance in response to various drivers due to coastal conditions and land-sea nutrient fluxes, (2) microbial community structure during algal blooms. In Example 1, watershed derived nutrients were identified as the least significant interacting variable associated with Monterey Bay phytoplankton abundance. In Example 2, through iRF analysis of field-based 16S OTU bacterial community and algae datasets, we independently found stable interactions of prokaryote abundance patterns associated with phytoplankton abundance that have been previously identified in laboratory-based studies. Our study represents the first iRF application to marine algal blooms that helps to identify ocean, microbial, and terrestrial conditions that are considered dominant causal factors on bloom dynamics.

Authors

  • Yiwei Cheng
    Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA. yiweicheng@gmail.com.
  • Ved N Bhoot
    Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
  • Karl Kumbier
    Statistics Department, University of California, Berkeley, CA, USA.
  • Marilou P Sison-Mangus
    Department of Ocean Sciences, University of California, Santa Cruz, CA, USA.
  • James B Brown
    Molecular Ecosystems Biology Department, Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, United States.
  • Raphael Kudela
    Department of Ocean Sciences, University of California, Santa Cruz, CA, USA.
  • Michelle E Newcomer
    Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.