Prediction of microplastic abundance in surface water of the ocean and influencing factors based on ensemble learning.

Journal: Environmental pollution (Barking, Essex : 1987)
Published Date:

Abstract

Microplastics are regarded as emergent contaminants posing a serious threat to the marine ecosystem. It is time-consuming and labor-intensive to determine the number of microplastics in different seas using traditional sampling and detection methods. Machine learning can provide a promising tool for prediction, but there is a lack of research on this. To screen high-performance models for the prediction of microplastic abundance in the marine surface water and explore the influencing factors, three ensemble learning models, random forest (RF), gradient boosted decision tree (GBDT), and extreme gradient boosting (XGBoost), were developed and compared. A total of 1169 samples were collected, and multi-classification prediction models were constructed with 16 features of the data as inputs and six classes of microplastic abundance intervals as outputs. Our results show that the XGBoost model has the best performance of prediction, with a total accuracy rate of 0.719 and an ROC AUC (Receiver Operating Characteristic curve, Area Under Curve) value of 0.914. Seawater phosphate (PHOS) and seawater temperature (TEMP) have negative effects on the abundance of microplastics in surface seawater, while the distance between the sampling point and the coast (DIS), wind stress (WS), human development index (HDI), and sampling latitude (LAT) have positive effects. This work not only predicts the abundance of microplastics in different seas but also offers a framework for the use of machine learning in the study of marine microplastics.

Authors

  • Yu Zhen
    Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of Education), Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin, 300350, China.
  • Lei Wang
    Department of Nursing, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing, China.
  • Hongwen Sun
    MOE Key Laboratory of Pollution Processes and Environmental Criteria, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China.
  • Chunguang Liu
    Yangzhong Intelligent Electric Research Center, North China Electric Power University, Yangzhong 212211, China.