Multiclass CNN Approach for Automatic Classification of Dolphin Vocalizations.

Journal: Sensors (Basel, Switzerland)
PMID:

Abstract

Monitoring dolphins in the open sea is essential for understanding their behavior and the impact of human activities on the marine ecosystems. Passive Acoustic Monitoring (PAM) is a non-invasive technique for tracking dolphins, providing continuous data. This study presents a novel approach for classifying dolphin vocalizations from a PAM acoustic recording using a convolutional neural network (CNN). Four types of common bottlenose dolphin () vocalizations were identified from underwater recordings: whistles, echolocation clicks, burst pulse sounds, and feeding buzzes. To enhance classification performances, edge-detection filters were applied to spectrograms, with the aim of removing unwanted noise components. A dataset of nearly 10,000 spectrograms was used to train and test the CNN through a 10-fold cross-validation procedure. The results showed that the CNN achieved an average accuracy of 95.2% and an F1-score of 87.8%. The class-specific results showed a high accuracy for whistles (97.9%), followed by echolocation clicks (94.5%), feeding buzzes (94.0%), and burst pulse sounds (92.3%). The highest F1-score was obtained for whistles, exceeding 95%, while the other three vocalization typologies maintained an F1-score above 80%. This method provides a promising step toward improving the passive acoustic monitoring of dolphins, contributing to both species conservation and the mitigation of conflicts with fisheries.

Authors

  • Francesco Di Nardo
    Department of Information Engineering, Università Politecnica delle Marche, via Brecce Bianche, 60131, Ancona, Italy. f.dinardo@staff.univpm.it.
  • Rocco De Marco
    Institute of Biological Resources and Marine Biotechnology (IRBIM), National Research Council (CNR), 60125 Ancona, Italy.
  • Daniel Li Veli
    Institute of Biological Resources and Marine Biotechnology (IRBIM), National Research Council (CNR), 60125 Ancona, Italy.
  • Laura Screpanti
    Dipartimento di Ingegneria dell'informazione, Università Politecnica delle Marche, 60131 Ancona, Italy.
  • Benedetta Castagna
    Dipartimento di Ingegneria dell'informazione, Università Politecnica delle Marche, 60131 Ancona, Italy.
  • Alessandro Lucchetti
    Institute of Biological Resources and Marine Biotechnology (IRBIM), National Research Council (CNR), 60125 Ancona, Italy.
  • David Scaradozzi
    Dipartimento di Ingegneria dell'informazione, Università Politecnica delle Marche, 60131 Ancona, Italy.