A Public Dataset of Annotated Orcinus orca Acoustic Signals for Detection and Ecotype Classification.
Journal:
Scientific data
Published Date:
Jul 3, 2025
Abstract
Killer whales (Orcinus orca) exhibit significant ecological and genetic diversity, with three primary sympatric populations in the Northeast Pacific: Resident, Bigg's (Transient), and Offshore. Each population is characterized by distinct foraging habits, social structures, and vocal repertoires, which complicate accurate monitoring and conservation efforts. This dataset, compiled from diverse sources, provides a comprehensive resource for the detection and classification of killer whale vocalizations. The dataset includes annotated acoustic recordings spanning 11 years from various locations in Alaska, British Columbia, and Washington, collected using multiple hydrophone systems. It addresses the challenge of differentiating killer whale calls from other marine species and environmental noise, including specific instances of confounding signals that may help enhance model robustness. Detailed annotations capture a diverse suite of vocalizations and their associated metadata, facilitating the development of advanced machine learning models for ecological monitoring. This curated dataset aims to improve the accuracy of killer whale detection algorithms, support conservation efforts, and advance our understanding of killer whale acoustic communication across different populations.