Skip to content

ethz-tb/orcai-ms-code

Repository files navigation

Additional code used for orcAI manuscript

orcAI has been published in the Journal Marine Mammal Science as:

Bonhoeffer, S. et al. 2025. “orcAI: A Machine Learning Tool to Detect and Classify Acoustic Signals of Killer Whales in Audio Recordings.” Marine Mammal Science e70083. https://doi.org/10.1111/mms.70083.

@article{https://doi.org/10.1111/mms.70083,
  author = {Bonhoeffer, Sebastian and Selbmann, Anna and Angst, Daniel C. and Ochsner, Nicolas and Miller, Patrick J. O. and Samarra, Filipa I. P. and Baumgartner, Chérine D.},
  title = {orcAI: A Machine Learning Tool to Detect and Classify Acoustic Signals of Killer Whales in Audio Recordings},
  journal = {Marine Mammal Science},
  volume = {n/a},
  number = {n/a},
  pages = {e70083},
  keywords = {bioacoustics, cetaceans, deep learning},
  doi = {https://doi.org/10.1111/mms.70083},
  url = {https://onlinelibrary.wiley.com/doi/abs/10.1111/mms.70083},
  eprint = {https://onlinelibrary.wiley.com/doi/pdf/10.1111/mms.70083},
  note = {e70083 9425791},
  abstract = {ABSTRACT Acoustic monitoring is an essential tool for investigating animal communication and behavior when visual contact is limited, but the scalability of bioacoustic projects is often limited by time-intensive manual auditing of focal signals. To address this bottleneck, we introduce orcAI—a novel deep learning framework for the automated detection and classification of a broad acoustic repertoire of killer whales (Orcinus orca), including vocalizations (e.g., pulsed calls, whistles) and incidental sounds (e.g., breathing, tail slaps). orcAI combines a ResNet-based Convolutional Neural Network (ResNet-CNN) with Long Short-Term Memory (LSTM) layers to capture both spatial features and temporal context, enabling the model to classify signals and to accurately determine their temporal boundaries in spectrograms. Trained on a comprehensive dataset from herring-feeding killer whales off Iceland, the framework was designed to be adaptable to other populations upon training with equivalent data. Our final model achieves up to 98.2\% accuracy on test data and is delivered as an open-source tool with an easy-to-use command-line interface. By providing a ready-to-use model that processes raw audio and outputs annotations, orcAI serves as a useful tool for advancing the study of killer whale vocal behavior and, more broadly, for understanding marine mammal communication and ecology.}
}

Contents

About

Additional code used for orcAI manuscript

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published