Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring

  • Authors:
  • R. Bardeli;D. Wolff;F. Kurth;M. Koch;K. -H. Tauchert;K. -H. Frommolt

  • Affiliations:
  • Fraunhofer Institute Intelligent Analysis and Information Systems IAIS, Schloss Birlinghoven, 53754 Sankt Augustin, Germany;Department of Musicology/Sound Studies, Institute for Communication Sciences at the University of Bonn, Adenauerallee 4-6, 53113 Bonn, Germany;Department of Computer Science III, University of Bonn, Römerstraíe 164, 53117 Bonn, Germany and Fraunhofer-FKIE, 53343 Wachtberg, Germany;Humboldt-Universität zu Berlin, Department of Biology, Invalidenstr. 43, 10115 Berlin, Germany;Museum für Naturkunde, Leibniz Institute for Research on Evolution and Biodiversity at the Humboldt University Berlin, Invalidenstr. 43, 10115 Berlin, Germany;Museum für Naturkunde, Leibniz Institute for Research on Evolution and Biodiversity at the Humboldt University Berlin, Invalidenstr. 43, 10115 Berlin, Germany

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2010

Quantified Score

Hi-index 0.10

Visualization

Abstract

Trends in bird population sizes are an important indicator in nature conservation but measuring such sizes is a very difficult, labour intensive process. Enormous progress in audio signal processing and pattern recognition in recent years makes it possible to incorporate automated methods into the detection of bird vocalisations. These methods can be employed to support the census of population sizes. We report about a study testing the feasibility of bird monitoring supported by automatic bird song detection. In particular, we describe novel algorithms for the detection of the vocalisations of two endangered bird species and show how these can be used in automatic habitat mapping. These methods are based on detecting temporal patterns in a given frequency band typical for the species. Special effort is put into the suppression of the noise present in real-world audio scenes. Our results show that even in real-world recording conditions high recognition rates with a tolerable rate of false positive detections are possible.