Real-time binaural sound source localization using sparse coding and SOM

  • Authors:
  • Do-Hyeong Hwang;Jong-Suk Choi

  • Affiliations:
  • Korea Institute Science and Technology, Seoul, Korea;Korea Institute Science and Technology, Seoul, Korea

  • Venue:
  • ICIRA'10 Proceedings of the Third international conference on Intelligent robotics and applications - Volume Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a binaural sound source localization method using both a sparse coding and a self-organizing map (SOM) in real-time system. We use the sparse coding for feature extraction to estimate the azimuth of sound source. It is used for decomposing input sound signals into three components such as time, frequency and magnitude. Therefore, although the frequency characteristic of ITD (Interaural Time Difference) is changed by shape of head, we utilized it to estimate the azimuth of the sound source considering the time-frequency features simultaneously. Then we adapted the SOM to estimate the azimuth of sound source which is a type of artificial neural networks. This system is constructed by open-source software, Flowdesigner, which gives us a data-flow oriented developmental environment for efficient real-time system.