Sound source localization using sparse coding and SOM

  • Authors:
  • Hong-Shik Kim;Jong-Suk Choi

  • Affiliations:
  • Korea Institute of Science and Technology, Seoul, Republic of Korea;Korea Institute of Science and Technology, Seoul, Republic of Korea

  • Venue:
  • ETFA'09 Proceedings of the 14th IEEE international conference on Emerging technologies & factory automation
  • Year:
  • 2009

Abstract

Many sound source localization systems have been developed to detect the direction of a sound source. They commonly rely on the time delay of arrival (TDOA) or the interaural time difference (ITD), where the ITD is the difference in the arrival time of a sound between the two ears. The ITD, however, varies considerably with the frequency components of the sound, even when the source stays in the same place. In this paper we propose a binaural sound localization system that uses sparse-coding-based ITD (SITD) and a self-organizing map (SOM). Sparse coding decomposes the given sounds into three components: time, frequency and magnitude. The SOM then estimates the azimuth angle. The localization system is installed on our robot, which has two ears, a head and a body; PeopleBot serves as the body.
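To make the ITD idea concrete, here is a minimal sketch of a conventional broadband ITD estimate. It is not the SITD method proposed in the paper: it uses plain cross-correlation instead of sparse coding, and a simple far-field formula instead of a SOM to map ITD to azimuth. The function names, the ear spacing of 0.2 m, and the speed of sound of 343 m/s are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value
EAR_DISTANCE = 0.2      # m, hypothetical spacing between the two microphones


def estimate_itd(left, right, fs):
    """Estimate the ITD in seconds by cross-correlating the two ear signals.

    A positive result means the sound reached the left ear first.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    # corr[i] corresponds to lag k = i - (len(left) - 1), where
    # corr(k) = sum_n right[n + k] * left[n].
    corr = np.correlate(right, left, mode="full")
    lag = int(np.argmax(corr)) - (len(left) - 1)
    return lag / fs


def itd_to_azimuth(itd, ear_distance=EAR_DISTANCE, c=SPEED_OF_SOUND):
    """Map an ITD to an azimuth in degrees with the far-field model
    itd = ear_distance * sin(azimuth) / c.

    Positive azimuth points toward the left ear (sign convention assumed here).
    """
    # Clip so that noise cannot push the argument outside arcsin's domain.
    s = np.clip(itd * c / ear_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))
```

Calling `estimate_itd(left, right, fs)` on two equally long microphone buffers and passing the result to `itd_to_azimuth` gives a single broadband angle. Because this estimate lumps all frequencies together, it is exposed to the frequency dependence of the ITD that the abstract describes, which is the motivation for decomposing the sound first.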