Environmental Sound Recognition by Multilayered Neural Networks

  • Authors:
  • Yoshiyuki Toyoda;Jie Huang;Shuxue Ding;Yong Liu

  • Affiliations:
  • University of Aizu;University of Aizu;University of Aizu;University of Aizu

  • Venue:
  • CIT '04 Proceedings of the Fourth International Conference on Computer and Information Technology
  • Year:
  • 2004

Abstract

Environmental sound recognition is an important function of robotic audition. Although HMM- or TDNN-based methods can also be used for environmental sound recognition, it is not possible, unlike in speech recognition, to create a complete database covering all kinds of environmental sounds. Environmental sound recognition therefore depends more on the task of the robot's computer system. From this point of view, methods for environmental sound recognition must also be task-dependent and must be evaluated on accuracy, speed, and simplicity. In this research, we applied a multilayered perceptron neural network (NN) system to environmental sound recognition. The input data is a one-dimensional combination of the instantaneous spectrum at the power peak and the power pattern in the time domain. The spectra of environmental sounds do not change as markedly as those of speech or voice, so the combination of power and frequency patterns retains the major features of environmental sounds while drastically reducing the amount of data. Two experiments were conducted, one using an original database and one using a database created by the RWCP. The recognition rate for 45 environmental sound data sets was about 92%. The new method is fast and simple compared with HMM-based methods, and is suitable for the on-board system of a home-use robot, e.g. a security monitoring robot or a home-helper robot.
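
The input representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The frame length, hop size, envelope length, Hann window, normalisation, hidden-layer size, and activation functions are all assumptions, since the abstract does not specify them. The function names `build_feature_vector` and `mlp_forward` are hypothetical. The sketch only shows the idea of joining the spectrum at the power-peak frame with a coarse time-domain power pattern into one vector, then passing it through a one-hidden-layer perceptron.

```python
import numpy as np


def build_feature_vector(signal, frame_len=256, hop=128, n_power_frames=16):
    """Join the spectrum at the power-peak frame with a coarse power envelope.

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    # Slice the signal into overlapping frames.
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] for i in range(n_frames)]
    )
    # Short-time power per frame: the power pattern in the time domain.
    power = np.mean(frames ** 2, axis=1)
    # Magnitude spectrum of the frame with the highest power.
    peak = int(np.argmax(power))
    spectrum = np.abs(np.fft.rfft(frames[peak] * np.hanning(frame_len)))
    # Resample the envelope to a fixed length so every sound gives the same input size.
    idx = np.linspace(0, n_frames - 1, n_power_frames)
    envelope = np.interp(idx, np.arange(n_frames), power)
    # Scale each part to [0, 1] and concatenate into one 1-D vector.
    spectrum = spectrum / (spectrum.max() + 1e-12)
    envelope = envelope / (envelope.max() + 1e-12)
    return np.concatenate([spectrum, envelope])


def mlp_forward(x, w1, b1, w2, b2):
    """Forward pass of a one-hidden-layer perceptron (assumed architecture)."""
    hidden = 1.0 / (1.0 + np.exp(-(w1 @ x + b1)))  # sigmoid hidden units
    logits = w2 @ hidden + b2
    e = np.exp(logits - logits.max())  # numerically stable softmax
    return e / e.sum()


# Usage on a synthetic decaying tone (a stand-in for a real recording).
t = np.arange(4000) / 16000.0
tone = np.exp(-40 * t) * np.sin(2 * np.pi * 1000 * t)
features = build_feature_vector(tone)

# Untrained random weights; 45 outputs is an illustrative choice that mirrors
# the number of sound data sets mentioned in the abstract.
rng = np.random.default_rng(0)
n_in, n_hidden, n_classes = features.size, 32, 45
w1 = rng.normal(0.0, 0.1, (n_hidden, n_in))
b1 = np.zeros(n_hidden)
w2 = rng.normal(0.0, 0.1, (n_classes, n_hidden))
b2 = np.zeros(n_classes)
probs = mlp_forward(features, w1, b1, w2, b2)
```

With these assumed settings the input has 129 spectral bins plus 16 envelope points, which is far smaller than a full time-frequency representation of the same sound. That reduction is the property the abstract relies on for speed and simplicity.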