Speech/music discrimination via energy density analysis

  • Authors:
  • Stanisław Kacprzak;Mariusz Ziółko

  • Affiliations:
  • Department of Electronics, AGH University of Science and Technology, Kraków, Poland;Department of Electronics, AGH University of Science and Technology, Kraków, Poland

  • Venue:
  • SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we suggest to apply a new feature, called Minimum Energy Density (MED), in discrimination of audio signals between speech and music. Our method is based on the analysis of local energy for 1 or 2.5 seconds audio signals. An elementary analysis of the probability for the power distribution is an effective tool supporting the decision making system. We compare our feature with Percentage of Low Energy Frames (LEF), Modified Low Energy Ratio (MLER) and examine their efficiency for two separate speech/music corpora.