Time-dependent genre recognition by means of instantaneous frequency spectrum based on Hilbert-Huang transform

  • Authors:
  • Tatiana Endrjukaite;Naoko Kosugi

  • Affiliations:
  • NTT Communication Science Labs., Kanagawa, Japan;NTT Communication Science Labs., Kanagawa, Japan

  • Venue:
  • Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new method of music genre recognition. Even for people it is difficult to define musical genres, because a genre is something more than a set of rules. Automation of this task could improve the work of multiple audio-related WEB portals, such as audio-libraries, and could simplify human activity in other music-related areas. For music genre recognition, we introduce the instantaneous frequency spectrum (IFS) whose calculation is based on the Hilbert-Huang transform. In our method, IFSs of audio signals are generated from their instantaneous frequencies and used to calculate music genre templates. The experimental results for three test music pieces show that the method can accurately detect and differentiate genres of tunes. Slicing test music into frames and recognizing genres for short fragments of a whole music piece gives a precise description of a piece's internal structure, which could help to enhance people's understanding of the music. Presentation of this information also is an advance in music visualization.