Study of best algorithm combinations for speech processing tasks in machine learning using median vs. mean clusters in MARF

  • Authors:
  • Serguei A. Mokhov

  • Affiliations:
  • Concordia University, Montreal, Quebec, Canada

  • Venue:
  • Proceedings of the 2008 C3S2E conference
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work reports experimental results and their analysis in various speech processing tasks using SpeakerIdentApp, a text-independent speaker identification application, based on Modular Audio Recognition Framework (MARF)'s API and its implementation in terms of best of the available algorithm configurations for each particular task using median clusters as opposed to the default mean clusters. This study focuses on the tasks of identification of speakers' as of who they are, their gender, and accent through machine learning. This work significantly complements two preceding statistical studies undertaken using only mean clusters and shows the difference in selection of the best algorithm combinations using the median cluster approach. To the author's knowledge there was no any previous comprehensive study in this regard.