Bidirectional LSTM networks for improved phoneme classification and recognition

  • Authors:
  • Alex Graves;Santiago Fernández;Jürgen Schmidhuber

  • Affiliations:
  • IDSIA, Manno-Lugano, Switzerland;IDSIA, Manno-Lugano, Switzerland;TU Munich, Munich, Germany

  • Venue:
  • ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we carry out two experiments on the TIMIT speech corpus with bidirectional and unidirectional Long Short Term Memory (LSTM) networks. In the first experiment (framewise phoneme classification) we find that bidirectional LSTMoutperforms both unidirectional LSTMand conventional Recurrent Neural Networks (RNNs). In the second (phoneme recognition) we find that a hybrid BLSTM-HMM system improves on an equivalent traditional HMM system, as well as unidirectional LSTM-HMM.