Language-independent, neural network-based, text-to-phones conversion

  • Authors:
  • Mario Malcangi;David Frontini

  • Affiliations:
  • Universití degli Studi di Milano, DICo-Dipartimento di Informatica e Comunicazione, Via Comelico 39, 20135 Milano, Italy and Laboratorio DSP&RTS (Digital Signal Processing & Real-Time Systems ...;Universití degli Studi di Milano, DICo-Dipartimento di Informatica e Comunicazione, Via Comelico 39, 20135 Milano, Italy and Laboratorio DSP&RTS (Digital Signal Processing & Real-Time Systems ...

  • Venue:
  • Neurocomputing
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

A speech synthesizer based on an artificial neural network (ANN) is being developed for application to deeply embedded systems for language-independent speech commands on hands-free interfaces. A feed-forward, backpropagation, artificial neural network has been trained for this purpose using a custom-developed, regular expression-based, text-to-phone transcription engine to generate training patterns. Initial experimental results show the expected properties of language independence and in-system learning capability of this approach. The ANN demonstrates the capacity to generalize and map the words missing at training time, as well as to reduce contradictions related to different pronunciations for the same word.