Efficient Parsing of Romanian Language for Text-to-Speech Purposes

  • Authors:
  • Andrei Şaupe; Lucian Radu Teodorescu; Mihai Alexandru Ordean; Răzvan Boldizsar; Mihaela Ordean; Gheorghe Cosmin Silaghi

  • Affiliations:
  • iQuest Technologies, Cluj-Napoca, Romania; iQuest Technologies, Cluj-Napoca, Romania; iQuest Technologies, Cluj-Napoca, Romania; iQuest Technologies, Cluj-Napoca, Romania; iQuest Technologies, Cluj-Napoca, Romania; Babeş-Bolyai University of Cluj-Napoca, Romania

  • Venue:
  • TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
  • Year:
  • 2009

Abstract

This paper presents the design of the text analysis component of a text-to-speech (TTS) system for the Romanian language. Text analysis is performed in two steps: document structure detection and text normalization. The output is a tree-based representation of the processed data. Parsing is made efficient with the Boost Spirit LL parser [1]; using this tool also allows greater flexibility in the source code and in the output representation.
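
The two-step pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it replaces the Boost Spirit grammar with hand-written splitting so that it compiles with the standard library alone (C++17), and the names `Node`, `normalize_token`, and `analyze` are hypothetical. Structure detection is assumed to mean splitting paragraphs on blank lines and sentences on final punctuation; normalization is reduced to spelling out single digits in Romanian.

```cpp
#include <cctype>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical tree node: kind is "document", "paragraph", "sentence" or "word".
struct Node {
    std::string kind;
    std::string text;            // filled in only for "word" leaves
    std::vector<Node> children;  // empty for "word" leaves
};

// Step 2 (text normalization), illustrative only:
// spell out a single-digit token in Romanian, leave anything else unchanged.
std::string normalize_token(const std::string& tok) {
    static const char* digits[] = {"zero", "unu", "doi", "trei", "patru",
                                   "cinci", "șase", "șapte", "opt", "nouă"};
    if (tok.size() == 1 && std::isdigit(static_cast<unsigned char>(tok[0])))
        return digits[tok[0] - '0'];
    return tok;
}

// Step 1 (document structure detection) combined with step 2:
// blank lines close a paragraph; a token ending in '.', '!' or '?' closes a sentence.
Node analyze(const std::string& input) {
    Node doc{"document", "", {}};
    Node para{"paragraph", "", {}};
    Node sent{"sentence", "", {}};

    auto close_sentence = [&] {
        if (!sent.children.empty()) {
            para.children.push_back(sent);
            sent.children.clear();
        }
    };
    auto close_paragraph = [&] {
        close_sentence();
        if (!para.children.empty()) {
            doc.children.push_back(para);
            para.children.clear();
        }
    };

    std::istringstream lines(input);
    std::string line;
    while (std::getline(lines, line)) {
        if (line.find_first_not_of(" \t\r") == std::string::npos) {
            close_paragraph();  // blank line: paragraph boundary
            continue;
        }
        std::istringstream words(line);
        std::string tok;
        while (words >> tok) {
            const char last = tok.back();
            const bool ends = (last == '.' || last == '!' || last == '?');
            if (ends) tok.pop_back();  // strip the sentence-final mark
            if (!tok.empty())
                sent.children.push_back(Node{"word", normalize_token(tok), {}});
            if (ends) close_sentence();
        }
    }
    close_paragraph();  // flush whatever is still open at end of input
    return doc;
}
```

Calling `analyze("Am 3 mere. Tu ai 2.\n\nSalut!")` yields a document node with two paragraph children; the digits in the first paragraph appear in the tree as the words "trei" and "doi". A real system would need far richer normalization (multi-digit numbers, abbreviations, dates), which is where a grammar-based parser such as the one named in the abstract would come in.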