A Framework for Language-Independent Analysis and Prosodic Feature Annotation of Text Corpora

Authors:
Dimitris Spiliotopoulos;Georgios Petasis;Georgios Kouroupetroglou
Affiliations:
Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece GR-15784;Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece GR-15784;Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece GR-15784
Venue:
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Year:
2008

Citing 5
Cited 1

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Computational Linguistics
Speaking the Users' Languages

IEEE Intelligent Systems
ILEX: an architecture for a dynamic hypertext generation system

Natural Language Engineering
Modeling Improved Prosody Generation from High-Level Linguistically Annotated Corpora

IEICE - Transactions on Information and Systems
A Greek morphological lexicon and its exploitation by natural language processing applications

PCI'01 Proceedings of the 8th Panhellenic conference on Informatics

Integrating contrast in a framework for predicting prosody

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Concept-to-Speech systems include Natural Language Generators that produce linguistically enriched text descriptions which can lead to significantly improved quality of speech synthesis. There are cases, however, where either the generator modules produce pieces of non-analyzed, non-annotated plain text, or such modules are not available at all. Moreover, the language analysis is restricted by the usually limited domain coverage of the generator due to its embedded grammar. This work reports on a language-independent framework basis, linguistic resources and language analysis procedures (word/sentence identification, part-of-speech, prosodic feature annotation) for text annotation/processing for plain or enriched text corpora. It aims to produce an automated XML- annotated enriched prosodic markup for English and Greek texts, for improved synthetic speech. The markup includes information for both training the synthesizer and for actual input for synthesising. Depending on the domain and target, different methods may be used for automatic classification of entities (words, phrases, sentences) to one or more preset categories such as "emphatic event", "new/old information", "second argument to verb", "proper noun phrase", etc. The prosodic features are classified according to the analysis of the speech-specific characteristics for their role in prosody modelling and passed through to the synthesizer via an extended SOLE-ML description. Evaluation results show that using selectable hybrid methods for part-of-speech tagging high accuracy is achieved. Annotation of a large generated text corpus containing 50% enriched text and 50% canned plain text produces a fully annotated uniform SOLE-ML output containing all prosodic features found in the initial enriched source. Furthermore, additional automatically-derived prosodic feature annotation and speech synthesis related values are assigned, such as word-placement in sentences and phrases, previous and next word entity relations, emphatic phrases containing proper nouns, and more.