Spoken language resources for Cantonese speech processing

Authors:
Tan Lee;W. K. Lo;P. C. Ching;Helen Meng
Affiliations:
Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong;Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
Venue:
Speech Communication
Year:
2002

Citing 5
Cited 3

Evaluation of spoken language systems: the ATIS domain

HLT '90 Proceedings of the workshop on Speech and Natural Language
Japanese large-vocabulary continuous-speech recognition using a newspaper corpus and broadcast news

Speech Communication
A Chinese Text-to-Speech System Based on Part-of-Speech Analysis, Prosodic Modeling and Non-Uniform Units

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
European Speech Databases for Telephone Applications

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 3 - Volume 3
The design for the wall street journal-based CSR corpus

HLT '91 Proceedings of the workshop on Speech and Natural Language

A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Analysis and modeling of F0 contours for cantonese text-to-speech

ACM Transactions on Asian Language Information Processing (TALIP)
HKUST/MTS: a very large scale mandarin telephone speech corpus

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes the development of CU Corpora, a series of large-scale speech corpora for Cantonese. Cantonese is the most commonly spoken Chinese dialect in Southern China and Hong Kong. CU Corpora are the first of their kind and intended to serve as an important infrastructure for the advancement of speech recognition and synthesis technologies for this widely used Chinese dialect. They contain a large amomat of speech data that cover various linguistic units of spoken Cantonese, including isolated syllables, polysyllabic words and continuous sentences. While some of the corpora are created for specific applications of common interest, the others are designed with emphasis on the coverage and distributions of different phonetic units, including the contextual ones. The speech data are annotated manually so as to provide sufficient orthographic and phonetic information for the development of different applications. Statistical analysis of the annotated data shows that CU Corpora contain rich and balanced phonetic content. The usefulness of the corpora is also demonstrated with a number of speech recognition and speech synthesis applications.