Data driven approaches to speech and language processing

Authors:
Gérard Chollet;Kevin McTait;Dijana Petrovska-Delacrétaz
Affiliations:
CNRS-LTCI, GET-ENST, Paris cedex 13, France;CNRS-LTCI, GET-ENST, Paris cedex 13, France;GET-INT, Institut National des Télécommunications, Evry cedex, France
Venue:
Nonlinear Speech Modeling and Applications
Year:
2005

Abstract

Speech and language processing systems can be categorised according to whether they rely on predefined linguistic information and rules or are data driven, exploiting machine learning techniques to automatically extract and process relevant units of information, which are then indexed and retrieved as appropriate. For example, most state-of-the-art automatic speech processing systems rely on a representation based on predefined phonetic symbols. The use of language-dependent representations, whilst linguistically intuitive, has several drawbacks, e.g. limited portability across languages and long development time. In this article, we therefore review and present our recent experiments exploiting the idea inherent in the ALISP (Automatic Language Independent Speech Processing) approach, with particular respect to speech processing, where the intermediate representation between the acoustic and linguistic levels is automatically inferred from speech data. We then present prospective directions in which the ALISP principles could be exploited in different domains such as audio, speech, text, image and video processing.
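
To make the idea of an automatically inferred intermediate representation concrete, the following is a minimal sketch and not the ALISP pipeline itself. The abstract does not say how the units are derived, so the sketch assumes, purely for illustration, that acoustic frames are short feature vectors, that unit labels come from a plain k-means clustering, and that consecutive frames with the same label are merged into one segment. The names `infer_units` and `to_segments` and the toy data are hypothetical.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]


def sq_dist(a: Frame, b: Frame) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(frame: Frame, centroids: List[List[float]]) -> int:
    """Index of the centroid closest to the frame (lowest index wins ties)."""
    return min(range(len(centroids)), key=lambda c: sq_dist(frame, centroids[c]))


def infer_units(frames: List[Frame], k: int, iterations: int = 10) -> List[int]:
    """Assign each frame a data-driven unit label using plain k-means.

    Assumption: k-means stands in for whatever unit-inference method a real
    system would use. Requires len(frames) >= k. Initialisation is
    deterministic (evenly spaced frames) so results are reproducible.
    """
    step = len(frames) // k
    centroids = [list(frames[i * step]) for i in range(k)]
    labels: List[int] = []
    for _ in range(iterations):
        # Assignment step: label every frame with its closest centroid.
        labels = [nearest(f, centroids) for f in frames]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [f for f, lab in zip(frames, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels


def to_segments(labels: List[int]) -> List[Tuple[int, int, int]]:
    """Merge runs of identical labels into (unit, start, end) segments.

    `end` is exclusive, so a segment covers frames start .. end - 1.
    """
    segments: List[Tuple[int, int, int]] = []
    for i, lab in enumerate(labels):
        if segments and segments[-1][0] == lab:
            unit, start, _ = segments[-1]
            segments[-1] = (unit, start, i + 1)
        else:
            segments.append((lab, i, i + 1))
    return segments


if __name__ == "__main__":
    # Toy two-dimensional "acoustic" frames; real features would be richer.
    toy_frames = [(0.0, 0.1), (0.1, 0.0), (0.0, 0.0),
                  (5.0, 5.1), (5.1, 5.0), (0.1, 0.1)]
    print(to_segments(infer_units(toy_frames, k=2)))
```

The resulting segment sequence plays the role that a phonetic transcription plays in a rule-based system, but no phonetic inventory was specified in advance: the unit labels are arbitrary indices derived from the data alone.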