Word Topic Models for Spoken Document Retrieval and Transcription

Authors:
Berlin Chen
Affiliations:
National Taiwan Normal University
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2009

Abstract

Statistical language modeling (LM), which aims to capture the regularities in human natural language and to quantify the acceptability of a given word sequence, has long been an interesting yet challenging research topic in the speech and language processing community. It has also been applied to information retrieval (IR) problems, where it provides an effective and theoretically attractive probabilistic framework for building IR systems. In this article, we propose a word topic model (WTM) that explores the co-occurrence relationships between words, as well as long-span latent topical information, for language modeling in spoken document retrieval and transcription. The document, or the search history, is modeled as a whole as a composite WTM for generating a newly observed word. The underlying characteristics and different kinds of model structures are extensively investigated, and the performance of WTM is analyzed and verified by comparison with the well-known probabilistic latent semantic analysis (PLSA) model and other models. The IR experiments are performed on the TDT Chinese collections (TDT-2 and TDT-3), while the large vocabulary continuous speech recognition (LVCSR) experiments are conducted on Mandarin broadcast news collected in Taiwan. The experimental results suggest that WTM is a promising alternative to existing models.
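
The abstract gives no equations, so the following is only a minimal sketch of how a composite word-level topic model might score a newly observed word. It assumes a PLSA-style mixture in which each word w_j in the document (or search history) has its own topic mixture P(T_k | M_wj), and the per-word models are combined using the relative frequency of w_j. The function name, the parameter names, and all numbers are hypothetical and are not taken from the article.

```python
import numpy as np

def composite_wtm_prob(doc_counts, p_word_given_topic, p_topic_given_wordmodel):
    """Assumed form: P(w | D) = sum_j P(w_j | D) * sum_k P(w | T_k) * P(T_k | M_wj)."""
    doc_counts = np.asarray(doc_counts, dtype=float)
    # P(w_j | D): relative frequency of each vocabulary word in the document.
    weights = doc_counts / doc_counts.sum()
    # Row j holds the vocabulary distribution predicted by the model of word w_j.
    per_word_model = p_topic_given_wordmodel @ p_word_given_topic  # shape (V, V)
    # Mix the per-word models by how often each word occurs in the document.
    return weights @ per_word_model                                 # shape (V,)

# Toy, made-up parameters: a 3-word vocabulary and 2 latent topics.
p_word_given_topic = np.array([[0.7, 0.2, 0.1],
                               [0.1, 0.3, 0.6]])       # rows are topics T_k
p_topic_given_wordmodel = np.array([[0.9, 0.1],
                                    [0.5, 0.5],
                                    [0.2, 0.8]])       # rows are word models M_wj
doc_counts = [2, 1, 0]                                 # word counts in the document

p_next = composite_wtm_prob(doc_counts, p_word_given_topic, p_topic_given_wordmodel)
print(p_next)
```

Because every row of both parameter matrices sums to one, the returned vector is itself a probability distribution over the vocabulary. In a retrieval setting under these assumptions, a query could be scored by multiplying such probabilities over its words; in transcription, the search history would play the role of the document.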