Unsupervised learning of dependency structure for language modeling

  • Authors:
  • Jianfeng Gao; Hisami Suzuki

  • Affiliations:
  • Microsoft Research Asia, Beijing, China; Microsoft Research, Redmond, WA

  • Venue:
  • ACL '03: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics - Volume 1
  • Year:
  • 2003

Abstract

This paper presents a dependency language model (DLM) that captures linguistic constraints via a dependency structure, i.e., a set of probabilistic dependencies that express the relations between headwords of each phrase in a sentence by an acyclic, planar, undirected graph. Our contributions are three-fold. First, we incorporate the dependency structure into an n-gram language model to capture long-distance word dependencies. Second, we present an unsupervised learning method that discovers the dependency structure of a sentence using a bootstrapping procedure. Finally, we evaluate the proposed models on a realistic application (Japanese Kana-Kanji conversion). Experiments show that the best DLM achieves an 11.3% error rate reduction over the word trigram model.
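The abstract describes combining a word n-gram model with headword dependency probabilities. As a rough illustration (not the authors' actual formulation), one simple way to combine the two sources is linear interpolation of a trigram estimate with a head-dependent attachment estimate; all counts, names, and the interpolation weight below are invented for illustration:

```python
# Hypothetical sketch: interpolating a word-trigram probability with a
# headword-dependency probability,
#   P(w | context) ~= lam * P_tri(w | w2, w1) + (1 - lam) * P_dep(w | head),
# where head is the (assumed known) headword that w attaches to.
# This is NOT the paper's model; it only illustrates the general idea.

def make_dlm(trigram_counts, bigram_counts, dep_counts, head_counts, lam=0.7):
    """Return a probability function built from toy count tables."""
    def prob(w, w1, w2, head):
        # Trigram maximum-likelihood estimate; max(..., 1) avoids division by zero.
        p_tri = trigram_counts.get((w2, w1, w), 0) / max(bigram_counts.get((w2, w1), 0), 1)
        # Dependency estimate: how often w attaches to this headword.
        p_dep = dep_counts.get((head, w), 0) / max(head_counts.get(head, 0), 1)
        return lam * p_tri + (1 - lam) * p_dep
    return prob

# Toy counts, purely illustrative.
prob = make_dlm(
    trigram_counts={("the", "dog", "barked"): 2},
    bigram_counts={("the", "dog"): 4},
    dep_counts={("dog", "barked"): 3},
    head_counts={"dog": 6},
)

p = prob("barked", w1="dog", w2="the", head="dog")
print(p)  # 0.7 * 0.5 + 0.3 * 0.5 = 0.5
```

In the paper itself, the dependency structure supplying the head of each word is not given but is learned without supervision via bootstrapping; the sketch assumes the head is already known.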