Learning random walk models for inducing word dependency distributions

  • Authors:
  • Kristina Toutanova; Christopher D. Manning; Andrew Y. Ng

  • Affiliations:
  • Stanford University, Stanford, CA (all three authors)

  • Venue:
  • ICML '04: Proceedings of the Twenty-First International Conference on Machine Learning
  • Year:
  • 2004

Abstract

Many NLP tasks rely on accurately estimating word dependency probabilities P(w1|w2), where the words w1 and w2 have a particular relationship (such as verb-object). Because of the sparseness of counts of such dependencies, smoothing and the ability to use multiple sources of knowledge are important challenges. For example, if the probability P(N|V) of noun N being the subject of verb V is high, and V takes similar objects to V', and V' is synonymous to V'', then we want to conclude that P(N|V'') should also be reasonably high, even when those words did not co-occur in the training data. To capture these higher-order relationships, we propose a Markov chain model whose stationary distribution is used to give word probability estimates. Unlike the manually defined random walks used in some link analysis algorithms, we show how to automatically learn a rich set of parameters for the Markov chain's transition probabilities. We apply this model to the task of prepositional phrase attachment, obtaining an accuracy of 87.54%.
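
To make the idea concrete, here is a minimal, self-contained sketch of estimating dependency probabilities from the limiting distribution of a random walk over a word graph. This is not the paper's implementation: the vocabulary, link matrices, and mixing weights below are invented for illustration, and the restart-to-source walk is a common personalized-PageRank-style variant rather than the learned walk the paper describes.

```python
import numpy as np

# Toy vocabulary; the words, links, and weights below are all invented
# for illustration and are not from the paper.
words = ["eat", "devour", "consume", "apple", "pasta"]
n = len(words)

# One matrix per link type. Entry [i, j] is the strength of the link
# from word i to word j under that relation.
cooc = np.array([            # verb-object co-occurrence counts (made up)
    [0, 0, 0, 3, 5],         # eat     -> apple, pasta
    [0, 0, 0, 0, 2],         # devour  -> pasta
    [0, 0, 0, 4, 0],         # consume -> apple
    [3, 0, 4, 0, 0],         # apple   -> eat, consume
    [5, 2, 0, 0, 0],         # pasta   -> eat, devour
], dtype=float)
syn = np.array([             # synonymy links among the verbs (made up)
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
], dtype=float)

# Mix the link types with per-type weights (the paper *learns* such
# parameters; 0.7/0.3 are assumed here), then row-normalize so each row
# is a probability distribution: the chain's transition matrix.
T = 0.7 * cooc + 0.3 * syn
T = T / T.sum(axis=1, keepdims=True)

def walk_distribution(T, start, restart=0.1, iters=200):
    """Limiting distribution of a walk that restarts at `start` with
    probability `restart` on each step (personalized-PageRank style),
    computed by power iteration."""
    p = start.copy()
    for _ in range(iters):
        p = (1 - restart) * (p @ T) + restart * start
    return p

# Estimate P(noun | "devour"): start the walk at "devour" and read off
# the probabilities of the noun nodes. Synonymy with "eat" lets the walk
# reach "apple" even though devour-apple never co-occurred above.
start = np.zeros(n)
start[words.index("devour")] = 1.0
p = walk_distribution(T, start)
for w, prob in zip(words, p):
    print(f"P({w} | devour) = {prob:.3f}")
```

Note how the higher-order inference from the abstract falls out of the walk: probability mass flows from "devour" through the synonymy edge to "eat" and then to "apple", so P(apple|devour) is nonzero despite a zero direct co-occurrence count.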