Learning dependencies between case frame slots

Authors:
Hang Li;Naoki Abe
Affiliations:
Theory NEC Laboratory, RWCP, NEC, Kawasaki, Japan;Theory NEC Laboratory, RWCP, NEC, Kawasaki, Japan
Venue:
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Year:
1996

Citing 4
Cited 4

Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
Stochastic Complexity in Statistical Inquiry Theory

Stochastic Complexity in Statistical Inquiry Theory
Structural ambiguity and lexical relations

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
A rule-based approach to prepositional phrase attachment disambiguation

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2

Generalizing case frames using a thesaurus and the MDL principle

Computational Linguistics
Learning probabilistic subcategorization preference by identifying case dependencies and optimal noun class generalization level

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A procedure for multi-class discrimination and some linguistic applications

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
General-to-specific model selection for subcategorization preference

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2

Quantified Score

Hi-index	0.01

Visualization

Abstract

We address the problem of automatically acquiring case frame patterns (selectional patterns) from large corpus data. In particular, we propose a method of learning dependencies between case frame slots. We view the problem of learning case frame patterns as that of learning a multi-dimensional discrete joint distribution, where random variables represent case slots. We then formalize the dependencies between case slots as the probabilistic dependencies between these random variables. Since the number of parameters in a multi-dimensional joint distribution is exponential in general, it is infeasible to accurately estimate them in practice. To overcome this difficulty, we settle with approximating the target joint distribution by the product of low order component distributions, based on corpus data. In particular we propose to employ an efficient learning algorithm based on the MDL principle to realize this task. Our experimental results indicate that for certain classes of verbs, the accuracy achieved in a disambiguation experiment is improved by using the acquired knowledge of dependencies.