Inducing a semantically annotated lexicon via EM-based clustering

Authors:
Mats Rooth;Stefan Riezler;Detlef Prescher;Glenn Carroll;Franz Beil
Affiliations:
University of Stuttgart, Germany;University of Stuttgart, Germany;University of Stuttgart, Germany;University of Stuttgart, Germany;University of Stuttgart, Germany
Venue:
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Year:
1999

Citing 5
Cited 49

Selection and information: a class-based approach to lexical relationships

Selection and information: a class-based approach to lexical relationships
Similarity-Based Models of Word Cooccurrence Probabilities

Machine Learning - Special issue on natural language learning
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
An experiment on learning appropriate Selectional Restrictions from a parsed corpus

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Inside-outside estimation of a lexicalized PCFG for German

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics

Class-based probability estimation using a semantic hierarchy

Computational Linguistics
Automatic labeling of semantic roles

Computational Linguistics
Identifying semantic relations in text

Exploring artificial intelligence in the new millennium
A probabilistic account of logical metonymy

Computational Linguistics
Using the web to obtain frequencies for unseen bigrams

Computational Linguistics - Special issue on web as corpus
Word clustering and disambiguation based on co-occurrence data

Natural Language Engineering
Exploiting auxiliary distributions in stochastic unification-based grammars

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Learning word clusters from data types

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Using a probabilistic class-based lexicon for lexical ambiguity resolution

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Test Data Likelihood for PLSA Models

Information Retrieval
An unsupervised learning method for associative relationships between verb phrases

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Probabilistic models of verb-argument structure

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Class-based probability estimation using a semantic hierarchy

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Inducing probabilistic syllable classes using multivariate clustering

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity

Computational Linguistics
Identifying events using similarity and context

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Experiments on the Automatic Induction of German Semantic Verb Classes

Computational Linguistics
Automated induction of sense in context

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Building a dynamic lexicon from a digital library

Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
A general feature space for automatic verb classification

Natural Language Engineering
Can human verb associations help identify salient features for semantic verb classification?

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Using hidden Markov random fields to combine distributional and pattern-based word clustering

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Discriminative learning of selectional preference from unlabeled text

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Acquiring word-meaning mappings for natural language interfaces

Journal of Artificial Intelligence Research
Linguistic distances

LD '06 Proceedings of the Workshop on Linguistic Distances
Revealing phonological similarities between related languages from automatically generated parallel corpora

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Hypernym discovery based on distributional similarity and hierarchical structures

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
A non-negative tensor factorization model for selectional preference induction

GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
Classifying Japanese polysemous verbs based on fuzzy C-means clustering

TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
A latent dirichlet allocation method for selectional preferences

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Latent variable models of selectional preference

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Improving the use of pseudo-words for evaluating selectional preferences

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Collocation extraction beyond the independence assumption

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Automatic selectional preference acquisition for Latin verbs

ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Semantic role features for machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Metaphor identification using verb and noun clustering

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A non-negative tensor factorization model for selectional preference induction

Natural Language Engineering
A flexible, corpus-driven model of regular and inverse selectional preferences

Computational Linguistics
Unsupervised learning of verb argument structures

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Unsupervised learning of selectional restrictions and detection of argument coercions

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Probabilistic models of similarity in syntactic context

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
LDA-Frames: an unsupervised approach to generating semantic frames

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Inferring selectional preferences from part-of-speech N-grams

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
"Could you make me a favour and do coffee, please?": implications for automatic error correction in English and Dutch

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Learning semantics and selectional preference of adjective-noun pairs

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Regular polysemy: a distributional model

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Evaluating automatic annotation: automatically detecting and enriching instances of the dative alternation

Language Resources and Evaluation
Exploiting language models to recognize unseen actions

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Statistical metaphor processing

Computational Linguistics

Quantified Score

Hi-index	0.01

Visualization

Abstract

We present a technique for automatic induction of slot annotations for subcategorization frames, based on induction of hidden classes in the EM framework of statistical estimation. The models are empirically evaluated by a general decision test. Induction of slot labeling for subcategorization frames is accomplished by a further application of EM, and applied experimentally on frame observations derived from parsing large corpora. We outline an interpretation of the learned representations as theoretical-linguistic decompositional lexical entries.