Inductive probabilistic taxonomy learning using singular value decomposition

Authors:
Francesca Fallucchi; Fabio Massimo Zanzotto
Affiliations:
Department of Computer Science, Systems and Production, University of Rome "Tor Vergata", Italy (both authors). Emails: fallucchi@info.uniroma2.it, zanzotto@info.uniroma2.it
Venue:
Natural Language Engineering
Year:
2011

Abstract

Capturing word meaning is one of the challenges of natural language processing (NLP). Formal models of meaning, such as networks of words or concepts, are knowledge repositories used in a variety of applications. To be used effectively, these networks have to be large or, at least, adapted to specific domains. Learning word meaning from text is therefore an active area of research. Lexico-syntactic pattern methods are one possible solution. Yet these methods do not use structural properties of the target semantic relations, e.g. transitivity, during learning. In this paper, we propose a novel probabilistic method, based on lexico-syntactic patterns, for learning taxonomies; it explicitly models transitivity and naturally exploits vector space model techniques for reducing the dimensionality of the feature space. We define two probabilistic models: the direct probabilistic model and the induced probabilistic model. The first is estimated directly from observations over text collections. The second applies transitivity to the direct probabilistic model to induce the probabilities of derived events. Within our probabilistic model, we also propose a novel way of using singular value decomposition as an unsupervised method for feature selection in estimating direct probabilities. We empirically show that the induced probabilistic taxonomy learning model outperforms state-of-the-art probabilistic models and that our unsupervised feature selection method improves performance.
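
To make the relation between the direct and the induced model more concrete, the following is a minimal sketch of how transitivity might turn direct probabilities into induced ones. The abstract does not give the formula, so everything here is an assumption made for illustration: the function name `induced_probabilities`, the toy probabilities, the restriction to a single intermediate term, and the noisy-or combination (which treats each piece of evidence as independent). It is not the authors' estimator, and it does not cover the singular value decomposition step.

```python
from itertools import product


def induced_probabilities(direct):
    """Combine direct is-a probabilities with one-step transitive evidence.

    `direct` maps (hyponym, hypernym) pairs to direct probabilities.
    Assumption for this sketch: evidence sources are independent, so they
    are combined with a noisy-or. This is illustrative, not the paper's model.
    """
    terms = sorted({term for pair in direct for term in pair})
    induced = {}
    for i, k in product(terms, repeat=2):
        if i == k:
            continue
        # Probability that neither the direct pair nor any chain i -> j -> k holds.
        p_none = 1.0 - direct.get((i, k), 0.0)
        for j in terms:
            if j in (i, k):
                continue
            p_chain = direct.get((i, j), 0.0) * direct.get((j, k), 0.0)
            p_none *= 1.0 - p_chain
        induced[(i, k)] = 1.0 - p_none
    return induced


# Hypothetical direct probabilities, as if estimated from pattern counts.
direct = {
    ("dog", "mammal"): 0.9,
    ("mammal", "animal"): 0.8,
    ("dog", "animal"): 0.3,
}
induced = induced_probabilities(direct)
print(round(induced[("dog", "animal")], 3))
```

In this toy setting the weakly attested pair ("dog", "animal") gains support from the chain through "mammal", which is the kind of effect the induced model is described as capturing.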