Discovering relations between noun categories

Authors:
Thahir P. Mohamed;Estevam R. Hruschka, Jr.;Tom M. Mitchell
Affiliations:
University Of Pittsburgh;Federal University of Sao Carlos;Carnegie Mellon University
Venue:
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Year:
2011

Citing 9
Cited 9

Snowball: extracting relations from large plain-text collections

DL '00 Proceedings of the fifth ACM conference on Digital libraries
Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Discovering relations among named entities from large corpora

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Coupling semi-supervised learning of categories and relations

SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Unsupervised named-entity extraction from the Web: An experimental study

Artificial Intelligence
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Coupled semi-supervised learning for information extraction

Proceedings of the third ACM international conference on Web search and data mining
Discovering relations between named entities from a large raw corpus using tree similarity-based clustering

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing

Collective intelligence as a source for machine learning self-supervision

Proceedings of the 4th International Workshop on Web Intelligence & Communities
Discovering and exploring relations on the web

Proceedings of the VLDB Endowment
Instance-driven attachment of semantic annotations over conceptual hierarchies

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
PATTY: a taxonomy of relational patterns with semantic types

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Knowledge harvesting in the big-data era

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Discovering semantic relations from the web and organizing them with PATTY

ACM SIGMOD Record
Discovering relations using matrix factorization methods

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Classifying entities into an incomplete ontology

Proceedings of the 2013 workshop on Automated knowledge base construction
Coupling as Strategy for Reducing Concept-Drift in Never-ending Learning Environments

Fundamenta Informaticae - Cognitive Informatics and Computational Intelligence: Theory and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional approaches to Relation Extraction from text require manually defining the relations to be extracted. We propose here an approach to automatically discovering relevant relations, given a large text corpus plus an initial ontology defining hundreds of noun categories (e.g., Athlete, Musician, Instrument). Our approach discovers frequently stated relations between pairs of these categories, using a two step process. For each pair of categories (e.g., Musician and Instrument) it first co-clusters the text contexts that connect known instances of the two categories, generating a candidate relation for each resulting cluster. It then applies a trained classifier to determine which of these candidate relations is semantically valid. Our experiments apply this to a text corpus containing approximately 200 million web pages and an ontology containing 122 categories from the NELL system [Carlson et al., 2010b], producing a set of 781 proposed candidate relations, approximately half of which are semantically valid. We conclude this is a useful approach to semi-automatic extension of the ontology for large-scale information extraction systems such as NELL.