Semantic taxonomy induction from heterogenous evidence

Authors:
Rion Snow;Daniel Jurafsky;Andrew Y. Ng
Affiliations:
Stanford University, Stanford, CA;Stanford University, Stanford, CA;Stanford University, Stanford, CA
Venue:
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Year:
2006

Citing 12
Cited 118

CYC: a large-scale investment in knowledge infrastructure

Communications of the ACM
Automatic construction of a hypernym-labeled noun hierarchy from text

Automatic construction of a hypernym-labeled noun hierarchy from text
Noun-phrase co-occurrence statistics for semiautomatic semantic lexicon construction

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Noun classification from predicate-argument structures

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Fine grained classification of named entities

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Learning semantic constraints for the automatic discovery of part-whole relations

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Unsupervised named-entity extraction from the web: an experimental study

Artificial Intelligence
Using LSA and noun coordination information to improve the precision and recall of automatic hyponymy extraction

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Supersense tagging of unknown nouns in WordNet

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Randomized algorithms and NLP: using locality sensitive hash function for high speed noun clustering

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Finding instance names and alternative glosses on the web: wordnet reloaded

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing

Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Googleology is Bad Science

Computational Linguistics
Automatically refining the wikipedia infobox ontology

Proceedings of the 17th international conference on World Wide Web
Information extraction from Wikipedia: moving down the long tail

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
YAGO: A Large Ontology from Wikipedia and WordNet

Web Semantics: Science, Services and Agents on the World Wide Web
Classification-Based Filtering of Semantic Relatedness in Hypernymy Extraction

GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Metric-based ontology learning

Proceedings of the 2nd international workshop on Ontologies and information systems for the semantic web
Learning the distance metric in a personal ontology

Proceedings of the 2nd international workshop on Ontologies and information systems for the semantic web
Word sense disambiguation: A survey

ACM Computing Surveys (CSUR)
Ontology Learning and Reasoning -- Dealing with Uncertainty and Inconsistency

Uncertainty Reasoning for the Semantic Web I
Exploiting Hyponymy in Extracting Relations and Enhancing Ontologies

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Combining image captions and visual analysis for image concept classification

Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Using Wikipedia to bootstrap open information extraction

ACM SIGMOD Record
Automatic Acquisition of Qualia Structure from Corpus Data

IEICE - Transactions on Information and Systems
Constructing folksonomies from user-specified relations on flickr

Proceedings of the 18th international conference on World wide web
A Term-Based Driven Clustering Approach for Name Disambiguation

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Extracting hypernym pairs from the web

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
The WordNet Weaver: Multi-criteria Voting for Semi-automatic Extension of a Wordnet

Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
Acquisition of a New Type of Lexical-Semantic Relation from German Corpora

Proceedings of the 2008 conference on New Trends in Multimedia and Network Information Systems
Multilingual Evidence Improves Clustering-based Taxonomy Extraction

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Web-derived resources for web information retrieval: from conceptual hierarchies to attribute hierarchies

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Representing words as regions in vector space

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Lexical patterns or dependency patterns: which is better for hypernym extraction?

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Evaluating the inferential utility of lexical-semantic resources

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Using cycles and quasi-cycles to disambiguate dictionary glosses

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Online word games for semantic data collection

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Deriving a large scale taxonomy from Wikipedia

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Turning web text and search queries into factual knowledge: hierarchical class attribute extraction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Finding cars, goddesses and enzymes: parametrizable acquisition of labeled instances for open-domain information extraction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Intelligence in wikipedia

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Unsupervised methods for determining object and relation synonyms on the web

Journal of Artificial Intelligence Research
KnowNet: a proposal for building highly connected and dense knowledge bases from the web

STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
Experiments with an annotation scheme for a knowledge-rich noun phrase interpretation system

LAW '07 Proceedings of the Linguistic Annotation Workshop
Towards a universal wordnet by learning from combined evidence

Proceedings of the 18th ACM conference on Information and knowledge management
Context sensitive synonym discovery for web search queries

Proceedings of the 18th ACM conference on Information and knowledge management
Experiments on pattern-based relation learning

Proceedings of the 18th ACM conference on Information and knowledge management
Large-scale taxonomy mapping for restructuring and integrating wikipedia

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
A metric-based framework for automatic taxonomy induction

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Automatic set instance extraction using the web

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Extracting lexical reference rules from Wikipedia

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Latent variable models of concept-attribute attachment

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Supporting inferences in semantic space: representing words as regions

IWCS-8 '09 Proceedings of the Eighth International Conference on Computational Semantics
Using Topic Models to Interpret MEDLINE's Medical Subject Headings

AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
Probabilistic Ontology Learner in Semantic Turkey

AI*IA '09: Proceedings of the XIth International Conference of the Italian Association for Artificial Intelligence Reggio Emilia on Emergent Perspectives in Artificial Intelligence
Enhancement of lexical concepts using cross-lingual web mining

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Hypernym discovery based on distributional similarity and hierarchical structures

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Character-level analysis of semi-structured documents for set expansion

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
SVD feature selection for probabilistic taxonomy learning

GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
Random walks for text semantic similarity

TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
Augmenting WordNet-based inference with argument mapping

TextInfer '09 Proceedings of the 2009 Workshop on Applied Textual Inference
Using hypernymy acquisition to tackle (part of) textual entailment

TextInfer '09 Proceedings of the 2009 Workshop on Applied Textual Inference
Automatic acquisition of attribute host by selectional constraint resolution

MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Comparison of generality based algorithm variants for automatic taxonomy generation

IIT'09 Proceedings of the 6th international conference on Innovations in information technology
Refining non-taxonomic relation labels with external structured data to support ontology learning

Data & Knowledge Engineering
Growing a tree in the forest: constructing folksonomies by integrating structured metadata

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Improved extraction assessment through better language models

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Extracting glosses to disambiguate word senses

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
BabelNet: building a very large multilingual semantic network

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Unsupervised ontology induction from text

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Global learning of focused entailment graphs

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Knowledge-rich Word Sense Disambiguation rivaling supervised systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
An active learning approach to finding related terms

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
From frequency to meaning: vector space models of semantics

Journal of Artificial Intelligence Research
Learning first-order Horn clauses from web text

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Constraints based taxonomic relation classification

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
A semi-supervised method to learn and construct taxonomies using the web

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
A mixture model with sharing for lexical semantics

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
MENTA: inducing multilingual taxonomies from wikipedia

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Automated translation of semantic relationships

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Co-STAR: a co-training style algorithm for hyponymy relation acquisition from structured and unstructured text

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Automatic discovery of word semantic relations using paraphrase alignment and distributional lexical semantics analysis

Natural Language Engineering
Bootstrapping location relations from text

Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47
The role of queries in ranking labeled instances extracted from text

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Inductive probabilistic taxonomy learning using singular value decomposition

Natural Language Engineering
Taxonomy induction based on a collaboratively built knowledge repository

Artificial Intelligence
Heterogeneous knowledge sources in graph-based expansion of the polish wordnet

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Ontology population and enrichment: state of the art

Knowledge-driven multimedia information extraction and ontology evolution
Which noun phrases denote which concepts?

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Global learning of typed entailment rules

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Nonlinear evidence fusion and propagation for hyponymy relation mining

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Fine-grained class label markup of search queries

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Semi-supervised frame-semantic parsing for unknown predicates

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Ranking class labels using query sessions

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Recovering semantics of tables on the web

Proceedings of the VLDB Endowment
A supervised method of feature weighting for measuring semantic relatedness

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Semantic relations in bilingual lexicons

ACM Transactions on Speech and Language Processing (TSLP)
Towards strict sentence intersection: decoding and evaluation strategies

MTTG '11 Proceedings of the Workshop on Monolingual Text-To-Text Generation
Learning a taxonomy from a set of text documents

Applied Soft Computing
Learning entailment relations by global graph structure optimization

Computational Linguistics
Sequence clustering and labeling for unsupervised query intent discovery

Proceedings of the fifth ACM international conference on Web search and data mining
Class label enhancement via related instances

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Domain-assisted product aspect hierarchy generation: towards hierarchical organization of unstructured consumer reviews

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Random walk inference and learning in a large scale knowledge base

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Cross-cutting models of lexical semantics

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Automatically structuring domain knowledge from text: An overview of current research

Information Processing and Management: an International Journal
Evaluation method for automated wordnet expansion

SIIS'11 Proceedings of the 2011 international conference on Security and Intelligent Information Systems
Probase: a probabilistic taxonomy for text understanding

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Optimizing index for taxonomy keyword search

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Learning bilingual lexicons using the visual similarity of labeled web images

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
A graph-based algorithm for inducing lexical taxonomies from scratch

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Ontology learning from text: A look back and into the future

ACM Computing Surveys (CSUR)
Entailment above the word level in distributional semantics

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Taxonomy induction using hierarchical random graphs

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Efficient tree-based approximation for entailment graph learning

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Constructing task-specific taxonomies for document collection browsing

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Resolving task specification and path inconsistency in taxonomy construction

Proceedings of the 3rd Workshop on the People's Web Meets NLP: Collaboratively Constructed Semantic Resources and their Applications to NLP
BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network

Artificial Intelligence
Web 2.0, Language Resources and standards to automatically build a multilingual Named Entity Lexicon

Language Resources and Evaluation
Transforming Wikipedia into a large scale multilingual concept network

Artificial Intelligence
Lexical activation area attachment algorithm for wordnet expansion

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Extraction, evaluation and integration of lexical-semantic relations for the automated construction of a lexical ontology

AOW '07 Proceedings of the Third Australasian Workshop on Advances in Ontologies - Volume 85
Open domain knowledge extraction: inference on a web scale

Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
Topic hierarchy construction for the organization of multi-source user generated contents

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Extracting meronyms for a biology knowledge base using distant supervision

Proceedings of the 2013 workshop on Automated knowledge base construction
Tailoring the automated construction of large-scale taxonomies using the web

Language Resources and Evaluation
Acquisition of open-domain classes via intersective semantics

Proceedings of the 23rd international conference on World wide web
A hierarchical Dirichlet model for taxonomy expansion for search engines

Proceedings of the 23rd international conference on World wide web
Image categorization using a semantic hierarchy model with sparse set of salient regions

Frontiers of Computer Science: Selected Publications from Chinese Universities

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a novel algorithm for inducing semantic taxonomies. Previous algorithms for taxonomy induction have typically focused on independent classifiers for discovering new single relationships based on hand-constructed or automatically discovered textual patterns. By contrast, our algorithm flexibly incorporates evidence from multiple classifiers over heterogenous relationships to optimize the entire structure of the taxonomy, using knowledge of a word's coordinate terms to help in determining its hypernyms, and vice versa. We apply our algorithm on the problem of sense-disambiguated noun hyponym acquisition, where we combine the predictions of hypernym and coordinate term classifiers with the knowledge in a preexisting semantic taxonomy (WordNet 2.1). We add 10,000 novel synsets to WordNet 2.1 at 84% precision, a relative error reduction of 70% over a non-joint algorithm using the same component classifiers. Finally, we show that a taxonomy built using our algorithm shows a 23% relative F-score improvement over WordNet 2.1 on an independent testset of hypernym pairs.