Discovering relations among named entities from large corpora

Authors:
Takaaki Hasegawa;Satoshi Sekine;Ralph Grishman
Affiliations:
Nippon Telegraph and Telephone Corporation, Yokosuka, Kanagawa, Japan;New York University, New York, NY;New York University, New York, NY
Venue:
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Year:
2004

Citing 5
Cited 84

Snowball: extracting relations from large plain-text collections

DL '00 Proceedings of the fifth ACM conference on Digital libraries
DIRT @SBT@discovery of inference rules from text

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting Patterns and Relations from the World Wide Web

WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Learning surface text patterns for a Question Answering system

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Kernel methods for relation extraction

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10

Mining knowledge from text using information extraction

ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Mining community structure of named entities from free text

Proceedings of the 14th ACM international conference on Information and knowledge management
Relation extraction using label propagation based semi-supervised learning

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Names and similarities on the web: fact extraction in the fast lane

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Preemptive information extraction using unrestricted relation discovery

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Data Mining and Predictive Modeling of Biomolecular Network from Biomedical Literature Databases

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Unsupervised relation disambiguation using spectral clustering

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
On-demand information extraction

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
HAL-based cascaded model for variable-length semantic pattern induction from psychiatry web resources

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Clustering for unsupervised relation identification

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Representing a web page as sets of named entities of multiple types: a model and some preliminary applications

Proceedings of the 17th international conference on World Wide Web
Relation discovery from web data for competency management

Web Intelligence and Agent Systems
Extracting Semantic Networks from Text Via Relational Clustering

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Self-supervised relation extraction from the Web

Knowledge and Information Systems
A method for extracting knowledge from medical texts including numerical representation

International Journal of Computer Applications in Technology
A Feature-Based Approach for Relation Extraction from Thai News Documents

PAISI '09 Proceedings of the Pacific Asia Workshop on Intelligence and Security Informatics
Label propagation via bootstrapped support vectors for semantic relation extraction between named entities

Computer Speech and Language
Improving Relation Extraction by Exploiting Properties of the Target Relation

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Mining and re-ranking for answering biographical queries on the web

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Organizing and searching the world wide web of facts - step one: the one-million fact extraction challenge

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Topic identification for fine-grained opinion analysis

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
A linguistic knowledge discovery tool: very large ngram database search with arbitrary wildcards

COLING '08 22nd International Conference on on Computational Linguistics: Demonstration Papers
Outclassing Wikipedia in open-domain information extraction: weakly-supervised acquisition of attributes over conceptual hierarchies

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Boosting unsupervised relation extraction by using NER

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Unsupervised information extraction approach using graph mutual reinforcement

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Unsupervised relation disambiguation with order identification capabilities

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Seeded discovery of base relations in large corpora

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Semi-supervised relation extraction with label propagation

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Turning web text and search queries into factual knowledge: hierarchical class attribute extraction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Unsupervised methods for determining object and relation synonyms on the web

Journal of Artificial Intelligence Research
Extracting keyphrases to represent relations in social networks from web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
What you seek is what you get: extraction of class attributes from query logs

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Discriminatively Modeling Commonality of Term Types for Extracting Relation from Small Corpora

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Cross-lingual predicate cluster acquisition to improve bilingual event extraction by inductive learning

UMSLLS '09 Proceedings of the Workshop on Unsupervised and Minimally Supervised Learning of Lexical Semantics
Comparison of similarity models for the relation discovery task

LD '06 Proceedings of the Workshop on Linguistic Distances
Unsupervised relation extraction by mining Wikipedia texts using information from the web

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Convolution kernels on constituent, dependency and sequential structures for relation extraction

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Semi-supervised learning for semantic relation classification using stratified sampling strategy

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Cost-effective web search in bootstrapping for named entity recognition

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Acquisition of instance attributes via labeled and related instances

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Unsupervised techniques for discovering ontology elements from Wikipedia article links

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
A hybrid approach to unsupervised relation discovery based on linguistic analysis and semantic typing

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Clustering-based stratified seed sampling for semi-supervised relation classification

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Ranking related entities: components and analyses

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Incorporating terminology evolution for query translation in text retrieval with association rules

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Bootstrapping location relations from text

Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47
Embellishing text search queries to protect user privacy

Proceedings of the VLDB Endowment
Recognizing relation expression between named entities based on inherent and context-dependent features of relational words

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Semi-supervised semantic pattern discovery with guidance from unsupervised pattern clusters

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
SITAC: discovering semantically identical temporally altering concepts in text archives

Proceedings of the 14th International Conference on Extending Database Technology
REACTOR: a framework for semantic relation extraction and tagging over enterprise data

Proceedings of the 20th international conference companion on World wide web
Event discovery in social media feeds

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
In-domain relation discovery with meta-constraints via posterior regularization

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
End-to-end relation extraction using distant supervision from external semantic repositories

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Developing Position Structure-Based Framework for Chinese Entity Relation Extraction

ACM Transactions on Asian Language Information Processing (TALIP)
Probabilistic matrix factorization leveraging contexts for unsupervised relation extraction

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Unsupervised relation extraction using dependency trees for automatic generation of multiple-choice questions

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
A combination of topic models with max-margin learning for relation detection

TextGraphs-6 Proceedings of TextGraphs-6: Graph-based Methods for Natural Language Processing
Filtering and clustering relations for unsupervised information extraction in open domain

Proceedings of the 20th ACM international conference on Information and knowledge management
Facilitating pattern discovery for relation extraction with semantic-signature-based clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
Self-supervised relation extraction from the web

ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
Discovering overlapping communities of named entities

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Learning non-taxonomical semantic relations from domain texts

Journal of Intelligent Information Systems
Datasets for generic relation extraction*

Natural Language Engineering
A generative model for unsupervised discovery of relations and argument classes from clinical texts

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Discovering relations between noun categories

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Structured relation discovery using generative models

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Discovering relations between named entities from a large raw corpus using tree similarity-based clustering

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Automatic relation extraction with model order selection and discriminative label identification

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
REPENTINO – a wide-scope gazetteer for entity recognition in portuguese

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Association link network: an incremental web resources link model for learning resources management

ICWL'11 Proceedings of the 10th international conference on Advances in Web-Based Learning
LINDEN: linking named entities with knowledge base via semantic knowledge

Proceedings of the 21st international conference on World Wide Web
Clustering techniques for open relation extraction

PhD '12 Proceedings of the on SIGMOD/PODS 2012 PhD Symposium
A domain-independent approach to finding related entities

Information Processing and Management: an International Journal
Editorial: Occupation inference through detection and classification of biographical activities

Data & Knowledge Engineering
Extracting information networks from the blogosphere

ACM Transactions on the Web (TWEB)
A weighting scheme for open information extraction

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
Unsupervised relation discovery with sense disambiguation

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Named entity disambiguation in streaming data

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Ensemble semantics for large-scale unsupervised relation extraction

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Automatic evaluation of relation extraction systems on large-scale

AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
Structural linguistics and unsupervised information extraction

AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
MOTIF-RE: motif-based hypernym/hyponym relation extraction from wikipedia links

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Discovering relations using matrix factorization methods

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Discovering the significant relations embedded in documents would be very useful not only for information retrieval but also for question answering and summarization. Prior methods for relation discovery, however, needed large annotated corpora which cost a great deal of time and effort. We propose an unsupervised method for relation discovery from large corpora. The key idea is clustering pairs of named entities according to the similarity of context words intervening between the named entities. Our experiments using one year of newspapers reveals not only that the relations among named entities could be detected with high recall and precision, but also that appropriate labels could be automatically provided for the relations.