Snowball: extracting relations from large plain-text collections

Authors:
Eugene Agichtein;Luis Gravano
Affiliations:
Department of Computer Science, Columbia University, 1214 Amsterdam Avenue, New York, NY;Department of Computer Science, Columbia University, 1214 Amsterdam Avenue, New York, NY
Venue:
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Year:
2000

Citing 13
Cited 284

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
Information retrieval: data structures and algorithms

Information retrieval: data structures and algorithms
Integration of heterogeneous databases without common domains using queries based on textual similarity

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Combining labeled and unlabeled data with co-training

COLT '98 Proceedings of the eleventh annual conference on Computational learning theory
Learning dictionaries for information extraction by multi-level bootstrapping

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference
Mining the Web for acronyms using the duality of patterns and relations

Proceedings of the 2nd international workshop on Web information and data management
Learning to construct knowledge bases from the World Wide Web

Artificial Intelligence - Special issue on Intelligent internet systems
Information Extraction: Techniques and Challenges

SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
Extracting Patterns and Relations from the World Wide Web

WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Mixed-initiative development of language processing systems

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Description of the UMass system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
Automatically generating extraction patterns from untagged text

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2

Snowball: a prototype system for extracting relations from large text collections

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
QProber: A system for automatic classification of hidden-Web databases

ACM Transactions on Information Systems (TOIS)
Automatic thesaurus generation for Chinese documents

Journal of the American Society for Information Science and Technology
Ontology extraction and conceptual modeling for web information

Information modeling for internet applications
Learning Rules for Conceptual Structure on the Web

Journal of Intelligent Information Systems
A portable method for acquiring information extraction patterns without annotated corpora

Natural Language Engineering
Unsupervised learning of soft patterns for generating definitions from online news

Proceedings of the 13th international conference on World Wide Web
Web-scale information extraction in knowitall: (preliminary results)

Proceedings of the 13th international conference on World Wide Web
Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Ontology-Based Scalable and Portable Information Extraction System to Extract Biological Knowledge from Huge Collection of Biomedical Web Documents

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Acquisition of categorized named entities for web search

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Weakly-supervised relation classification for information extraction

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Querying web metadata: Native score management and text support in databases

ACM Transactions on Database Systems (TODS)
Unsupervised named-entity extraction from the web: an experimental study

Artificial Intelligence
Predicting accuracy of extracting information from unstructured text collections

Proceedings of the 14th ACM international conference on Information and knowledge management
Hot Item Mining and Summarization from Multiple Auction Web Sites

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Extracting pronunciation-translated names from Chinese texts using bootstrapping approach

SIGHAN '02 Proceedings of the first SIGHAN workshop on Chinese language processing - Volume 18
Building automatically a business registration ontology

dg.o '02 Proceedings of the 2002 annual national conference on Digital government research
Adaptive information extraction

ACM Computing Surveys (CSUR)
To search or to crawl?: towards a query optimizer for text-centric tasks

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Combining linguistic and statistical analysis to extract relations from web documents

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering relations among named entities from large corpora

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Dependency tree kernels for relation extraction

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Classifying semantic relations in bioscience texts

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Exploring various knowledge in relation extraction

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Multi-field information extraction and cross-document fusion

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Relation extraction using label propagation based semi-supervised learning

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Expressing implicit semantic relations without supervision

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Names and similarities on the web: fact extraction in the fast lane

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A bootstrapping approach to unsupervised detection of cue phrase variants

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Multi-way relation classification: application to protein-protein interactions

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Integrating probabilistic extraction models and data mining to discover relations and patterns in text

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Preemptive information extraction using unrestricted relation discovery

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
How can information extraction ease formalizing treatment processes in clinical practice guidelines?

Artificial Intelligence in Medicine
An exploration of the principles underlying redundancy-based factoid question answering

ACM Transactions on Information Systems (TOIS)
Soft pattern matching models for definitional question answering

ACM Transactions on Information Systems (TOIS)
Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia

Data & Knowledge Engineering
Towards domain-independent information extraction from web tables

Proceedings of the 16th international conference on World Wide Web
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
A rote extractor with edit distance-based generalisation and multi-corpora precision calculation

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Unsupervised relation disambiguation using spectral clustering

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
URES: an unsupervised web relation extraction system

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
On-demand information extraction

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
A redundancy-based method for the extraction of relation instances from the Web

International Journal of Human-Computer Studies
Towards a query optimizer for text-centric tasks

ACM Transactions on Database Systems (TODS)
Enabling more sophisticated gene expression analysis for understanding diseases and optimizing treatments

ACM SIGKDD Explorations Newsletter - Special issue on data mining for health informatics
The role of documents vs. queries in extracting class attributes from text

Proceedings of the sixteenth ACM conference on information and knowledge management
A relational approach to incrementally extracting and querying structure in unstructured data

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Mining relational data from text: From strictly supervised to weakly supervised learning

Information Systems
Discovering semantic biomedical relations utilizing the Web

ACM Transactions on Knowledge Discovery from Data (TKDD)
Classification-aware hidden-web text database selection

ACM Transactions on Information Systems (TOIS)
Collective knowledge systems: Where the Social Web meets the Semantic Web

Web Semantics: Science, Services and Agents on the World Wide Web
A stopping criterion for active learning

Computer Speech and Language
Flint: Google-basing the Web

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Mining and analysing scale-free protein-protein interaction network

International Journal of Bioinformatics Research and Applications
Pattern-based automatic taxonomy learning from the Web

AI Communications
Improving the performance of question answering with semantically equivalent answer patterns

Data & Knowledge Engineering
Relation discovery from web data for competency management

Web Intelligence and Agent Systems
Information extraction from Wikipedia: moving down the long tail

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Open information extraction from the web

Communications of the ACM - Surviving the data deluge
YAGO: A Large Ontology from Wikipedia and WordNet

Web Semantics: Science, Services and Agents on the World Wide Web
Ontology-driven, unsupervised instance population

Web Semantics: Science, Services and Agents on the World Wide Web
Text Retrieval Oriented Auto-construction of Conceptual Relationship

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Self-supervised relation extraction from the Web

Knowledge and Information Systems
Using structured text for large-scale attribute extraction

Proceedings of the 17th ACM conference on Information and knowledge management
Supporting the automatic construction of entity aware search engines

Proceedings of the 10th ACM workshop on Web information and data management
Towards a System for Ontology-Based Information Extraction from PDF Documents

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
Information Extraction

Foundations and Trends in Databases
Semantic relation extraction from socially-generated tags: a methodology for metadata generation

DCMI '08 Proceedings of the 2008 International Conference on Dublin Core and Metadata Applications
A quality-aware optimizer for information extraction

ACM Transactions on Database Systems (TODS)
Building query optimizers for information extraction: the SQoUT project

ACM SIGMOD Record
Automatic Extraction of the Fine Category of Person Named Entities from Text Corpora

IEICE - Transactions on Information and Systems
StatSnowball: a statistical approach to extracting entity relationships

Proceedings of the 18th international conference on World wide web
SOFIE: a self-organizing framework for information extraction

Proceedings of the 18th international conference on World wide web
Low-Cost Supervision for Multiple-Source Attribute Extraction

CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
An Integrated Approach for Concept Learning and Relation Extraction

ICCPOL '09 Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy
Building a Graph of Names and Contextual Patterns for Named Entity Classification

ECIR '09 Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval
Label propagation via bootstrapped support vectors for semantic relation extraction between named entities

Computer Speech and Language
Improving Relation Extraction by Exploiting Properties of the Target Relation

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Information Extraction and Semantic Annotation of Wikipedia

Proceedings of the 2008 conference on Ontology Learning and Population: Bridging the Gap between Text and Knowledge
Task Driven Coreference Resolution for Relation Extraction

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Assessing the correlation between contextual patterns and biological entity tagging

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
The semantics of a definiendum constrains both the lexical semantics and the lexicosyntactic patterns in the definiens

BioNLP '06 Proceedings of the Workshop on Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis
Learning to rank for quantity consensus queries

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Investigation of unsupervised pattern learning techniques for bootstrap construction of a medical treatment lexicon

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Optimization of Feature-Opinion Pairs in Chinese Customer Reviews

IEA/AIE '09 Proceedings of the 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: Next-Generation Applied Intelligence
A context pattern induction method for named entity extraction

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Methods for domain-independent information extraction from the web: an experimental comparison

AAAI'04 Proceedings of the 19th national conference on Artificial intelligence
Organizing and searching the world wide web of facts - step one: the one-million fact extraction challenge

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 2
Machine reading

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 2
Instance-based ontology population exploiting named-entity substitution

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Structural, transitive and latent models for biographic fact extraction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Boosting unsupervised relation extraction by using NER

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Entity annotation based on inverse index operations

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Unsupervised information extraction approach using graph mutual reinforcement

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Unsupervised relation disambiguation with order identification capabilities

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Learning field compatibilities to extract database records from unstructured text

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Semi-supervised relation extraction with label propagation

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Subtree mining for relation extraction from Wikipedia

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Harvesting relations from the web: quantifying the impact of filtering functions

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Relation extraction from wikipedia using subtree mining

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
May all your wishes come true: a study of wishes and how to recognize them

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Coupling semi-supervised learning of categories and relations

SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Surrogate learning: from feature independence to semi-supervised classification

SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Relation detection between named entities: report of a shared task

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
What you seek is what you get: extraction of class attributes from query logs

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Feature generation and representations for protein-protein interaction classification

Journal of Biomedical Informatics
Recognizing entailment in intelligent tutoring systems

Natural Language Engineering
Discriminatively Modeling Commonality of Term Types for Extracting Relation from Small Corpora

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
A probabilistic model of redundancy in information extraction

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Query by analogical example: relational search using web search engine indices

Proceedings of the 18th ACM conference on Information and knowledge management
Semi-supervised learning of semantic classes for query understanding: from the web and for the web

Proceedings of the 18th ACM conference on Information and knowledge management
Identifying comparable entities on the web

Proceedings of the 18th ACM conference on Information and knowledge management
Extracting position relations from the web

Proceedings of the eleventh international workshop on Web information and data management
The semantics of a definiendum constrains both the lexical semantics and the lexicosyntactic patterns in the definiens

LNLBioNLP '06 Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology
Distant supervision for relation extraction without labeled data

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2
Conceptual Indexing of Text Using Ontologies and Lexical Resources

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Extracting Enterprise Vocabularies Using Linked Open Data

ISWC '09 Proceedings of the 8th International Semantic Web Conference
Generalized expectation criteria for bootstrapping extractors using record-text alignment

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1
Geo-mining: discovery of road and transport networks using directional patterns

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1
Convolution kernels on constituent, dependency and sequential structures for relation extraction

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3
Semi-supervised learning for semantic relation classification using stratified sampling strategy

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3
Character-level analysis of semi-structured documents for set expansion

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3
Coupled semi-supervised learning for information extraction

Proceedings of the third ACM international conference on Web search and data mining
Answer formulation for question-answering

AI'03 Proceedings of the 16th Canadian society for computational studies of intelligence conference on Advances in artificial intelligence
Towards rich query interpretation: walking back and forth for mining query templates

Proceedings of the 19th international conference on World wide web
Relational duality: unsupervised extraction of semantic relations between entities on the web

Proceedings of the 19th international conference on World wide web
A scalable machine-learning approach for semi-structured named entity recognition

Proceedings of the 19th international conference on World wide web
Combining relations for information extraction from free text

ACM Transactions on Information Systems (TOIS)
PORE: positive-only relation extraction from wikipedia text

ISWC'07/ASWC'07 Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference
Graph mutual reinforcement based bootstrapping

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
An alignment-based approach to semi-supervised relation extraction including multiple arguments

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Multi-class named entity recognition via bootstrapping with dependency tree-based patterns

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Pattern-based semantic tagging for ontology population

SOCASE'08 Proceedings of the 2008 AAMAS international conference on Service-oriented computing: agents, semantics, and engineering
I4E: interactive investigation of iterative information extraction

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Analysis of a probabilistic model of redundancy in unsupervised information extraction

Artificial Intelligence
Automatic construction of a large-scale situation ontology by mining how-to instructions from the web

Web Semantics: Science, Services and Agents on the World Wide Web
BioSnowball: automated population of Wikis

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Open information extraction using Wikipedia

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Find your advisor: robust knowledge gathering from the web

Proceedings of the 13th International Workshop on the Web and Databases
Popularity-guided top-k extraction of entity attributes

Proceedings of the 13th International Workshop on the Web and Databases
Large scale relation detection

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Semantic role labeling for open information extraction

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Empirical studies in learning to read

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Clustering-based stratified seed sampling for semi-supervised relation classification

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Multi-modal multi-correlation person-centric news retrieval

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Entity-relationship queries over wikipedia

SMUC '10 Proceedings of the 2nd international workshop on Search and mining user-generated contents
FactRank: random walks on a web of facts

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Self-supervised mining of human activity from CGM

PKAW'10 Proceedings of the 11th international conference on Knowledge management and acquisition for smart systems and services
A two-view cotraining rule induction system for information extraction

ICIC'06 Proceedings of the 2006 international conference on Intelligent computing: Part II
Using text to build semantic networks for pharmacogenomics

Journal of Biomedical Informatics
A framework for corroborating answers from multiple web sources

Information Systems
Human activity mining using conditional random fields and self-supervised learning

ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part I
Exploiting content redundancy for web information extraction

Proceedings of the VLDB Endowment
Automatic rule refinement for information extraction

Proceedings of the VLDB Endowment
Dynamic relationship and event discovery

Proceedings of the fourth ACM international conference on Web search and data mining
Joint training for open-domain extraction on the web: exploiting overlap when supervision is limited

Proceedings of the fourth ACM international conference on Web search and data mining
Scalable knowledge harvesting with high precision and high recall

Proceedings of the fourth ACM international conference on Web search and data mining
Materializing multi-relational databases from the web using taxonomic queries

Proceedings of the fourth ACM international conference on Web search and data mining
Searching patterns for relation extraction over the web: rediscovering the pattern-relation duality

Proceedings of the fourth ACM international conference on Web search and data mining
Recognizing relation expression between named entities based on inherent and context-dependent features of relational words

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Semi-supervised semantic pattern discovery with guidance from unsupervised pattern clusters

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Learning web query patterns for imitating Wikipedia articles

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Boosting relation extraction with limited closed-world knowledge

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Information extraction from Wikipedia using pattern learning

Acta Cybernetica
Capturing users' buying activity at Akihabara electric town from twitter

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part II
SEISA: set expansion by iterative similarity aggregation

Proceedings of the 20th international conference on World wide web
FACTO: a fact lookup engine based on web tables

Proceedings of the 20th international conference on World wide web
Learning relation extraction grammars with minimal human intervention: strategy, results, insights and plans

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Using graph based method to improve bootstrapping relation extraction

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
A hybrid approach for the extraction of semantic relations from MEDLINE abstracts

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Self-adjusting bootstrapping

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Extracting XML data from the web

Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
A spoken question answering system based on conditional knowledge

ICCOMP'10 Proceedings of the 14th WSEAS international conference on Computers: part of the 14th WSEAS CSCC multiconference - Volume I
Taxonomy induction based on a collaboratively built knowledge repository

Artificial Intelligence
Entity set expansion in opinion documents

Proceedings of the 22nd ACM conference on Hypertext and hypermedia
An analysis of open information extraction based on semantic role labeling

Proceedings of the sixth international conference on Knowledge capture
Ontology population and enrichment: state of the art

Knowledge-driven multimedia information extraction and ontology evolution
Event discovery in social media feeds

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
In-domain relation discovery with meta-constraints via posterior regularization

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Insights from network structure for text mining

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Can document selection help semi-supervised learning?: a case study on event extraction

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
End-to-end relation extraction using distant supervision from external semantic repositories

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Coreference for learning to extract relations: yes, Virginia, coreference matters

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Detecting emotions in social affective situations using the EmotiNet knowledge base

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
Unsupervised relation extraction using dependency trees for automatic generation of multiple-choice questions

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
EmotiNet: a knowledge base for emotion detection in text built on the appraisal theories

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Extracting conceptual feature structures from text

ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
Introduction to linked data and its lifecycle on the web

RW'11 Proceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data
Acquiring knowledge about human goals from Search Query Logs

Information Processing and Management: an International Journal
SCMS: semantifying content management systems

ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part II
Max margin learning on domain-independent web information extraction

Proceedings of the 20th ACM international conference on Information and knowledge management
Filtering and clustering relations for unsupervised information extraction in open domain

Proceedings of the 20th ACM international conference on Information and knowledge management
Facilitating pattern discovery for relation extraction with semantic-signature-based clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
Building a generic debugger for information extraction pipelines

Proceedings of the 20th ACM international conference on Information and knowledge management
OpinioNetIt: understanding the opinions-people network for politically controversial topics

Proceedings of the 20th ACM international conference on Information and knowledge management
Self-supervised relation extraction from the web

ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
An up-to-date knowledge-based literature search and exploration framework for focused bioscience domains

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Automatic acquisition of semantic-based question reformulations for question answering

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Harmony and dissonance: organizing the people's voices on political controversies

Proceedings of the fifth ACM international conference on Web search and data mining
Self-supervised capturing of users' activities from weblogs

International Journal of Intelligent Information and Database Systems
Ontology-driven information extraction with OntoSyphon

ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Extracting relations in social networks from the web using similarity between collective contexts

ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Extracting information from short messages

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Datasets for generic relation extraction*

Natural Language Engineering
A generative model for unsupervised discovery of relations and argument classes from clinical texts

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Random walk inference and learning in a large scale knowledge base

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Relation acquisition using word classes and partial patterns

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Extreme extraction: machine reading in a week

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Discovering relations between noun categories

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Discovering relations between named entities from a large raw corpus using tree similarity-based clustering

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Automatic relation extraction with model order selection and discriminative label identification

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Mining inter-entity semantic relations using improved transductive learning

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Event-Driven document selection for terrorism information extraction

ISI'05 Proceedings of the 2005 IEEE international conference on Intelligence and Security Informatics
Using semantic constraints to improve question answering

NLDB'06 Proceedings of the 11th international conference on Applications of Natural Language to Information Systems
Discovering a term taxonomy from term similarities using principal component analysis

EWMF'05/KDO'05 Proceedings of the 2005 joint international conference on Semantics, Web and Mining
Analysis and improvement of minimally supervised machine learning for relation extraction

NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
Multi-view bootstrapping for relation extraction by exploring web features and linguistic features

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Leveraging different meronym discovery methods for bridging resolution in French

DAARC'11 Proceedings of the 8th international conference on Anaphora Processing and Applications
The HiLeX system for semantic information extraction

Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Organizational search in email systems

Proceedings of the 50th Annual Southeast Regional Conference
REV: extracting entity relations from world wide web

Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
Learning causality for news events prediction

Proceedings of the 21st international conference on World Wide Web
Minimally supervised domain-adaptive parse reranking for relation extraction

IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
A conversation with Professor Bo Zhang

ACM SIGKDD Explorations Newsletter
Clustering techniques for open relation extraction

PhD '12 Proceedings of the SIGMOD/PODS 2012 PhD Symposium
A domain-independent approach to finding related entities

Information Processing and Management: an International Journal
Entity-Relationship Queries over Wikipedia

ACM Transactions on Intelligent Systems and Technology (TIST)
Extracting information networks from the blogosphere

ACM Transactions on the Web (TWEB)
Using information extraction to generate trigger questions for academic writing support

ITS'12 Proceedings of the 11th international conference on Intelligent Tutoring Systems
Dependency trigram model for social relation extraction from news articles

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Detecting implicit expressions of emotion in text: A comparative analysis

Decision Support Systems
A semi-supervised approach to extracting multiword entity names from user reviews

Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search
User-driven relational models for entity-relation search and extraction

Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search
RevMiner: an extractive interface for navigating reviews on a smartphone

Proceedings of the 25th annual ACM symposium on User interface software and technology
Bootstrapping events and relations from text

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Measuring the use of factual information in test-taker essays

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Bootstrapping via graph propagation

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A graph-based cross-lingual projection approach for weakly supervised relation extraction

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Open language learning for information extraction

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Reading the web with learned syntactic-semantic inference rules

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
A new minimally-supervised framework for domain word sense disambiguation

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Finding small molecule and protein pairs in scientific literature using a bootstrapping method

BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
Automatic evaluation of relation extraction systems on large-scale

AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
Identifying untyped relation mentions in a corpus given an ontology

TextGraphs-7 '12 Workshop Proceedings of TextGraphs-7 on Graph-based Methods for Natural Language Processing
The bootstrapping based recognition of conceptual relationship for text retrieval

NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Enhancing relation extraction by eliciting selectional constraint features from Wikipedia

NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
A co-training based method for Chinese patent semantic annotation

Proceedings of the 21st ACM international conference on Information and knowledge management
Web 2.0, Language Resources and standards to automatically build a multilingual Named Entity Lexicon

Language Resources and Evaluation
Social relation extraction from texts using a support-vector-machine-based dependency trigram kernel

Information Processing and Management: an International Journal
Extraction of semantic relation based on feature vector from Wikipedia

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Coupled Bayesian sets algorithm for semi-supervised learning and information extraction

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Tuple refinement method based on relationship keyword extension

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Ontology-Based information and event extraction for business intelligence

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
A relation extraction method of Chinese named entities based on location and semantic features

Applied Intelligence
Extending enterprise service design knowledge using clustering

ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
Large-Scale learning of relation-extraction rules with distant supervision from the web

ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I
DeFacto - deep fact validation

ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I
An evidence-based verification approach to extract entities and relations for knowledge base population

ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I
WebPut: efficient web-based data imputation

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
Learning to predict from textual data

Journal of Artificial Intelligence Research
Extraction, evaluation and integration of lexical-semantic relations for the automated construction of a lexical ontology

AOW '07 Proceedings of the Third Australasian Workshop on Advances in Ontologies - Volume 85
Minimally-supervised extraction of domain-specific part-whole relations using Wikipedia as knowledge-base

Data & Knowledge Engineering
Travel with Words: An Innovative Vision on Travelling

WI-IAT '12 Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Knowledge harvesting in the big-data era

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Provenance-based dictionary refinement in information extraction

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Discovering unexpected information on the basis of popularity/unpopularity analysis of coordinate objects and their relationships

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Autonomously reviewing and validating the knowledge base of a never-ending learning system

Proceedings of the 22nd international conference on World Wide Web companion
SEED: a framework for extracting social events from press news

Proceedings of the 22nd international conference on World Wide Web companion
Assessing sparse information extraction using semantic contexts

Proceedings of the 22nd ACM international conference on information & knowledge management
Learning open-domain comparable entity graphs from user search queries

Proceedings of the 22nd ACM international conference on information & knowledge management
A semi-supervised approach to extract pharmacogenomics-specific drug-gene pairs from biomedical literature for personalized medicine

Journal of Biomedical Informatics
Extracting meronyms for a biology knowledge base using distant supervision

Proceedings of the 2013 workshop on Automated knowledge base construction
Aggregated search: A new information retrieval paradigm

ACM Computing Surveys (CSUR)
Introduction to linked data and its lifecycle on the web

RW'13 Proceedings of the 9th international conference on Reasoning Web: semantic technologies for intelligent data access
Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction

ACM Transactions on Asian Language Information Processing (TALIP)
Extraction and integration of partially overlapping web sources

Proceedings of the VLDB Endowment
Editorial: Detecting implicit expressions of affect in text using EmotiNet and its extensions

Data & Knowledge Engineering
Editorial: Minimally-supervised learning of domain-specific causal relations using an open-domain corpus as knowledge base

Data & Knowledge Engineering
Semi-automatic construction of domain ontology for agent reasoning

Personal and Ubiquitous Computing
A structural approach to extracting Chinese position relations from web pages

Journal of Web Engineering

Quantified Score

Hi-index	0.02

Abstract

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use for answering precise queries or running data mining tasks. We explore a technique for extracting such tables from document collections that requires only a handful of training examples from users. These examples are used to generate extraction patterns that in turn result in new tuples being extracted from the document collection. We build on this idea and present our Snowball system. Snowball introduces novel strategies for generating patterns and extracting tuples from plain-text documents. At each iteration of the extraction process, Snowball evaluates the quality of these patterns and tuples without human intervention and keeps only the most reliable ones for the next iteration. In this paper we also develop a scalable evaluation methodology and metrics for our task, and present a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.
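
The iterative process described in the abstract (seed examples generate patterns, patterns extract new tuples, and only reliable patterns survive to the next round) can be illustrated with a small Python sketch. This is not the Snowball implementation. It is a toy under stated assumptions: entities are single capitalized words found by a regular expression, a pattern is just the literal text between the two entities, each organization has exactly one location, and a pattern's confidence is the share of its matches that agree with already known tuples. All function names, the threshold, and the sample sentences are hypothetical.

```python
import re

# Assumption: a toy stand-in for a named-entity tagger (one capitalized word).
ENTITY = r"([A-Z][a-z]+)"


def generate_patterns(sentences, known):
    """Collect the literal text between each known (org, loc) pair."""
    patterns = set()
    for sentence in sentences:
        for org, loc in known.items():
            m = re.search(re.escape(org) + r"(.+?)" + re.escape(loc), sentence)
            if m:
                patterns.add(m.group(1))
    return patterns


def apply_pattern(pattern, sentences):
    """Return every (left entity, right entity) pair joined by the pattern."""
    regex = re.compile(ENTITY + re.escape(pattern) + ENTITY)
    return [m.groups() for s in sentences for m in regex.finditer(s)]


def pattern_confidence(matches, known):
    """Share of matches on known organizations that agree with known tuples."""
    positive = sum(1 for org, loc in matches if known.get(org) == loc)
    negative = sum(1 for org, loc in matches if org in known and known[org] != loc)
    total = positive + negative
    return positive / total if total else 0.0


def bootstrap(sentences, seeds, threshold=0.8, iterations=2):
    """Grow the tuple table, keeping only patterns above the threshold."""
    known = dict(seeds)
    for _ in range(iterations):
        new_tuples = {}
        for pattern in generate_patterns(sentences, known):
            matches = apply_pattern(pattern, sentences)
            if pattern_confidence(matches, known) >= threshold:
                for org, loc in matches:
                    if org not in known:
                        new_tuples[org] = loc
        if not new_tuples:
            break
        known.update(new_tuples)
    return known


sentences = [
    "Microsoft is headquartered in Redmond.",
    "Boeing is headquartered in Seattle.",
    "Exxon praised Irving.",
    "Exxon praised Boeing.",
]
seeds = {"Microsoft": "Redmond", "Exxon": "Irving"}
result = bootstrap(sentences, seeds)
print(sorted(result.items()))
```

In this toy corpus the pattern " is headquartered in " agrees with every known tuple it touches and is kept, so (Boeing, Seattle) is added. The pattern " praised " contradicts a known tuple in half of its matches, falls below the threshold, and is discarded, so the spurious pair (Exxon, Boeing) never enters the table.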