Automatic acquisition of domain knowledge for Information Extraction

Authors:
Roman Yangarber;Ralph Grishman;Pasi Tapanainen;Silja Huttunen
Affiliations:
New York University;New York University;Conexor oy, Helsinki, Finland;University of Helsinki, Finland
Venue:
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Year:
2000

Citing 7
Cited 50

Learning dictionaries for information extraction by multi-level bootstrapping

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Unsupervised discovery of scenario-level patterns for Information Extraction

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A non-projective dependency parser

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
University of Massachusetts: MUC-4 test results and analysis

MUC4 '92 Proceedings of the 4th conference on Message understanding
Description of the UMass system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
The NYU system for MUC-6 or where's the syntax?

MUC6 '95 Proceedings of the 6th conference on Message understanding
Automatically generating extraction patterns from untagged text

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2

Induction of semantic classes from natural language text

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Information extraction for enhanced access to disease outbreak reports

Journal of Biomedical Informatics - Special issue: Sublanguage
Bootstrapping an ontology-based information extraction system

Intelligent exploration of the web
A portable method for acquiring information extraction patterns without annotated corpora

Natural Language Engineering
Unsupervised learning of soft patterns for generating definitions from online news

Proceedings of the 13th international conference on World Wide Web
LearningPinocchio: adaptive information extraction for real world applications

Natural Language Engineering
Inducing information extraction systems for new languages via cross-language projection

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Unsupervised learning of generalized names

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Using predicate-argument structures for information extraction

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Counter-training in discovery of semantic patterns

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Using HLT for acquiring, retrieving and publishing knowledge in AKT: position paper

HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Learning extraction patterns for subjective expressions

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
A survey for multi-document summarization

HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5
Adaptive information extraction

ACM Computing Surveys (CSUR)
Combining linguistic and statistical analysis to extract relations from web documents

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions

Journal of the American Society for Information Science and Technology
Hierarchical rule generalisation for speaker identification in fiction books

SAICSIT '06 Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
Experiments with interactive question-answering

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A semantic approach to IE pattern induction

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Incremental topic representations

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
How can information extraction ease formalizing treatment processes in clinical practice guidelines?

Artificial Intelligence in Medicine
Integrating pattern-based and distributional similarity methods for lexical entailment acquisition

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Real-time event extraction for infectious disease outbreaks

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Semantic information extraction from Tamil documents

International Journal of Metadata, Semantics and Ontologies
A high accuracy method for semi-supervised information extraction

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Exploiting subjectivity classification to improve information extraction

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
An analysis of bootstrapping for the recognition of temporal expressions

SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Comparing information extraction pattern models

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Improving semi-supervised acquisition of relation extraction patterns

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Learning domain-specific information extraction patterns from the Web

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Adaptive information extraction from text by rule induction and generalisation

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
A unified model of phrasal and sentential evidence for information extraction

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
An alignment-based approach to semi-supervised relation extraction including multiple arguments

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Generating image descriptions using dependency relational patterns

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Filtered ranking for bootstrapping in event extraction

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Semi-supervised semantic pattern discovery with guidance from unsupervised pattern clusters

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Self-adjusting bootstrapping

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Automatic rule learning exploiting morphological features for named entity recognition in Turkish

Journal of Information Science
Event discovery in social media feeds

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
In-domain relation discovery with meta-constraints via posterior regularization

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Template-based information extraction without the templates

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Peeling back the layers: detecting event role fillers in secondary contexts

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Can document selection help semi-supervised learning?: a case study on event extraction

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
A framework for the automatic extraction of rules from online text

RuleML'2011 Proceedings of the 5th international conference on Rule-based reasoning, programming, and applications
Information extraction and ontology model for a 'call for paper' manager

Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Mining the semantics of text via counter-training

EPIA'05 Proceedings of the 12th Portuguese conference on Progress in Artificial Intelligence
Ontology learning from text: A look back and into the future

ACM Computing Surveys (CSUR)
Bootstrapped training of event extraction classifiers

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Event linking: grounding event reference in a news archive

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Large-Scale learning of relation-extraction rules with distant supervision from the web

ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

In developing an Information Extraction (IE) system for a new class of events or relations, one of the major tasks is identifying the many ways in which these events or relations may be expressed in text. This has generally involved the manual analysis and, in some cases, the annotation of large quantities of text involving these events. This paper presents an alternative approach, based on an automatic discovery procedure, EXDISCO, which identifies a set of relevant documents and a set of event patterns from un-annotaled text, starting from a small set of "seed patterns." We evaluate EXDISCO by comparing the performance of discovered patterns against that of manually constructed systems on actual extraction tasks.