Components for information extraction: ontology-based information extractors and generic platforms

Authors:
Daya C. Wimalasuriya;Dejing Dou
Affiliations:
University of Oregon, Eugene, OR, USA;University of Oregon, Eugene, OR, USA
Venue:
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Year:
2010

Citing 18
Cited 5

The Z notation: a reference manual

The Z notation: a reference manual
Knowledge engineering: principles and methods

Data & Knowledge Engineering - Special jubilee issue: DKE 25
Information extraction as a stepping stone toward story understanding

Understanding language understanding
Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Artificial Intelligence: A Modern Approach

Artificial Intelligence: A Modern Approach
Bootstrapping an ontology-based information extraction system

Intelligent exploration of the web
Towards the self-annotating web

Proceedings of the 13th international conference on World Wide Web
Toward semantic understanding: an approach based on information extraction ontologies

ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
KIM – a semantic platform for information extraction and retrieval

Natural Language Engineering
Reference reconciliation in complex information spaces

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Does ontology help in image retrieval?: a comparison between keyword, text ontology and multi-modality ontology approaches

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Information extraction from Wikipedia: moving down the long tail

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Falcon-AO: A practical ontology matching system

Web Semantics: Science, Services and Agents on the World Wide Web
Natural Language Processing as a Foundation of the Semantic Web

Foundations and Trends in Web Science
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Using multiple ontologies in information extraction

Proceedings of the 18th ACM conference on Information and knowledge management
ontoX - a method for ontology-driven information extraction

ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III
Ontology-based information extraction: An introduction and a survey of current approaches

Journal of Information Science

Constructing efficient information extraction pipelines

Proceedings of the 20th ACM international conference on Information and knowledge management
Using information extractors with the neural electromagnetic ontologies

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems
Providing grades and feedback for student summaries by ontology-based information extraction

Proceedings of the 21st ACM international conference on Information and knowledge management
Two Rule-Based Natural Language Strategies for Requirements Discovery and Classification in Open Source Software Development Projects

Journal of Management Information Systems
Building optimal information systems automatically: configuration space exploration for biomedical information systems

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Information Extraction (IE) has existed as a field for several decades and has produced some impressive systems in the recent past. Despite its success, widespread usage and commercialization remain elusive goals for this field. We identify the lack of effective mechanisms for reuse as one major reason behind this situation. Here, we mean not only the reuse of the same IE technique in different situations but also the reuse of information related to the application of IE techniques (e.g., features used for classification). We have developed a comprehensive component-based approach for information extraction that promotes reuse to address this situation. We designed this approach starting from our previous work on the use of multiple ontologies in information extraction [24]. The key ideas of our approach are "information extractors," which are components of an IE system that make extractions with respect to particular components of an ontology and "platforms for IE," which are domain and corpus independent implementations of IE techniques. A case study has shown that this component-based approach can be successfully applied in practical situations.