Combining Data Integration and IE Techniques to Support Partially Structured Data

Authors:
Dean Williams;Alexandra Poulovassilis
Affiliations:
School of Computer Science and Information Systems, Birkbeck, University of London, London, UK WC1E 7HX;School of Computer Science and Information Systems, Birkbeck, University of London, London, UK WC1E 7HX
Venue:
NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Year:
2008

Citing 6
Cited 0

Duplicate record elimination in large data files

ACM Transactions on Database Systems (TODS)
Weaving the Web; The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor (2 Cassettes)

Weaving the Web; The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor (2 Cassettes)
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem

Data Mining and Knowledge Discovery
A survey of approaches to automatic schema matching

The VLDB Journal — The International Journal on Very Large Data Bases
KIM – a semantic platform for information extraction and retrieval

Natural Language Engineering
Coreference for NLP applications

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

A class of applications exists where the information to be stored is partially structured:that is, it consists partly of some structured data sources each conforming to a schema and partly of information left as free text. While investigating the requirements for querying partially structured data, we have encountered several limitations in the currently available approaches and we describe here three new techniques which combine aspects of Information Extraction with data integration in order to better exploit the data in these applications.