An ontology-driven annotation of data tables

Authors:
Gaëlle Hignette;Patrice Buche;Juliette Dibie-Barthélemy;Ollivier Haemmerlé
Affiliations:
UMR, AgroParisTech, INRA, MIA, INRA Unité Mét@risk, Paris Cedex 5, France;UMR, AgroParisTech, INRA, MIA, INRA Unité Mét@risk, Paris Cedex 5, France;UMR, AgroParisTech, INRA, MIA, INRA Unité Mét@risk, Paris Cedex 5, France;IRIT, Université Toulouse le Mirail, Dpt. Mathématiques-Informatique, Toulouse Cedex
Venue:
WISE'07 Proceedings of the 2007 international conference on Web information systems engineering
Year:
2007

Citing 9
Cited 2

Fast training of support vector machines using sequential minimal optimization

Advances in kernel methods
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Visual Web Information Extraction with Lixto

Proceedings of the 27th International Conference on Very Large Data Bases
Boosted Wrapper Induction

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Automatically Extracting Ontologically Specified Data from HTML Tables of Unknown Structure

ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
A survey of table recognition: Models, observations, transformations, and inferences

International Journal on Document Analysis and Recognition
Unsupervised learning of generalized names

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Instantiation of Relations for Semantic Annotation

WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Fuzzy querying of incomplete, imprecise, and heterogeneously structured data in the relational model using ontologies and rules

IEEE Transactions on Fuzzy Systems

SCOVO: Using Statistics on the Web of Data

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
An ontological and terminological resource for n-ary relation annotation in web data tables

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper deals with the integration of data extracted from the web into an existing data warehouse indexed by a domain ontology. We are specially interested in data tables extracted from scientific publications found on the web. We propose a way to annotate data tables from the web according to a given domain ontology. In this paper we present the different steps of our annotation process. The columns of a web data table are first segregated according to whether they represent numeric or symbolic data. Then, we annotate the numeric (resp.symbolic) columns with their corresponding numeric (resp. symbolic) type found in the ontology. Our approach combines different evidences from the column contents and from the column title to find the best corresponding type in the ontology. The relations represented by the web data table are recognized using both the table title and the types of the columns that were previously annotated. We give experimental results of our annotation process, our application domain being food microbiology.