The hidden web, XML and the Semantic Web: scientific data management perspectives

Authors:
Fabian M. Suchanek;Aparna S. Varde;Richi Nayak;Pierre Senellart
Affiliations:
INRIA Saclay, Paris, France;Montclair State University, Montclair, NJ;Queensland University of Technology, Brisbane, Australia;Institut Té/lé/com/ Té/lé/com, Paristech/ CNRS LTCI, Paris, France
Venue:
Proceedings of the 14th International Conference on Extending Database Technology
Year:
2011

Citing 8
Cited 1

The Latest MML (Medical Markup Language) Version 2.3—XML-Based Standard for Medical Data Exchange/Storage

Journal of Medical Systems
Online mining of frequent query trees over XML data streams

Proceedings of the 15th international conference on World Wide Web
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Fast and effective clustering of XML data using structural information

Knowledge and Information Systems
Automatic wrapper induction from hidden-web sources with domain knowledge

Proceedings of the 10th ACM workshop on Web information and data management
Knowledge Discovery over the Deep Web, Semantic Web and XML

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Handbook on Ontologies

Handbook on Ontologies
DBpedia: a nucleus for a web of open data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference

OntoPart: at the cross-roads of ontology partitioning and scalable ontology alignment systems

International Journal of Metadata, Semantics and Ontologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.