Information gathering in the World-Wide Web: the W3QL query language and the W3QS system

Authors:
David Konopnicki;Oded Shmueli
Affiliations:
Technion-Israel Institute of Technology;Technion-Israel Institute of Technology
Venue:
ACM Transactions on Database Systems (TODS)
Year:
1998

Citing 7
Cited 22

Reflections on NoteCards: seven issues for the next generation of hypermedia systems

Communications of the ACM
Expressing structural hypertext queries in graphlog

HYPERTEXT '89 Proceedings of the second annual ACM conference on Hypertext
A logical query language for hypertext systems

Hypertext: concepts, systems and applications
Queries on Structures in Hypertext

FODO '93 Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms
Querying and Updating the File

VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
W3QS: A Query System for the World-Wide Web

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
A Declarative Language for Querying and Restructuring the Web

RIDE '96 Proceedings of the 6th International Workshop on Research Issues in Data Engineering (RIDE '96) Interoperability of Nontraditional Database Systems

Just-in-time databases and the World-Wide Web

Proceedings of the seventh international conference on Information and knowledge management
The Web as a graph

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Prototype for wrapping and visualizing geo-referenced data in a distributed environment using XML technology

Proceedings of the 8th ACM international symposium on Advances in geographic information systems
Objects objects everywhere

Crossroads
Distributed Hypertext Resource Discovery Through Examples

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
A Model for Querying Annotated Documents

ADBIS '99 Proceedings of the Third East European Conference on Advances in Databases and Information Systems
WWW Exploration Queries

NGIT '99 Proceedings of the 4th International Workshop on Next Generation Information Technologies and Systems
Answering Cooperative Recursive Queries in Web Federated Databases

NGITS '02 Proceedings of the 5th International Workshop on Next Generation Information Technologies and Systems
An Example-Based Environment for Wrapper Generation

ER '00 Proceedings of the Workshops on Conceptual Modeling Approaches for E-Business and The World Wide Web and Conceptual Modeling: Conceptual Modeling for E-Business and the Web
Imposing Disjunctive Constraints on Inter-document Structure

DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
On Formulation of Disjunctive Coupling Queries in WHOWEDA

DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
ObjectGlobe: Ubiquitous query processing on the Internet

The VLDB Journal — The International Journal on Very Large Data Bases
Constraint-driven join processing in a web warehouse

Data & Knowledge Engineering
Capturing User Access Patterns in the Web for Data Mining

ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Formulating disjunctive coupling queries in a web warehouse

Data & Knowledge Engineering
Using weakly structured documents at the user-interface level to fill in a classical database

Advanced topics in database research vol. 1
A uniform framework for integration of information from the web

Information Systems - Special issue on web data integration
Query Processing and Optimization on the Web

Distributed and Parallel Databases
DEQUE: querying the deep web

Data & Knowledge Engineering
HW-STALKER: a machine learning-based system for transforming QURE-Pagelets to XML

Data & Knowledge Engineering
The web as a graph: measurements, models, and methods

COCOON'99 Proceedings of the 5th annual international conference on Computing and combinatorics
Supporting application development with structured queries in the cloud

Proceedings of the 2013 International Conference on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

The World Wide Web (WWW) is a fast growing global information resource. It contains an enormous amount of information and provides access to a variety of services. Since there is no central control and very few standards of information organization or service offering, searching for information and services is a widely recognized problem. To some degree this problem is solved by “search services,” also known as “indexers,” such as Lycos, AltaVista, Yahoo, and others. These sites employ search engines known as “robots” or “knowbots” that scan the network periodically and form text-based indices. These services are limited in certain important aspects. First, the structural information, namely, the organization of the document into parts pointing to each other, is usually lost. Second, one is limited by the kind of textual analysis provided by the “search service.” Third, search services are incapable of navigating “through” forms. Finally, one cannot prescribe a complex database-like search. We view the WWW as a huge database. We have designed a high-level SQL-like language called W3QL to support effective and flexible query processing, which addresses the structure and content of WWW nodes and their varied sorts of data. We have implemented a system called W3QS to execute W3QL queries. In W3QS, query results are declaratively specified and continuously maintained as views when desired. The current architecture of W3QS provides a server that enables users to pose queries as well as integrate their own data analysis tools. The system and its query language set a framework for the development of database-like tools over the WWW. A significant contribution of this article is in formalizing the WWW and query processing over it.