Automatic Information Discovery from the "Invisible Web"

Authors:
Affiliations:
Venue: ITCC '02 Proceedings of the International Conference on Information Technology: Coding and Computing
Year: 2002

Cited by (14):

Query-related data extraction of hidden web documents
  Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval

A two-phase sampling technique for information extraction from hidden web databases
  Proceedings of the 6th annual ACM international workshop on Web information and data management

SmartCrawl: a new strategy for the exploration of the hidden web
  Proceedings of the 6th annual ACM international workshop on Web information and data management

Discover the semantic topology in high-dimensional data
  Expert Systems with Applications: An International Journal

Sampling, information extraction and summarisation of hidden web databases
  Data & Knowledge Engineering - Special issue: WIDM 2004

Retrieval for decision support resources by structured models
  Decision Support Systems

CCReSD: concept-based categorisation of Hidden Web databases
  International Journal of High Performance Computing and Networking

Automatic Hidden Web Database Classification
  PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases

Facilitating discovery on the private web using dataset digests
  Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services

Shopping search engine technology based on services
  ACS'08 Proceedings of the 8th conference on Applied computer science

A simplicial complex, a hypergraph, structure in the latent semantic space of document clustering
  International Journal of Approximate Reasoning

Facilitating discovery on the private web using dataset digests
  International Journal of Metadata, Semantics and Ontologies

Federated Search
  Foundations and Trends in Information Retrieval

A TNATS approach to hidden web documents
  ICDCIT'04 Proceedings of the First international conference on Distributed Computing and Internet Technology

Abstract

A large amount of online information resides on the invisible web: web pages generated dynamically from databases and other data sources hidden from the user. These pages are not indexed under a static URL but are generated when queries are submitted through a search interface (we refer to such interfaces as specialized search engines). In this paper we propose a system that can automatically use these specialized engines to find information on the invisible web. We describe our overall architecture and process, from obtaining the search engines to picking the right engines to query. Experiments show that we can find information that is not found by traditional search engines.
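
The abstract does not say how the system picks which engines to query, so the following is only an illustrative sketch of what an engine-selection step could look like, not the authors' method. It assumes each specialized engine is described by a small set of topic terms and ranks engines by the fraction of query terms they cover. The names Engine, score, pick_engines, the example engine names, and the overlap scoring rule are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Engine:
    """A specialized search engine, described by hypothetical topic terms."""
    name: str
    topic_terms: set[str] = field(default_factory=set)


def score(query: str, engine: Engine) -> float:
    """Fraction of the query's terms that appear in the engine's topic terms."""
    terms = set(query.lower().split())
    if not terms:
        return 0.0
    return len(terms & engine.topic_terms) / len(terms)


def pick_engines(query: str, engines: list[Engine], k: int = 2) -> list[Engine]:
    """Return up to k engines with a non-zero score, highest score first."""
    scored = [(score(query, e), e) for e in engines]
    relevant = [(s, e) for s, e in scored if s > 0]
    relevant.sort(key=lambda pair: pair[0], reverse=True)
    return [e for _, e in relevant[:k]]


# Assumed example data: three engines with hand-written topic terms.
engines = [
    Engine("medical-db", {"drug", "disease", "clinical"}),
    Engine("patent-db", {"patent", "invention"}),
    Engine("movie-db", {"film", "actor"}),
]
picked = pick_engines("clinical drug trials", engines)
```

A real system would then submit the query to each selected engine's search form and merge the returned pages; that step is omitted here because it depends on details the abstract does not give.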