Query-related data extraction of hidden web documents
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A two-phase sampling technique for information extraction from hidden web databases
Proceedings of the 6th annual ACM international workshop on Web information and data management
SmartCrawl: a new strategy for the exploration of the hidden web
Proceedings of the 6th annual ACM international workshop on Web information and data management
Discover the semantic topology in high-dimensional data
Expert Systems with Applications: An International Journal
Sampling, information extraction and summarisation of hidden web databases
Data & Knowledge Engineering - Special issue: WIDM 2004
Retrieval for decision support resources by structured models
Decision Support Systems
CCReSD: concept-based categorisation of Hidden Web databases
International Journal of High Performance Computing and Networking
Automatic Hidden Web Database Classification
PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Facilitating discovery on the private web using dataset digests
Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
Shopping search engine technology based on services
ACS'08 Proceedings of the 8th conference on Applied computer scince
A simplicial complex, a hypergraph, structure in the latent semantic space of document clustering
International Journal of Approximate Reasoning
Facilitating discovery on the private web using dataset digests
International Journal of Metadata, Semantics and Ontologies
Foundations and Trends in Information Retrieval
A TNATS approach to hidden web documents
ICDCIT'04 Proceedings of the First international conference on Distributed Computing and Internet Technology
Hi-index | 0.00 |
A large amount of on-line information resides on the invisible web - web pages generated dynamically from databases and other data sources hidden from the user. They are not indexed by a static URL but is generated when queries are asked via a search interface (we denote them as specialized search engines). In this paper we propose a system that is capable of automatically making use of these specialized engines to find information on the invisible web. We describe our overall architecture and process: from obtaining the search engines to picking the right engines to query. Experiments show that we can find information that is not found by the traditional search engines.