A technique for measuring the relative size and overlap of public Web search engines
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Towards comprehensive web search
Towards comprehensive web search
Building an open source meta-search engine
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Building an open source meta-search engine
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Search Engine Coverage of the OAI-PMH Corpus
IEEE Internet Computing
Random sampling from a search engine's index
Proceedings of the 15th international conference on World Wide Web
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Generalizing PageRank: damping functions for link-based ranking algorithms
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Methods for comparing rankings of search engine results
Computer Networks: The International Journal of Computer and Telecommunications Networking - Web dynamics
A study of results overlap and uniqueness among major web search engines
Information Processing and Management: an International Journal
Efficient, automatic web resource harvesting
WIDM '06 Proceedings of the 8th annual ACM international workshop on Web information and data management
Lazy preservation: reconstructing websites by crawling the crawlers
WIDM '06 Proceedings of the 8th annual ACM international workshop on Web information and data management
Estimating corpus size via queries
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
The Web as a graph: How far we are
ACM Transactions on Internet Technology (TOIT)
Extracting Topic Maps from Web Histories by Clustering with Web Structure and Contents
WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
Web Dragons: Inside the Myths of Search Engine Technology
Web Dragons: Inside the Myths of Search Engine Technology
Is it correct?: towards web-based evaluation of automatic natural language phrase generation
COLING-ACL '06 Proceedings of the COLING/ACL on Interactive presentation sessions
Aggregation of web search engines based on users' preferences in WebFusion
Knowledge-Based Systems
Web searching, search engines and Information Retrieval
Information Services and Use
Efficient search engine measurements
Proceedings of the 16th international conference on World Wide Web
Extraction and classification of dense communities in the web
Proceedings of the 16th international conference on World Wide Web
Temporal Analysis of the Wikigraph
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Factors affecting website reconstruction from the web infrastructure
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Agreeing to disagree: search engines and their public interfaces
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Decoding the structure of the WWW: A comparative analysis of Web crawls
ACM Transactions on the Web (TWEB)
Pruning policies for two-tiered inverted index with correctness guarantee
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating sampling methods for uncooperative collections
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Supporting intelligent Web search
ACM Transactions on Internet Technology (TOIT)
A data-oriented (and beyond) network architecture
Proceedings of the 2007 conference on Applications, technologies, architectures, and protocols for computer communications
Possibilistic fuzzy co-clustering of large document collections
Pattern Recognition
User-assisted similarity estimation for searching related web pages
Proceedings of the eighteenth conference on Hypertext and hypermedia
Comparison of Krylov subspace methods on the PageRank problem
Journal of Computational and Applied Mathematics
Link analysis for Web spam detection
ACM Transactions on the Web (TWEB)
WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
A personalized search engine based on Web-snippet hierarchical clustering
Software—Practice & Experience
Entropy of search logs: how hard is search? with personalization? with backoff?
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
DistanceRank: An intelligent ranking algorithm for web pages
Information Processing and Management: an International Journal
Analyzing the impact of churn and malicious behavior on the quality of peer-to-peer web search
Proceedings of the 2008 ACM symposium on Applied computing
Web science: an interdisciplinary approach to understanding the web
Communications of the ACM - Web science
Rapid bootstrapping of statistical spoken dialogue systems
Speech Communication
Efficient semi-streaming algorithms for local triangle counting in massive graphs
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Random sampling from a search engine's index
Journal of the ACM (JACM)
A Topic-Specific Web Crawler with Concept Similarity Context Graph Based on FCA
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Real-Time Open-Domain QA on the Portuguese Web
IBERAMIA '08 Proceedings of the 11th Ibero-American conference on AI: Advances in Artificial Intelligence
Mining search engine query logs via suggestion sampling
Proceedings of the VLDB Endowment
Mapping geographic coverage of the web
Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems
A three-year study on the freshness of web search engine databases
Journal of Information Science
Search personalization through query and page topical analysis
User Modeling and User-Adapted Interaction
TC-SocialRank: Ranking the Social Web
WAW '09 Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph
Extraction and classification of dense implicit communities in the Web graph
ACM Transactions on the Web (TWEB)
Web Structure Mining by Isolated Cliques
IEICE - Transactions on Information and Systems
Ranking billions of web pages using diodes
Communications of the ACM - A Blind Person's Interaction with Technology
Investigation of the accuracy of search engine hit counts
Journal of Information Science
LIPSIN: line speed publish/subscribe inter-networking
Proceedings of the ACM SIGCOMM 2009 conference on Data communication
A method for measuring the evolution of a topic on the Web: The case of “informetrics”
Journal of the American Society for Information Science and Technology
Journal of Information Science
Web Observation from a User Perspective
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
FICA: A novel intelligent crawling algorithm based on reinforcement learning
Web Intelligence and Agent Systems
Improving the load balance for hybrid partitioning scheme by directing hybrid queries
PDCN '08 Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks
Estimating deep web data source size by capture---recapture method
Information Retrieval
From Keyword Search to Exploration: Designing Future Search Interfaces for the Web
Foundations and Trends in Web Science
Metadata as seeds for building an ontology driven information retrieval system
International Journal of Hybrid Intelligent Systems
A web page usage prediction scheme using sequence indexing and clustering techniques
Data & Knowledge Engineering
The adaptive web
A fast and compact web graph representation
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
A web-page usage prediction scheme using weighted suffix trees
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Efficient algorithms for large-scale local triangle counting
ACM Transactions on Knowledge Discovery from Data (TKDD)
A new approach to improving multilingual summarization using a genetic algorithm
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Research proposal for distributed deep web search
PIKM '10 Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management
Succinct representations of separable graphs
CPM'10 Proceedings of the 21st annual conference on Combinatorial pattern matching
Foundations and Trends in Information Retrieval
How much of the web is archived?
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Behavior based web page evaluation
Journal of Web Engineering
Ontology-driven personalized query refinement
Journal of Web Engineering
Efficient Search Engine Measurements
ACM Transactions on the Web (TWEB)
Sampling hidden objects using nearest-neighbor oracles
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
A Case Study of Collaboration and Reputation in Social Web Search
ACM Transactions on Intelligent Systems and Technology (TIST)
An overview of Web search evaluation methods
Computers and Electrical Engineering
Video histogram: a novel video signature for efficient web video duplicate detection
MMM'07 Proceedings of the 13th International conference on Multimedia Modeling - Volume Part II
Intelligent search on the internet
Reasoning, Action and Interaction in AI Theories and Systems
World Wide Web
Characterizing the semantic web on the web
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
The laplacian paradigm: emerging algorithms for massive graphs
TAMC'10 Proceedings of the 7th annual conference on Theory and Applications of Models of Computation
Query retrieval enhancement based on Huffman index terms encoding
Proceedings of the 3rd International Conference on Information and Communication Systems
An investigation into query throughput and load balance using grid IR
FDIA'08 Proceedings of the 2nd BCS IRSG conference on Future Directions in Information Access
Teaching of web information retrieval: web first or IR first?
TLIR'07 Proceedings of the First international conference on Teaching and Learning of Information Retrieval
To what problem is distributed information retrieval the solution?
Journal of the American Society for Information Science and Technology
Estimating sum by weighted sampling
ICALP'07 Proceedings of the 34th international conference on Automata, Languages and Programming
Size estimation of non-cooperative data collections
Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Analyzing and defending against web-based malware
ACM Computing Surveys (CSUR)
A Hybrid Approach for Web Change Detection
International Journal of Information Technology and Web Engineering
A synergistic approach to efficient web searching
Intelligent Decision Technologies
Hi-index | 0.00 |
In this short paper we estimate the size of the public indexable web at 11.5 billion pages. We also estimate the overlap and the index size of Google, MSN, Ask/Teoma and Yahoo!