Search engines and their public interfaces: which apis are the most synchronized?

Authors:
Frank McCown;Michael L. Nelson
Affiliations:
Old Dominion University, Norfolk, VA;Old Dominion University, Norfolk, VA
Venue:
Proceedings of the 16th international conference on World Wide Web
Year:
2007

Citing 3
Cited 5

Comparing Top k Lists

SIAM Journal on Discrete Mathematics
Random sampling from a search engine's index

Proceedings of the 15th international conference on World Wide Web
Methods for comparing rankings of search engine results

Computer Networks: The International Journal of Computer and Telecommunications Networking - Web dynamics

Harvesting needed to maintain scientific literature online

Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
IBM research division cloud computing initiative

IBM Journal of Research and Development
Evolution of web search results within years

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Characterizing web search queries that match very few or no results

Proceedings of the 21st ACM international conference on Information and knowledge management
Carbon dating the web: estimating the age of web resources

Proceedings of the 22nd international conference on World Wide Web companion

Quantified Score

Hi-index	0.00

Visualization

Abstract

Researchers of commercial search engines often collect datausing the application programming interface (API) or by"scraping" results from the web user interface (WUI), butanecdotal evidence suggests the interfaces produce differentresults. We provide the first in-depth quantitative analysisof the results produced by the Google, MSN and Yahoo APIand WUI interfaces. After submitting a variety of queriesto the interfaces for 5 months, we found significant discrepanciesin several categories. Our findings suggest that theAPI indexes are not older, but they are probably smaller for Google and Yahoo. Researchers may use our findings tobetter understand the differences between the interfaces andchoose the best API for their particular types of queries.