Researchers of commercial search engines often collect data using the application programming interface (API) or by "scraping" results from the web user interface (WUI), but anecdotal evidence suggests the interfaces produce different results. We provide the first in-depth quantitative analysis of the results produced by the Google, MSN and Yahoo API and WUI interfaces. After submitting a variety of queries to the interfaces for 5 months, we found significant discrepancies in several categories. Our findings suggest that the API indexes are not older, but they are probably smaller for Google and Yahoo. Researchers may use our findings to better understand the differences between the interfaces and choose the best API for their particular types of queries.