Stochastic Simulations of Rejected World Wide Web Pages

  • Authors:
  • George Meghabghab

  • Affiliations:
  • -

  • Venue:
  • MASCOTS '00 Proceedings of the 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
  • Year:
  • 2000

Abstract

This study is the first to use neural networks for stochastic simulation of the number of rejected Web pages per search query. An evaluation of search engine quality should consider not only the resulting set of Web pages but also an estimate of the rejected set. The iterative RBF neural network developed by Meghabghab and Nasr [1] was adapted to evaluate the number of rejected Web pages on four search engines: Yahoo, Alta Vista, Google, and Northern Light. Nine input variables were selected for the simulation. Stochastic simulation metamodeling typically uses regression models within Response Surface Methods; here, an RBF network is used instead. The RBF network divides the set of responses to a query into accepted and rejected Web pages. The RBF meta-model was trained on 937 examples drawn from a set of 9000 different simulation runs on the nine input variables. Results show that the number of rejected Web pages for a specific set of search queries on these four engines is very high. In addition, a goodness measure of a search engine for a given set of queries can be designed as a function of the coverage of the search engine and the normalized age of a new document in the result set for the query. The study concludes that unless search engine designers address the issues of rejected Web pages, indexing, and crawling, the use of the Web as a research tool for academic and educational purposes will remain hindered.
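
To illustrate what an RBF meta-model of this general kind can look like, the following is a minimal sketch only. It assumes Gaussian basis functions, centers picked at random from the training samples, and output weights fitted by ordinary least squares; the abstract specifies none of these choices, and the iterative network of [1] may differ substantially. The function names, the kernel width, and the data are hypothetical: the inputs and responses are synthetic placeholders, not the study's simulation runs. Only the sizes (nine input variables, 937 training examples) are taken from the abstract.

```python
import numpy as np


def rbf_design(X, centers, width):
    """Gaussian RBF activations for each sample/center pair, plus a bias column."""
    # Squared Euclidean distance between every sample and every center.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    phi = np.exp(-d2 / (2.0 * width ** 2))
    # The trailing column of ones lets the model fit a constant offset.
    return np.hstack([phi, np.ones((X.shape[0], 1))])


def fit_rbf(X, y, n_centers=50, width=1.0, seed=0):
    """Pick centers from the training samples and solve for linear output weights."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(X.shape[0], size=n_centers, replace=False)
    centers = X[idx]
    phi = rbf_design(X, centers, width)
    # Least-squares fit of the output layer (assumption: no iterative training).
    weights, *_ = np.linalg.lstsq(phi, y, rcond=None)
    return centers, weights


def predict_rbf(X, centers, weights, width=1.0):
    """Evaluate the fitted meta-model on new input points."""
    return rbf_design(X, centers, width) @ weights


# Synthetic stand-in data: 937 runs of 9 input variables (sizes from the abstract).
rng = np.random.default_rng(42)
X_train = rng.uniform(0.0, 1.0, size=(937, 9))
# Hypothetical smooth response standing in for "number of rejected pages".
y_train = np.sin(X_train).sum(axis=1) + 0.05 * rng.normal(size=937)

centers, weights = fit_rbf(X_train, y_train)
train_pred = predict_rbf(X_train, centers, weights)
train_mse = float(np.mean((train_pred - y_train) ** 2))

# Query the meta-model at a few unseen input settings.
X_new = rng.uniform(0.0, 1.0, size=(5, 9))
new_pred = predict_rbf(X_new, centers, weights)
```

The appeal of such a meta-model is that, once fitted, it can be queried at new input settings far more cheaply than rerunning the simulation. A regression-based response surface plays the same role with polynomial terms in place of the radial basis functions.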