MM '11 Proceedings of the 19th ACM international conference on Multimedia
Hi-index | 0.00 |
Search engines are playing a more and more important role in discovering information nowadays. Due to limi- tations of time-consuming, network bandwidth and hard- wares, we cannot obtain the whole information on the web and have to download important information first. In this paper we propose a novel crawling ordering strategy which is based on SiteRank. Experimental results running on over 15 million pages indicate that it can work efficiently in dis- covering important pages under the PageRank evaluation of page quality. Furthermore, it exhibits the ability of anti- spamming.