Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Machine Learning
Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Improving the performance of focused web crawlers
Data & Knowledge Engineering
A constrained crawling approach and its application to a specialised search engine
International Journal of Information and Communication Technology
Adaptive topical web crawling for domain-specific resource discovery guided by link-context
MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
ICWL'07 Proceedings of the 6th international conference on Advances in web based learning
An analyst-adaptive approach to focused crawlers
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Hi-index | 0.00 |
Focused crawlers are considered as a promising way to tackle the scalability problem of topic-oriented or personalized search engines. To design a focused crawler, the choice of strategy for prioritizing unvisited URLs is crucial. In this paper, we propose a method using a decision tree on anchor texts of hyperlinks. We conducted experiments on the real data sets of four Japanese universities and verified our approach.