In this paper, we discuss focused web crawlers, the relevance of anchor text, and a method for web page change detection in search engines. We propose a technique called weighted anchor text, which uses the link structure to form a weighted directed graph of anchor texts. A weight is assigned to every incoming link of a node in the directed graph. These weights are then used to decide the relevance of web pages, which are indexed in decreasing order of their assigned weights. We applied our algorithm to various websites and observed the results, and we deduce that it can be very useful when combined with other existing algorithms. Web usage has increased exponentially in the past few years, and the resulting enormous collection of web pages changes rapidly, with the degree of change varying from site to site. We discuss the relevance of change detection and then explore the related work in the area. Based on this understanding, we propose a new algorithm to map changes in a web page. After verifying results on various web pages, we observe the relative merits of the proposed algorithm.
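The weighted anchor text idea can be illustrated with a minimal Python sketch. The abstract does not say how the weights are computed, so the scoring rule here (the fraction of topic terms found in each incoming link's anchor text, summed per target page) is purely an assumption, as are the names `anchor_weight` and `rank_pages` and the example URLs. It only shows the general shape: weight each incoming link, accumulate the weights per node, and order pages by decreasing weight.

```python
from collections import defaultdict


def anchor_weight(anchor_text, topic_terms):
    """Hypothetical rule: fraction of topic terms present in the anchor text."""
    if not topic_terms:
        return 0.0
    words = set(anchor_text.lower().split())
    return len(words & topic_terms) / len(topic_terms)


def rank_pages(links, topic_terms):
    """Rank pages by the summed weight of their incoming anchor texts.

    links: iterable of (source_url, target_url, anchor_text) edges of the
    directed link graph. Returns (url, weight) pairs in decreasing order of
    weight; ties are broken alphabetically by URL.
    """
    terms = {t.lower() for t in topic_terms}
    weights = defaultdict(float)
    for source, target, anchor in links:
        # Each incoming link contributes its anchor-text weight to the target.
        weights[target] += anchor_weight(anchor, terms)
        # Keep the source in the graph even if nothing links to it.
        weights.setdefault(source, 0.0)
    return sorted(weights.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Made-up example edges, for illustration only.
    example_links = [
        ("a.example/", "b.example/crawler", "focused web crawler"),
        ("a.example/", "c.example/news", "latest news"),
        ("c.example/news", "b.example/crawler", "crawler tutorial"),
    ]
    for url, weight in rank_pages(example_links, ["focused", "crawler"]):
        print(f"{weight:.2f}  {url}")
```

In a crawler, the resulting order could serve as the priority in which pages are fetched or indexed. A different weighting rule can be substituted in `anchor_weight` without changing the ranking step.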