Many applications now need to detect changes on the web, both to help users understand page updates and, more generally, to study web dynamics. Web archiving is one field where detecting changes on web pages is important. Archiving institutions collect and preserve different versions of web sites for future generations. A major problem for archiving systems is understanding what happened between two versions of a web page. In this paper, we address this requirement by proposing a new change detection approach that computes the semantic differences between two versions of an HTML web page. Our approach, called Vi-DIFF, detects changes on the visual representation of web pages. It detects two types of changes: content changes and structural changes. Content changes are modifications to text, hyperlinks, and images. In contrast, structural changes alter the visual appearance of the page and the structure of its blocks. Vi-DIFF can serve various applications, such as crawl optimization, archive maintenance, and browsing of web changes. Experiments on Vi-DIFF were conducted and the results are promising.
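The distinction between content and structural changes can be illustrated with a minimal sketch. This is not the Vi-DIFF algorithm itself: it assumes each page version has already been segmented into visual blocks, and the `Block` class, its `path` field, and the `diff_versions` function are hypothetical names introduced only for this illustration. Blocks are matched by their position in the layout; a block present in only one version counts as a structural change, while a matched block whose text, hyperlinks, or images differ counts as a content change.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical visual block: a layout position plus its content."""
    path: str                         # position in the block tree, e.g. "1/2"
    text: str = ""
    links: frozenset = frozenset()    # hyperlink targets found in the block
    images: frozenset = frozenset()   # image sources found in the block


def diff_versions(old, new):
    """Compare two lists of Block and report structural and content changes."""
    old_by_path = {b.path: b for b in old}
    new_by_path = {b.path: b for b in new}

    # Structural changes: blocks that appear or disappear between versions.
    structural = {
        "inserted": sorted(new_by_path.keys() - old_by_path.keys()),
        "deleted": sorted(old_by_path.keys() - new_by_path.keys()),
    }

    # Content changes: same block position, different text, links, or images.
    content = {}
    for path in sorted(old_by_path.keys() & new_by_path.keys()):
        a, b = old_by_path[path], new_by_path[path]
        changed = [
            name for name in ("text", "links", "images")
            if getattr(a, name) != getattr(b, name)
        ]
        if changed:
            content[path] = changed

    return {"structural": structural, "content": content}


# Two made-up versions of the same page, already segmented into blocks.
old_version = [
    Block("1", text="Header", links=frozenset({"/home"})),
    Block("2", text="Old headline", images=frozenset({"a.png"})),
    Block("3", text="Footer"),
]
new_version = [
    Block("1", text="Header", links=frozenset({"/home", "/news"})),
    Block("2", text="New headline", images=frozenset({"a.png"})),
    Block("4", text="Sidebar"),
]

result = diff_versions(old_version, new_version)
```

Matching blocks purely by position is a simplification chosen to keep the sketch short; a block that moves within the layout would be reported here as one deletion plus one insertion.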