Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
Record-boundary discovery in Web documents
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Discovering informative content blocks from Web documents
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
HTML Page Analysis Based on Visual Cues
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Adapting Web Pages for Small-Screen Devices
IEEE Internet Computing
Web data extraction based on partial tree alignment
WWW '05 Proceedings of the 14th international conference on World Wide Web
Proceedings of the 15th international conference on World Wide Web
Robust web page segmentation for mobile terminal using content-distances and page layout information
Proceedings of the 16th international conference on World Wide Web
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
A General Approach for Partitioning Web Page Content Based on Geometric and Style Information
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
A graph-theoretic approach to webpage segmentation
Proceedings of the 17th international conference on World Wide Web
Hi-index | 0.00 |
This paper proposes a novel Web page segmentation method for mobile browsing, aiming to break a Web page into visually and semantically coherent units fitted to the limited screen size of mobile devices. We intend to simulate human’s perceptive process guided by four general laws in Gestalt theory, namely: proximity, similarity, closure and simplicity. We also present an application of adapting Web pages to mobile terminals based on segmentation. Experimental results show that the proposed method is efficient and can greatly improve segmentation accuracy.