Visual Based Content Understanding towards Web Adaptation
AH '02 Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems
Improving pseudo-relevance feedback in web information retrieval using web page segmentation
WWW '03 Proceedings of the 12th international conference on World Wide Web
Detecting web page structure for adaptive viewing on small form factor devices
WWW '03 Proceedings of the 12th international conference on World Wide Web
Hearsay: enabling audio browsing on hypertext content
Proceedings of the 13th international conference on World Wide Web
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Bootstrapping Semantic Annotation for Content-Rich HTML Documents
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Fully automatic wrapper generation for search engines
WWW '05 Proceedings of the 14th international conference on World Wide Web
Understanding the function of web elements for mobile content delivery using random walk models
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
A web browsing system based on adaptive presentation of web contents for cellular phones
W4A '06 Proceedings of the 2006 international cross-disciplinary workshop on Web accessibility (W4A): Building the mobile web: rediscovering accessibility?
Acquiring owl ontologies from data-intensive web sites
ICWE '06 Proceedings of the 6th international conference on Web engineering
Spatial graph grammars for graphical user interfaces
ACM Transactions on Computer-Human Interaction (TOCHI)
Reformatting web documents via header trees
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Vertical Navigation of Layout Adapted Web Documents
World Wide Web
Csurf: a context-driven non-visual web-browser
Proceedings of the 16th international conference on World Wide Web
Towards domain-independent information extraction from web tables
Proceedings of the 16th international conference on World Wide Web
Extraction of flat and nested data records from web pages
AusDM '06 Proceedings of the fifth Australasian conference on Data mining and analystics - Volume 61
OPA browser: a web browser for cellular phone users
Proceedings of the 20th annual ACM symposium on User interface software and technology
Automatic accessibility transcoding for flash content
Proceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility
Enabling device independent co-browsing through adaptive view points
Journal of Computer and System Sciences
OntoMiner: automated metadata and instance mining from news websites
International Journal of Web and Grid Services
Hunting for headings: sighted labeling vs. automatic classification of headings
Proceedings of the 10th international ACM SIGACCESS conference on Computers and accessibility
Automated Semantic Analysis of Schematic Data
World Wide Web
Structure Extraction from Presentation Slide Information
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Bridging the Web Accessibility Divide
Electronic Notes in Theoretical Computer Science (ENTCS)
A Visual Technique for Web Pages Comparison
Electronic Notes in Theoretical Computer Science (ENTCS)
Enhanced Gestalt Theory Guided Web Page Segmentation for Mobile Browsing
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Visual extraction of information from web pages
Journal of Visual Languages and Computing
Automatic document structure detection for data integration
BIS'07 Proceedings of the 10th international conference on Business information systems
Extracting content structure for web pages based on visual representation
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
DOM-based web pages to determine the structure of the similarity algorithm
IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Retrieval of snippets of web pages converted to plain text: more questions than answers
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
On-line web database integration
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
Ontology development for the semantic web: an html form-based reverse engineering approach
Journal of Web Engineering
Block-based language modeling approach towards web search
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
RSS feed generation from legacy HTML pages
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Design and implementation of web usage mining system using page scroll
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Towards understanding the functions of web element
AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, COA, and ODBASE - Volume Part II
Mobile web browsing techniques
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part V
Annotation and Auto-Scrolling for Web Page Overview in Mobile Web Browsing
International Journal of Handheld Computing Research
A general theory of spatial relations to support a graphical tool for visual information extraction
Journal of Visual Languages and Computing
Adapting data table to improve web accessibility
Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility
Hi-index | 0.00 |
Abstract: In this paper, we present a novel approach to automatically analyzing semantic structure of HTML pages based on detecting visual similarities of content objects on web pages. The approach is developed based on the observation that in most web pages, layout styles of subtitles or records of the same content category are consistent and there are apparent separation boundaries between different categories. Thus these subtitles should have similar appearances if they are rendered in visual browsers and different categories can be separated clearly. In our approach, we first measure visual similarities of HTML content objects. Then we apply a pattern detection algorithm to detect frequent patterns of visual similarity and use a number of heuristics to choose the most possible patterns. By grouping items according to these patterns, we finally build a hierarchical representation (tree) of HTML document with "visual consistency" inferred semantics. Preliminary experimental results show promising performances of the method with real web pages.