OPAL: a passe-partout for web forms

  • Authors:
  • Xiaonan Guo;Jochen Kranzdorf;Tim Furche;Giovanni Grasso;Giorgio Orsi;Christian Schallhart

  • Affiliations:
  • Oxford University, Oxford, United Kingdom;Oxford University, Oxford, United Kingdom;Oxford University, Oxford, United Kingdom;Oxford University, Oxford, United Kingdom;Oxford University, Oxford, United Kingdom;Oxford University, Oxford, United Kingdom

  • Venue:
  • Proceedings of the 21st international conference companion on World Wide Web
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing. In this demonstration, we use a novel form understanding approach, OPAL, to assist in form filling even for complex, previously unknown forms. OPAL associates form labels to fields by analyzing structural properties in the HTML encoding and visual features of the page rendering. OPAL interprets this labeling and classifies the fields according to a given domain ontology. The combination of these two properties, allows OPAL to deal effectively with many forms outside of the grasp of existing form filling techniques. In the UK real estate domain, OPAL achieves 99% accuracy in form understanding.