Information extraction from web pages based on their visual representation

  • Authors:
  • Ruslan R. Fayzrakhmanov

  • Affiliations:
  • Database and Artificial Intelligence Group, Institute of Information Systems, TU Vienna, Vienna, Austria

  • Venue:
  • ICWE'11 Proceedings of the 11th international conference on Current Trends in Web Engineering
  • Year:
  • 2011

Abstract

This research aims to improve the efficiency of web information extraction and web accessibility. We present the motivation, aim, and objectives of the research and describe the work performed on developing a web page model for information extraction. We also present work on making extracted information accessible to blind users, providing them with the means to navigate to and access the required information quickly. Finally, we outline our ongoing research on efficient methods and approaches for extracting information from the proposed model. Two main approaches are considered: 1) development of a library that provides the required functionality to the programmer; 2) development of a declarative Datalog-like language for information extraction.
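
To make the contrast between the two approaches concrete, the following is a minimal Python sketch and not the author's implementation. It assumes a very simple visual model in which each rendered text block is a rectangle; the names `Box`, `same_row`, `right_of`, `value_for_label`, and `value_relation`, as well as the sample page, are hypothetical and do not come from the paper. The first function illustrates the library style, where the programmer calls spatial helpers step by step; the second expresses a Datalog-like rule as a set comprehension over the same facts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """A rendered text block with its on-screen position (hypothetical model)."""
    text: str
    x: int
    y: int
    width: int
    height: int


def same_row(a: Box, b: Box) -> bool:
    """True when the vertical extents of the two boxes overlap."""
    return a.y < b.y + b.height and b.y < a.y + a.height


def right_of(a: Box, b: Box) -> bool:
    """True when a starts at or after the right edge of b, on the same row."""
    return same_row(a, b) and a.x >= b.x + b.width


# Approach 1 (library style): the programmer calls helper functions explicitly.
def value_for_label(boxes, label):
    anchors = [b for b in boxes if b.text == label]
    if not anchors:
        return None
    anchor = anchors[0]
    candidates = [b for b in boxes if right_of(b, anchor)]
    # Pick the candidate that starts closest to the label.
    nearest = min(candidates, key=lambda b: b.x, default=None)
    return nearest.text if nearest else None


# Approach 2 (declarative style): the Datalog-like rule
#   value(V, L) :- box(A), text(A, L), box(B), text(B, V), right_of(B, A).
# written as a set comprehension over the same facts.
def value_relation(boxes):
    return {(b.text, a.text) for a in boxes for b in boxes if right_of(b, a)}


# Invented sample page: two label/value pairs laid out in two rows.
PAGE = [
    Box("Price:", 10, 100, 50, 20),
    Box("42 EUR", 70, 100, 60, 20),
    Box("Title:", 10, 60, 50, 20),
    Box("Web Engineering", 70, 60, 120, 20),
]
```

With this sample page, `value_for_label(PAGE, "Price:")` looks up one value imperatively, while `value_relation(PAGE)` derives every (value, label) pair at once. The declarative form states what relation holds between boxes and leaves the search order to the evaluator, which is the kind of trade-off the two approaches suggest.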