Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto

  • Authors: Robert Baumgartner, Sergio Flesca, Georg Gottlob
  • Venue: LPNMR '01: Proceedings of the 6th International Conference on Logic Programming and Nonmonotonic Reasoning
  • Year: 2001

Abstract

Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting information from Web pages using such wrappers, and for translating the extracted content into XML. This paper describes some advanced features of Lixto, such as disjunctive pattern definitions, specialization rules, and Lixto's capability of collecting and aggregating information from several linked Web pages.