Automatically extracting structure and data from business reports

  • Authors:
  • Stephen W. Liddle; Douglas M. Campbell; Chad Crawford

  • Affiliations:
  • School of Accountancy and Information Systems, Marriott School of Management, Brigham Young University, Provo, UT; Computer Science Department, Brigham Young University, Provo, UT; School of Accountancy and Information Systems, Marriott School of Management, Brigham Young University, Provo, UT

  • Venue:
  • Proceedings of the Eighth International Conference on Information and Knowledge Management
  • Year:
  • 1999

Abstract

A considerable amount of clean, semistructured data is available internally to companies in the form of business reports. However, because these reports are not in relational form, they remain untapped for data mining, data warehousing, and querying. Business reports nonetheless have a regular structure that can be reconstructed. We present algorithms that automatically infer the regular structure underlying business reports and automatically generate wrappers to extract relational data.
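
To make the idea of inferring structure and then extracting relational data concrete, here is a minimal sketch. It is not the authors' algorithm, which the abstract does not describe. It assumes a fixed-width text report whose columns are separated by character positions that are blank on every line. The sample report and the function names `infer_column_spans` and `extract_rows` are hypothetical.

```python
# Illustrative sketch only: a simple blank-column heuristic, assumed for
# demonstration; it is not the method presented in the paper.

# Hypothetical fixed-width report (header line plus three data lines).
REPORT = [
    "ACCT   NAME        BALANCE",
    "1001   Cash        1200.50",
    "1002   Inventory    830.00",
    "2001   Payables     415.25",
]


def infer_column_spans(lines):
    """Infer (start, end) character spans of columns.

    A character position belongs to a column if at least one line has a
    non-space character there; runs of such positions form one column.
    """
    width = max(len(line) for line in lines)
    padded = [line.ljust(width) for line in lines]
    occupied = [any(row[i] != " " for row in padded) for i in range(width)]

    spans = []
    start = None
    for i, used in enumerate(occupied):
        if used and start is None:
            start = i                  # a new column begins
        elif not used and start is not None:
            spans.append((start, i))   # the column ended at the gap
            start = None
    if start is not None:
        spans.append((start, width))   # last column runs to the line end
    return spans


def extract_rows(lines, spans):
    """Act as a tiny 'wrapper': slice each line into a tuple of fields."""
    return [tuple(line[s:e].strip() for s, e in spans) for line in lines]


if __name__ == "__main__":
    spans = infer_column_spans(REPORT)
    for row in extract_rows(REPORT, spans):
        print(row)
```

The inferred spans play the role of a generated wrapper: once computed from sample lines, they can be reused to turn further lines of the same report layout into tuples suitable for loading into a relational table. Real reports with page headers, subtotals, or nested groups would need considerably more than this heuristic.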