Online Evaluation of Patterns from Evolving Web Data Streams

  • Authors:
  • Carlos Rojas;Olfa Nasraoui

  • Affiliations:
  • -;-

  • Venue:
  • WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a generic framework to evaluate patterns obtained from transactional web data streams whose underlying distribution changes with time. The evolving nature of the data makes it very difficult to determine whether there is structure in the data stream, and whether this structure is being learned. This challenge arises in applications such as mining online store transactions, summarizing dynamic document collections, and profiling web traffic. We propose to evaluate this hard instance of unsupervised learning using a continuous assessment of the predictive power of the learned patterns, with specific examples that borrow concepts from supervised learning. We present results from experiments with synthetic data, the 20 Newsgroups dataset, web clickstream data, and a custom collection of RSS News feeds.