Wrappers for web access logs feature selection

  • Authors: Maria Muntean; Honoriu Vălean

  • Affiliations: University of Alba Iulia, Alba-Iulia, Romania; Technical University of Cluj Napoca, Cluj-Napoca, Romania

  • Venue: Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics

  • Year: 2012

Abstract

Web Usage Mining (WUM), a relatively recent research field, applies the process of knowledge discovery from databases (KDD) to Web usage data. The main problems in WUM are the quantity of Web usage data to be analyzed and its poor quality, in particular the abundance of features. Considering the characteristics of Web log data and the function of each phase of data preprocessing, this paper establishes a Web log data preprocessing algorithm based on feature selection. The implemented Wrapper Evaluation feature selection method uses a Best First Search and a Greedy Stepwise Search, and evaluates each attribute subset according to a Support Vector Machine learning scheme.
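
The wrapper idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it shows only a forward Greedy Stepwise search, and to stay self-contained it scores subsets with a small leave-one-out nearest-centroid classifier as a stand-in for the Support Vector Machine. The function names (`loo_accuracy`, `greedy_stepwise`) and the toy data are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

# Toy data (hypothetical): 8 sessions, 3 numeric features, binary label.
# Only feature 0 separates the two classes cleanly.
X = [
    [1.0, 5.0, 0.2], [1.2, 1.0, 0.9], [0.8, 4.0, 0.5], [1.1, 2.0, 0.1],
    [3.0, 5.5, 0.3], [3.2, 1.5, 0.8], [2.9, 4.5, 0.6], [3.1, 2.5, 0.2],
]
y = [0, 0, 0, 0, 1, 1, 1, 1]


def loo_accuracy(X: Sequence[Sequence[float]], y: Sequence[int],
                 features: Sequence[int]) -> float:
    """Leave-one-out accuracy of a nearest-centroid classifier that sees
    only the given feature columns. Stand-in for an SVM evaluator."""
    if not features:
        return 0.0  # assumption: the empty subset gets the lowest score
    correct = 0
    for i in range(len(X)):
        # Class centroids computed without the held-out sample i.
        centroids = {}
        for label in set(y):
            rows = [X[j] for j in range(len(X)) if j != i and y[j] == label]
            centroids[label] = [sum(r[f] for r in rows) / len(rows)
                                for f in features]
        point = [X[i][f] for f in features]
        predicted = min(
            centroids,
            key=lambda c: sum((p - q) ** 2
                              for p, q in zip(point, centroids[c])),
        )
        correct += predicted == y[i]
    return correct / len(X)


def greedy_stepwise(n_features: int,
                    evaluate: Callable[[List[int]], float]
                    ) -> Tuple[List[int], float]:
    """Forward greedy search: repeatedly add the single feature that most
    improves the evaluator's score; stop when no addition improves it."""
    selected: List[int] = []
    best_score = evaluate(selected)
    while True:
        best_candidate = None
        for f in range(n_features):
            if f in selected:
                continue
            score = evaluate(selected + [f])
            if score > best_score:  # strict improvement required
                best_score, best_candidate = score, f
        if best_candidate is None:
            break
        selected.append(best_candidate)
    return selected, best_score


selected, score = greedy_stepwise(3, lambda fs: loo_accuracy(X, y, fs))
print(selected, score)
```

In a setting closer to the paper's, `evaluate` would return the cross-validated accuracy of an SVM trained on the candidate subset. A Best First Search would differ from this sketch by keeping a queue of previously evaluated subsets and backtracking to them when the current path stops improving.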