A Platform for Extracting and Storing Web Data

  • Authors:
  • L. Víctor Rebolledo;Juan D. Velásquez

  • Venue:
  • KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II
  • Year:
  • 2009

Abstract

Web data, that is, data originating on the Web, contain information and knowledge that can be used to improve a web site's efficiency and effectiveness in attracting and retaining visitors. However, web data also include much irrelevant content. Consequently, they must be preprocessed before the web user browsing behavior they record can be modeled and understood. Further, because both visitor behavior and the web site itself change frequently, the discovered knowledge may become obsolete in a short period of time. In this paper, we introduce a platform that extracts, preprocesses, and stores web data to enable the use of web mining techniques. Specifically, an Information Repository (IR) stores the preprocessed web data and facilitates pattern extraction. Likewise, a Knowledge Base (KB) stores the discovered patterns that have been validated by a domain expert. The proposed structure was tested on a real web site to demonstrate the effectiveness of our approach.