With the large number of companies using the Internet to distribute and collect information, knowledge discovery on the web has become an important research area. Web usage mining, which is the main topic of this paper, focuses on knowledge discovery from the clicks in the web log for a given site (the so-called click-stream), especially on analysis of sequences of clicks. Existing techniques for analyzing click sequences have different drawbacks, i.e., either huge storage requirements, excessive I/O cost, or scalability problems when additional information is introduced into the analysis.

In this paper we present a new hybrid approach for analyzing click sequences that aims to overcome these drawbacks. The approach is based on a novel combination of existing approaches, more specifically the Hypertext Probabilistic Grammar (HPG) and Click Fact Table approaches. The approach allows for additional information, e.g., user demographics, to be included in the analysis without introducing performance problems. The development is driven by experiences gained from industry collaboration. A prototype has been implemented and experiments are presented that show that the hybrid approach performs well compared to the existing approaches. This is especially true when mining sessions containing clicks with certain characteristics, i.e., when constraints are introduced. The approach is not limited to web log analysis, but can also be used for general sequence mining tasks.
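The general idea of combining click-level facts with a probabilistic navigation model can be illustrated with a small sketch. It is not the paper's implementation: the in-memory `CLICKS` list stands in for a click fact table, the `country` attribute is a hypothetical example of user demographics, and `build_grammar` computes only simple first-order transition probabilities with artificial start (`S`) and end (`F`) states, which is a simplification of an HPG. All names here are assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical click fact table: one row per click.
# "seq" is the click's position within its session; "country" is an
# assumed demographic attribute attached to each click.
CLICKS = [
    {"session": 1, "seq": 1, "page": "A", "country": "DK"},
    {"session": 1, "seq": 2, "page": "B", "country": "DK"},
    {"session": 1, "seq": 3, "page": "C", "country": "DK"},
    {"session": 2, "seq": 1, "page": "A", "country": "US"},
    {"session": 2, "seq": 2, "page": "C", "country": "US"},
    {"session": 3, "seq": 1, "page": "B", "country": "DK"},
    {"session": 3, "seq": 2, "page": "C", "country": "DK"},
]


def sessions_matching(clicks, predicate):
    """Return the page sequences of sessions that contain at least one
    click satisfying `predicate` (the constraint)."""
    by_session = defaultdict(list)
    for click in sorted(clicks, key=lambda c: (c["session"], c["seq"])):
        by_session[click["session"]].append(click)
    return [
        [c["page"] for c in session_clicks]
        for session_clicks in by_session.values()
        if any(predicate(c) for c in session_clicks)
    ]


def build_grammar(sequences):
    """Estimate first-order transition probabilities between pages,
    using "S" and "F" as artificial start and end states."""
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        path = ["S"] + seq + ["F"]
        for src, dst in zip(path, path[1:]):
            counts[src][dst] += 1
    return {
        src: {dst: n / sum(nxt.values()) for dst, n in nxt.items()}
        for src, nxt in counts.items()
    }


# Constraint: keep only sessions with at least one click from "DK".
selected = sessions_matching(CLICKS, lambda c: c["country"] == "DK")
grammar = build_grammar(selected)
```

In this sketch the constraint is evaluated against the click-level data first, and the navigation model is built only from the sessions that survive, so adding further attributes to the click rows does not change how the model itself is represented.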