Expert Systems with Applications: An International Journal
Complex and extensive web sites are becoming increasingly popular, and companies need to justify their investment in them. Analysis of web-related data is one way to provide this justification. Large amounts of such data often sit in repositories unused, and the reasons are simple: people do not know what to do with the data, how to prepare it, or what kinds of tasks should be performed to retrieve valuable knowledge. Commercial web mining packages do not answer all the questions that may be of interest to the data analyst. In this paper the authors suggest several hypotheses that could help to improve a web site's retention. The investigation proposes decision trees for web user behaviour analysis, which includes predicting users' future actions and identifying the typical pages that lead to browsing termination. The decision tree package C4.5 was used in this study. Decision trees showed reasonable computational performance and accuracy. Experiments showed that it is possible to predict future user actions with a reasonable misclassification error, as well as to find combinations of sequential pages that result in browsing termination. In addition, the decision trees generated human-understandable rules that can be analysed further to improve the web site.
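To make the idea of finding page sequences that lead to browsing termination more concrete, the following is a minimal sketch, not the authors' implementation. It turns click sessions into training examples (the two most recently visited pages, labelled by whether the session then ends) and ranks the attributes by gain ratio, the split criterion C4.5 uses for nominal attributes. The session data, the window width of two pages, and all function and attribute names are assumptions made for illustration; a full C4.5 run would also build the whole tree, prune it, and handle continuous attributes.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def gain_ratio(rows, labels, attr):
    """Information gain of splitting on `attr`, divided by the split information."""
    groups = defaultdict(list)
    for row, label in zip(rows, labels):
        groups[row[attr]].append(label)
    total = len(labels)
    # Weighted entropy of the class labels after the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    gain = entropy(labels) - remainder
    # Entropy of the partition sizes; penalises many-valued attributes.
    split_info = -sum(
        (len(g) / total) * math.log2(len(g) / total) for g in groups.values()
    )
    return gain / split_info if split_info > 0 else 0.0


def session_examples(session):
    """Yield (features, label) pairs from one session using a 2-page window.

    The label is "exit" when the window ends the session, else "continue".
    """
    for end in range(2, len(session) + 1):
        features = {"second_last": session[end - 2], "last": session[end - 1]}
        label = "exit" if end == len(session) else "continue"
        yield features, label


# Hypothetical sessions: each list is the ordered pages one visitor viewed.
sessions = [
    ["home", "catalog", "cart", "checkout"],
    ["home", "search", "help"],
    ["home", "catalog", "help"],
    ["search", "catalog", "cart", "checkout"],
]

rows, labels = [], []
for session in sessions:
    for features, label in session_examples(session):
        rows.append(features)
        labels.append(label)

# Rank attributes the way a C4.5-style learner would pick its root split.
scores = {attr: gain_ratio(rows, labels, attr) for attr in ("second_last", "last")}
best_attribute = max(scores, key=scores.get)
print(scores, best_attribute)
```

In this toy data the most recently visited page separates exits from continued browsing better than the page before it, so it would be chosen as the root split. On real logs the same ranking would be applied recursively to grow a tree whose branches read as rules of the form "if the last pages were X and Y, the visitor is likely to leave".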