Hypermedia and cognition: designing for comprehension
Communications of the ACM
Silk from a sow's ear: extracting usable structures from the Web
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Fast discovery of association rules
Advances in knowledge discovery and data mining
SpeedTracer: a Web usage mining and analysis tool
IBM Systems Journal
ACM SIGKDD Explorations Newsletter
SPADE: an efficient algorithm for mining frequent sequences
Machine Learning
Scalable Algorithms for Association Mining
IEEE Transactions on Knowledge and Data Engineering
Mining Sequential Patterns: Generalizations and Performance Improvements
EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
WUM - A Tool for WWW Ulitization Analysis
WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Data mining for path traversal patterns in a web environment
ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
Web Mining: Information and Pattern Discovery on the World Wide Web
ICTAI '97 Proceedings of the 9th International Conference on Tools with Artificial Intelligence
Efficiently mining frequent trees in a forest
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
XRules: an effective structural classifier for XML data
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining interesting knowledge from weblogs: a survey
Data & Knowledge Engineering
Proceedings of the 2006 workshop on Memory system performance and correctness
Data & Knowledge Engineering - Special issue: WIDM 2004
Combining Web Usage Mining and XML Mining in a Real Case Study
From Web to Social Web: Discovering and Deploying User and Content Profiles
U3 - Mning Unordered Embedded Subtrees Using TMG Candidate Generation
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Adaptive XML Tree Classification on Evolving Data Streams
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Web Semantics: Science, Services and Agents on the World Wide Web
Model guided algorithm for mining unordered embedded subtrees
Web Intelligence and Agent Systems
Alternative Approach to Tree-Structured Web Log Representation and Mining
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Integrating web conceptual modeling and web usage mining
WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
Employing inductive databases in concrete applications
Proceedings of the 2004 European conference on Constraint-Based Mining and Inductive Databases
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
XML document clustering using structure-preserving flat representation of XML content and structure
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Proceedings of the CUBE International Information Technology Conference
Hi-index | 0.00 |
Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.