Efficient Data Mining for Path Traversal Patterns

Authors:
Ming-Syan Chen; Jong Soo Park; Philip S. Yu
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
1998

Abstract

In this paper, we explore a new data mining capability that involves mining path traversal patterns in a distributed information-providing environment where documents or objects are linked together to facilitate interactive access. Our solution procedure consists of two steps. First, we derive an algorithm to convert the original sequence of log data into a set of maximal forward references. By doing so, we can filter out the effect of some backward references, which are mainly made for ease of traveling, and concentrate on mining meaningful user access sequences. Second, we derive algorithms to determine the frequent traversal patterns, i.e., large reference sequences, from the maximal forward references obtained. Two algorithms are devised for determining large reference sequences: one is based on hashing and pruning techniques, and the other is further improved with the option of determining large reference sequences in batch so as to reduce the number of database scans required. The performance of these two methods is comparatively analyzed. It is shown that the option of selective scan is very advantageous and can lead to a prominent performance improvement. Sensitivity analysis on various parameters is conducted.
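
The first step described above, converting a traversal log into maximal forward references, can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not spell out: a backward reference is taken to be a revisit of a page already on the current path, and a forward run ends either at such a revisit or at the end of the log. The function name `maximal_forward_references` and the sample page labels are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
from typing import Hashable, List, Sequence


def maximal_forward_references(traversal: Sequence[Hashable]) -> List[list]:
    """Split one user's traversal log into maximal forward references.

    Assumption: visiting a page that is already on the current path is a
    backward reference; any other visit is a forward reference.
    """
    path: list = []          # current forward path from the starting page
    results: List[list] = []
    moving_forward = False   # True if the previous step was a forward reference

    for page in traversal:
        if page in path:
            # Backward reference: if we were moving forward, the path built
            # so far is maximal, so record it before backing up.
            if moving_forward:
                results.append(list(path))
            # Back up to the revisited page, discarding everything after it.
            del path[path.index(page) + 1:]
            moving_forward = False
        else:
            # Forward reference: extend the current path.
            path.append(page)
            moving_forward = True

    # A forward run that reaches the end of the log is also maximal.
    if moving_forward and path:
        results.append(list(path))
    return results


if __name__ == "__main__":
    log = "ABCDCBEGHGWAOUOV"  # illustrative sequence of page labels
    print(["".join(p) for p in maximal_forward_references(log)])
    # prints ['ABCD', 'ABEGH', 'ABEGW', 'AOU', 'AOV']
```

The second step would then count how often each consecutive page sequence occurs across these references and keep those meeting a support threshold. The hashing, pruning, and selective-scan techniques mentioned in the abstract are not reproduced here.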