Incremental tensor analysis: Theory and applications

Authors:
Jimeng Sun;Dacheng Tao;Spiros Papadimitriou;Philip S. Yu;Christos Faloutsos
Affiliations:
IBM TJ Watson Research Center, Yorktown Heights, NY;Nanyang Technological University, Singapore;IBM TJ Watson Research Center, Yorktown Heights, NY;University of Illinois at Chicago, Chicago, IL;Carnegie Mellon University, Pittsburgh, PA
Venue:
ACM Transactions on Knowledge Discovery from Data (TKDD)
Year:
2008

Citing 35
Cited 27

Adaptive filter theory (2nd ed.)

Adaptive filter theory (2nd ed.)
Personalized information delivery: an analysis of information filtering methods

Communications of the ACM - Special issue on information filtering
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Latent semantic indexing: a probabilistic analysis

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Parallel Multilevel series k-Way Partitioning Scheme for Irregular Graphs

SIAM Review
Mining high-speed data streams

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
On the Best Rank-1 and Rank-(R1,R2,. . .,RN) Approximation of Higher-Order Tensors

SIAM Journal on Matrix Analysis and Applications
The ubiquitous Kronecker product

Journal of Computational and Applied Mathematics - Special issue on numerical analysis 2000 Vol. III: linear algebra
Mining the network value of customers

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Mining data streams under block evolution

ACM SIGKDD Explorations Newsletter
Multilinear Analysis of Image Ensembles: TensorFaces

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
Identifying Representative Trends in Massive Time Series Data Sets Using Sketches

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Quantifiable data mining using ratio rules

The VLDB Journal — The International Journal on Very Large Data Bases
Clustering Data Streams: Theory and Practice

IEEE Transactions on Knowledge and Data Engineering
On clusterings-good, bad and spectral

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Online Data Mining for Co-Evolving Time Sequences

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Mining concept-drifting data streams using ensemble classifiers

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Correlating synchronous and asynchronous data streams

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
BRAID: stream mining through group lag correlations

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Concurrent Subspaces Analysis

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Streaming pattern discovery in multiple time-series

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Higher-Order Web Link Analysis Using Multilinear Algebra

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Generalized Low Rank Approximations of Matrices

Machine Learning
Tensor-CUR decompositions for tensor-based data

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Beyond streams and graphs: dynamic tensor analysis

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Two-Dimensional Linear Discriminant Analysis of Principle Component Vectors for Face Recognition

IEICE - Transactions on Information and Systems
Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
StatStream: statistical monitoring of thousands of data streams in real time

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
A framework for clustering evolving data streams

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Adaptive, hands-off stream mining

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Tensor Decompositions and Applications

SIAM Review

Dynamic Exponential Family Matrix Factorization

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
TDM modeling and evaluation of different domain transforms for LSI

Neurocomputing
Multi-view face recognition based on tensor subspace analysis and view manifold modeling

Neurocomputing
Multi-video synopsis for video representation

Signal Processing
Binary sparse nonnegative matrix factorization

IEEE Transactions on Circuits and Systems for Video Technology
A New Incremental PCA Algorithm With Application to Visual Learning and Recognition

Neural Processing Letters
Uncorrelated multilinear principal component analysis for unsupervised multilinear subspace learning

IEEE Transactions on Neural Networks
Distance approximating dimension reduction of Riemannian manifolds

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Discriminative orthogonal neighborhood-preserving projections for classification

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Incremental tensor biased discriminant analysis: A new color-based visual tracking method

Neurocomputing
Photo-sketch synthesis and recognition based on subspace learning

Neurocomputing
A fast recognition framework based on extreme learning machine using hybrid object information

Neurocomputing
A unified tensor level set for image segmentation

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
Incremental embedding and learning in the local discriminant subspace with application to face recognition

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Incremental Tensor Subspace Learning and Its Applications to Foreground Segmentation and Tracking

International Journal of Computer Vision
A survey of multilinear subspace learning for tensor data

Pattern Recognition
CIM: categorical influence maximization

Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Extended HALS algorithm for nonnegative Tucker decomposition and its applications for multiway analysis and classification

Neurocomputing
XML documents clustering using a tensor space model

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
MultiRank: co-ranking for objects and relations in multi-relational data

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Extract minimum positive and maximum negative features for imbalanced binary classification

Pattern Recognition
Fast metadata-driven multiresolution tensor decomposition

Proceedings of the 20th ACM international conference on Information and knowledge management
Incremental threshold learning for classifier selection

Neurocomputing
Utilizing common substructures to speedup tensor factorization for mining dynamic graphs

Proceedings of the 21st ACM international conference on Information and knowledge management
MultiAspectForensics: mining large heterogeneous networks using tensor

International Journal of Web Engineering and Technology
Biview face recognition in the shape-texture domain

Pattern Recognition
From the idea of "sparse representation" to a representation-based transformation method for feature extraction

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

How do we find patterns in author-keyword associations, evolving over time? Or in data cubes (tensors), with product-branchcustomer sales information? And more generally, how to summarize high-order data cubes (tensors)? How to incrementally update these patterns over time? Matrix decompositions, like principal component analysis (PCA) and variants, are invaluable tools for mining, dimensionality reduction, feature selection, rule identification in numerous settings like streaming data, text, graphs, social networks, and many more settings. However, they have only two orders (i.e., matrices, like author and keyword in the previous example). We propose to envision such higher-order data as tensors, and tap the vast literature on the topic. However, these methods do not necessarily scale up, let alone operate on semi-infinite streams. Thus, we introduce a general framework, incremental tensor analysis (ITA), which efficiently computes a compact summary for high-order and high-dimensional data, and also reveals the hidden correlations. Three variants of ITA are presented: (1) dynamic tensor analysis (DTA); (2) streaming tensor analysis (STA); and (3) window-based tensor analysis (WTA). In paricular, we explore several fundamental design trade-offs such as space efficiency, computational cost, approximation accuracy, time dependency, and model complexity. We implement all our methods and apply them in several real settings, such as network anomaly detection, multiway latent semantic indexing on citation networks, and correlation study on sensor measurements. Our empirical studies show that the proposed methods are fast and accurate and that they find interesting patterns and outliers on the real datasets.