Algorithms for clustering data
Algorithms for clustering data
Latent semantic indexing is an optimal special case of multidimensional scaling
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Matrix computations (3rd ed.)
Latent semantic indexing: a probabilistic analysis
PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A similarity-based probability model for latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document clustering with cluster refinement and model selection capabilities
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A Min-max Cut Algorithm for Graph Partitioning and Data Clustering
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Segmentation Using Eigenvectors: A Unifying View
ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
A Fast Algorithm for Finding k-Nearest Neighbors with Non-Metric Dissimilarity
IWFHR '02 Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition (IWFHR'02)
Document clustering based on non-negative matrix factorization
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Locality preserving indexing for document representation
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Document clustering by concept factorization
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Document clustering via adaptive subspace iteration
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Restrictive clustering and metaclustering for self-organizing document collections
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Matching Theory (North-Holland mathematics studies)
Matching Theory (North-Holland mathematics studies)
A Branch and Bound Algorithm for Computing k-Nearest Neighbors
IEEE Transactions on Computers
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data
IEEE Transactions on Knowledge and Data Engineering
Regularized locality preserving indexing via spectral regression
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Locality sensitive semi-supervised feature selection
Neurocomputing
Classification of multivariate time series using locality preserving projections
Knowledge-Based Systems
Modeling hidden topics on document manifold
Proceedings of the 17th ACM conference on Information and knowledge management
An active learning framework for semi-supervised document clustering with language modeling
Data & Knowledge Engineering
AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Approximate Spectral Clustering
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Enhanced bisecting k-means clustering using intermediate cooperation
Pattern Recognition
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Locality preserving nonnegative matrix factorization
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Laplacian Linear Discriminant Analysis Approach to Unsupervised Feature Selection
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A Novel Path-Based Clustering Algorithm Using Multi-dimensional Scaling
AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
2D-LPI: Two-Dimensional Locality Preserving Indexing
PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Constrained Laplacian Eigenmap for dimensionality reduction
Neurocomputing
Pattern Recognition
FLPI: an optimal algorithm for document indexing
RSKT'08 Proceedings of the 3rd international conference on Rough sets and knowledge technology
Approximately harmonic projection: Theoretical analysis and an algorithm
Pattern Recognition
Efficient face recognition using tensor subspace regression
Neurocomputing
LPP solution schemes for use with face recognition
Pattern Recognition
Approximate pairwise clustering for large data sets via sampling plus extension
Pattern Recognition
A comparative study of TF*IDF, LSI and multi-words for text classification
Expert Systems with Applications: An International Journal
Using correlation dimension for analysing text data
ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part I
Comparing fuzzy, probabilistic, and possibilistic partitions
IEEE Transactions on Fuzzy Systems
Document clustering using synthetic cluster prototypes
Data & Knowledge Engineering
Which clustering do you want? inducing your ideal clustering with minimal feedback
Journal of Artificial Intelligence Research
Discriminative concept factorization for data representation
Neurocomputing
Image analysis with nonlinear adaptive dimension reduction
Proceedings of the Third International Conference on Internet Multimedia Computing and Service
Coupled nominal similarity in unsupervised learning
Proceedings of the 20th ACM international conference on Information and knowledge management
Distributed knowledge discovery with non linear dimensionality reduction
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Image clustering via sparse representation
MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Place semantics into context: service community discovery from the WSDL corpus
ICSOC'11 Proceedings of the 9th international conference on Service-Oriented Computing
An efficient framework for constructing generalized locally-induced text metrics
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
On trivial solution and scale transfer problems in graph regularized NMF
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Positive unlabeled learning for time series classification
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Clustering high dimensional data
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Constraint projections for semi-supervised affinity propagation
Knowledge-Based Systems
Relational co-clustering via manifold ensemble learning
Proceedings of the 21st ACM international conference on Information and knowledge management
Accelerating locality preserving nonnegative matrix factorization
Proceedings of the 21st ACM international conference on Information and knowledge management
Personal and Ubiquitous Computing
Sparse functional representation for large-scale service clustering
ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
DLPR: a distributed locality preserving dimension reduction algorithm
IDCS'12 Proceedings of the 5th international conference on Internet and Distributed Computing Systems
p-PIC: Parallel power iteration clustering for big data
Journal of Parallel and Distributed Computing
Equivalence Between LDA/QR and Direct LDA
International Journal of Cognitive Informatics and Natural Intelligence
Manifold based sparse representation for facial understanding in natural images
Image and Vision Computing
Pattern Recognition Letters
G-Optimal Feature Selection with Laplacian regularization
Neurocomputing
Robust tensor clustering with non-greedy maximization
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Coupled attribute analysis on numerical data
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Locality mutual clustering for document retrieval
Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication
Hi-index | 0.00 |
We propose a novel document clustering method which aims to cluster the documents into different semantic classes. The document space is generally of high dimensionality and clustering in such a high dimensional space is often infeasible due to the curse of dimensionality. By using Locality Preserving Indexing (LPI), the documents can be projected into a lower-dimensional semantic space in which the documents related to the same semantics are close to each other. Different from previous document clustering methods based on Latent Semantic Indexing (LSI) or Nonnegative Matrix Factorization (NMF), our method tries to discover both the geometric and discriminating structures of the document space. Theoretical analysis of our method shows that LPI is an unsupervised approximation of the supervised Linear Discriminant Analysis (LDA) method, which gives the intuitive motivation of our method. Extensive experimental evaluations are performed on the Reuters-21578 and TDT2 data sets.