Algorithms for clustering data
Algorithms for clustering data
Vector quantization and signal compression
Vector quantization and signal compression
Elements of information theory
Elements of information theory
Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval
Parallel Optimization: Theory, Algorithms and Applications
Parallel Optimization: Theory, Algorithms and Applications
Relative Expected Instantaneous Loss Bounds
COLT '00 Proceedings of the Thirteenth Annual Conference on Computational Learning Theory
Feature Weighting in k-Means Clustering
Machine Learning
Journal of Logic, Language and Information
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions
The Journal of Machine Learning Research
A divisive information theoretic feature clustering algorithm for text classification
The Journal of Machine Learning Research
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
An information theoretic analysis of maximum likelihood mixture estimation for exponential families
ICML '04 Proceedings of the twenty-first international conference on Machine learning
IEEE Transactions on Information Theory
IEEE Transactions on Information Theory
On the optimality of conditional expectation as a Bregman predictor
IEEE Transactions on Information Theory
Model-based overlapping clustering
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Streaming and sublinear approximation of entropy and information distances
SODA '06 Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm
On approximating the smallest enclosing Bregman Balls
Proceedings of the twenty-second annual symposium on Computational geometry
ICML '06 Proceedings of the 23rd international conference on Machine learning
Unsupervised learning on k-partite graphs
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
IEEE Transactions on Pattern Analysis and Machine Intelligence
Creating probabilistic databases from information extraction models
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Inference and evaluation of the multinomial mixture model for text clustering
Information Processing and Management: an International Journal
Visualizing bregman voronoi diagrams
SCG '07 Proceedings of the twenty-third annual symposium on Computational geometry
A Unified Continuous Optimization Framework for Center-Based Clustering Methods
The Journal of Machine Learning Research
Predictive discrete latent factor models for large scale dyadic data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A probabilistic framework for relational clustering
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
SODA '07 Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms
On the smallest enclosing information disk
Information Processing Letters
Clustering for metric and non-metric distance measures
Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
On some entropy functionals derived from Rényi information divergence
Information Sciences: an International Journal
Bregman bubble clustering: A robust framework for mining dense clusters
ACM Transactions on Knowledge Discovery from Data (TKDD)
Fast nearest neighbor retrieval for bregman divergences
Proceedings of the 25th international conference on Machine learning
A reproducing kernel Hilbert space framework for pairwise time series distances
Proceedings of the 25th international conference on Machine learning
Proceedings of the 25th international conference on Machine learning
A decoupled approach to exemplar-based unsupervised learning
Proceedings of the 25th international conference on Machine learning
Relational learning via collective matrix factorization
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
SAIL: summation-based incremental learning for information-theoretic clustering
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Mixed Bregman Clustering with Approximation Guarantees
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A Unified View of Matrix Factorization Models
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Hierarchical clustering-based navigation of image search results
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Bregman Divergences and the Self Organising Map
IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
Clustering with Feature Order Preferences
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Coresets and approximate clustering for Bregman divergences
SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
A new method for hierarchical clustering combination
Intelligent Data Analysis
Clustering Multivariate Normal Distributions
Emerging Trends in Visual Computing
Intrinsic Geometries in Learning
Emerging Trends in Visual Computing
On vector averaging over the unit hypersphere
Digital Signal Processing
Anomaly detection using manifold embedding and its applications in transportation corridors
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Surrogate regret bounds for proper losses
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Bayesian inference for nonnegative matrix factorisation models
Computational Intelligence and Neuroscience
Technical opinion: Steering self-learning distance algorithms
Communications of the ACM - Scratch Programming for All
Sublinear estimation of entropy and information distances
ACM Transactions on Algorithms (TALG)
Cost-sensitive learning based on Bregman divergences
Machine Learning
Bregman divergences in the (m×k)-partitioning problem
Computational Statistics & Data Analysis
Adaptive fuzzy filtering in a deterministic setting
IEEE Transactions on Fuzzy Systems
The global kernel k-means algorithm for clustering in feature space
IEEE Transactions on Neural Networks
High-dimensional statistical measure for region-of-interest tracking
IEEE Transactions on Image Processing
On-line evolutionary exponential family mixture
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Sided and symmetrized Bregman centroids
IEEE Transactions on Information Theory
Similarity search on Bregman divergence: towards non-metric indexing
Proceedings of the VLDB Endowment
Convex Mixture Models for Multi-view Clustering
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part II
Worst-Case and Smoothed Analysis of k-Means Clustering with Bregman Divergences
ISAAC '09 Proceedings of the 20th International Symposium on Algorithms and Computation
Bregman vantage point trees for efficient nearest neighbor queries
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
α-divergence is unique, belonging to both f-divergence and Bregman divergence classes
IEEE Transactions on Information Theory
Variational Bayesian mixture model on a subspace of exponential family distributions
IEEE Transactions on Neural Networks
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
Kullback Leibler divergence based curve matching method
SSVM'07 Proceedings of the 1st international conference on Scale space and variational methods in computer vision
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Sparse signal recovery with exponential-family noise
Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Klee sets and Chebyshev centers for the right Bregman distance
Journal of Approximation Theory
Approximation algorithms for tensor clustering
ALT'09 Proceedings of the 20th international conference on Algorithmic learning theory
Simplifying mixture models through function approximation
IEEE Transactions on Neural Networks
Clustering for metric and nonmetric distance measures
ACM Transactions on Algorithms (TALG)
Unifying dependent clustering and disparate clustering for non-homogeneous data
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
On community outliers and their efficient detection in information networks
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Quantization and clustering with Bregman divergences
Journal of Multivariate Analysis
SCOAL: A framework for simultaneous co-clustering and learning from complex data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Clustering with feature order preferences
Intelligent Data Analysis - Artificial Intelligence
Tensor sparse coding for region covariances
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Divergence based online learning in vector quantization
ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
Probabilistic latent tensor factorization
LVA/ICA'10 Proceedings of the 9th international conference on Latent variable analysis and signal separation
Extending metric multidimensional scaling with Bregman divergences
Pattern Recognition
Multiple view clustering using a weighted combination of exemplar-based mixture models
IEEE Transactions on Neural Networks
Extending metric multidimensional scaling with bregman divergences
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
Independent component analysis using bregman divergences
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
The Journal of Machine Learning Research
Global Minimization for Continuous Multiphase Partitioning Problems Using a Dual Approach
International Journal of Computer Vision
Mixed-membership naive Bayes models
Data Mining and Knowledge Discovery
Neurocomputing
Divergence-based vector quantization
Neural Computation
Optimality and stability of the K-hyperline clustering algorithm
Pattern Recognition Letters
Schema-as-you-go: on probabilistic tagging and querying of wide tables
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Transfer latent variable model based on divergence analysis
Pattern Recognition
Information, Divergence and Risk for Binary Experiments
The Journal of Machine Learning Research
Proceedings of the 2011 workshop on Data mining for medicine and healthcare
Smoothed Analysis of the k-Means Method
Journal of the ACM (JACM)
C3E: a framework for combining ensembles of classifiers and clusterers
MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Pattern change discovery between high dimensional data sets
Proceedings of the 20th ACM international conference on Information and knowledge management
Semi-Supervised Learning with Measure Propagation
The Journal of Machine Learning Research
Extending Sammon mapping with Bregman divergences
Information Sciences: an International Journal
Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Compressed Histogram of Gradients: A Low-Bitrate Descriptor
International Journal of Computer Vision
The mathematics of divergence based online learning in vector quantization
ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Bregman clustering for separable instances
SWAT'10 Proceedings of the 12th Scandinavian conference on Algorithm Theory
Levels of details for gaussian mixture models
ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part II
Graph based k-means clustering
Signal Processing
Language modelling of constraints for text clustering
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Approximate bregman near neighbors in sublinear time: beyond the triangle inequality
Proceedings of the twenty-eighth annual symposium on Computational geometry
A geometric view of conjugate priors
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Mining temporal patterns in popularity of web items
Information Sciences: an International Journal
Objective function-based clustering
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Fast bregman divergence NMF using taylor expansion and coordinate descent
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Algorithmic superactivation of asymptotic quantum capacity of zero-capacity quantum channels
Information Sciences: an International Journal
Geometry preserving multi-task metric learning
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
A new distance for probability measures based on the estimation of level sets
ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
Identifying anomalous social contexts from mobile proximity data using binomial mixture models
IDA'12 Proceedings of the 11th international conference on Advances in Intelligent Data Analysis
Query clustering based on bid landscape for sponsored search auction optimization
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
A predictive model for advertiser value-per-click in sponsored search
Proceedings of the 22nd international conference on World Wide Web
Localized matrix factorization for recommendation based on matrix block diagonal forms
Proceedings of the 22nd international conference on World Wide Web
A Bregman extension of quasi-Newton updates II: Analysis of robustness properties
Journal of Computational and Applied Mathematics
ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on ACM SIGKDD 2012
Retargeted matrix factorization for collaborative filtering
Proceedings of the 7th ACM conference on Recommender systems
How to "alternatize" a clustering algorithm
Data Mining and Knowledge Discovery
Geometry preserving multi-task metric learning
Machine Learning
CUDIA: Probabilistic cross-level imputation using individual auxiliary information
ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Coupling Image Restoration and Segmentation: A Generalized Linear Model/Bregman Perspective
International Journal of Computer Vision
Tappan Zee (north) bridge: mining memory accesses for introspection
Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security
Pattern learning and recognition on statistical manifolds: an information-geometric review
SIMBAD'13 Proceedings of the Second international conference on Similarity-Based Pattern Recognition
Information-Theoretic dissimilarities for graphs
SIMBAD'13 Proceedings of the Second international conference on Similarity-Based Pattern Recognition
Fast learning of gamma mixture models with k-MLE
SIMBAD'13 Proceedings of the Second international conference on Similarity-Based Pattern Recognition
An efficient and scalable family of algorithms for combining clusterings
Engineering Applications of Artificial Intelligence
Hartigan's K-means versus Lloyd's K-means: is it time for a change?
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Weighted ensemble of algorithms for complex data clustering
Pattern Recognition Letters
Bayesian model diagnostics using functional Bregman divergence
Journal of Multivariate Analysis
Poisson Noise Reduction with Non-local PCA
Journal of Mathematical Imaging and Vision
Hi-index | 0.12 |
A wide variety of distortion functions, such as squared Euclidean distance, Mahalanobis distance, Itakura-Saito distance and relative entropy, have been used for clustering. In this paper, we propose and analyze parametric hard and soft clustering algorithms based on a large class of distortion functions known as Bregman divergences. The proposed algorithms unify centroid-based parametric clustering approaches, such as classical kmeans, the Linde-Buzo-Gray (LBG) algorithm and information-theoretic clustering, which arise by special choices of the Bregman divergence. The algorithms maintain the simplicity and scalability of the classical kmeans algorithm, while generalizing the method to a large class of clustering loss functions. This is achieved by first posing the hard clustering problem in terms of minimizing the loss in Bregman information, a quantity motivated by rate distortion theory, and then deriving an iterative algorithm that monotonically decreases this loss. In addition, we show that there is a bijection between regular exponential families and a large class of Bregman divergences, that we call regular Bregman divergences. This result enables the development of an alternative interpretation of an efficient EM scheme for learning mixtures of exponential family distributions, and leads to a simple soft clustering algorithm for regular Bregman divergences. Finally, we discuss the connection between rate distortion theory and Bregman clustering and present an information theoretic analysis of Bregman clustering algorithms in terms of a trade-off between compression and loss in Bregman information.