Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Document clustering using word clusters via the information bottleneck method
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Unsupervised document classification using sequential information maximization
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Biclustering of Expression Data
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Multivariate Information Bottleneck
UAI '01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence
Distributional word clusters vs. words for text categorization
The Journal of Machine Learning Research
A divisive information theoretic feature clustering algorithm for text classification
The Journal of Machine Learning Research
Information-theoretic co-clustering
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
A generalized maximum entropy approach to bregman co-clustering and matrix approximation
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Sequential information bottleneck for finite data
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Testing the significance of attribute interactions
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Biclustering Algorithms for Biological Data Analysis: A Survey
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Applying discrete PCA in data analysis
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Group and topic discovery from relations and text
Proceedings of the 3rd international workshop on Link discovery
Contextual search and name disambiguation in email using graphs
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Latent semantic analysis for multiple-type interrelated data objects
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
LinkClus: efficient clustering via heterogeneous semantic links
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Multi-task text segmentation and alignment based on weighted mutual information
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Topic segmentation with shared topic detection and alignment of multiple documents
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Shine: search heterogeneous interrelated entities
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Distributional Similarity Model for Multi-modality Clustering in Social Media
WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
Topic modeling with network regularization
Proceedings of the 17th international conference on World Wide Web
Hierarchical, Parameter-Free Community Discovery
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Data weaving: scaling up the state-of-the-art in data clustering
Proceedings of the 17th ACM conference on Information and knowledge management
Improving clustering stability with combinatorial MRFs
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
MetaFac: community discovery via relational hypergraph factorization
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Ranking-based clustering of heterogeneous information networks with star network schema
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Multiview clustering: a late fusion approach using latent models
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
TITPI: web people search task using semi-supervised clustering approach
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Interactive clustering of text collections according to a user-specified criterion
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
Joint group and topic discovery from relations and text
ICML'06 Proceedings of the 2006 conference on Statistical network analysis
The multi-view information bottleneck clustering
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Approximation algorithms for tensor clustering
ALT'09 Proceedings of the 20th international conference on Algorithmic learning theory
ACM Transactions on Information Systems (TOIS)
Graph regularized transductive classification on heterogeneous information networks
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Community Discovery via Metagraph Factorization
ACM Transactions on Knowledge Discovery from Data (TKDD)
A game theoretic framework for heterogenous information network clustering
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering multirelational structure in social media streams
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Information marginalization on subgraphs
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Combinatorial markov random fields
ECML'06 Proceedings of the 17th European conference on Machine Learning
An information-theoretic framework for high-order co-clustering of heterogeneous objects
ECML'06 Proceedings of the 17th European conference on Machine Learning
Incremental clustering of newsgroup articles
IEA/AIE'06 Proceedings of the 19th international conference on Advances in Applied Artificial Intelligence: industrial, Engineering and Other Applications of Applied Intelligent Systems
Tripartite community structure in social bookmarking data
The New Review of Hypermedia and Multimedia - Special issue on Social Linking and Hypermedia
Hi-index | 0.00 |
We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions between the types, as observed in co-occurrence data. In this scheme, multiple clustering systems are generated aiming at maximizing an objective function that measures multiple pairwise mutual information between cluster variables. To implement this idea, we propose an algorithm that interleaves top-down clustering of some variables and bottom-up clustering of the other variables, with a local optimization correction routine. Focusing on document clustering we present an extensive empirical study of two-way, three-way and four-way applications of our scheme using six real-world datasets including the 20 News-groups (20NG) and the Enron email collection. Our multi-way distributional clustering (MDC) algorithms consistently and significantly outperform previous state-of-the-art information theoretic clustering algorithms.