Leveraging relational autocorrelation with latent group models

Authors:
Jennifer Neville;David Jensen
Affiliations:
University of Massachusetts Amherst, Amherst, MA;University of Massachusetts Amherst, Amherst, MA
Venue:
MRDM '05 Proceedings of the 4th international workshop on Multi-relational mining
Year:
2005

Citing 16
Cited 8

Enhanced hypertext categorization using hyperlinks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Learning to extract symbolic knowledge from the World Wide Web

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning probabilistic relational models

Relational Data Mining
Linkage and Autocorrelation Cause Feature Selection Bias in Relational Learning

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
A Machine Learning Approach to Building Domain-Specific Search Engines

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Stochastic link and group detection

Eighteenth national conference on Artificial intelligence
Latent dirichlet allocation

The Journal of Machine Learning Research
Learning probabilistic models of link structure

The Journal of Machine Learning Research
Simple Estimators for Relational Bayesian Classifiers

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Learning relational probability trees

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Why collective inference improves relational classification

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Cluster-based concept invention for statistical relational learning

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Dependency Networks for Relational Data

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Probabilistic classification and clustering in relational data

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Discriminative probabilistic models for relational data

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence

Distribution-based aggregation for relational learning with identifier attributes

Machine Learning
Relational learning via latent social dimensions

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Scalable learning of collective behavior based on sparse social dimensions

Proceedings of the 18th ACM conference on Information and knowledge management
Social network mining with nonparametric relational models

SNAKDD'08 Proceedings of the Second international conference on Advances in social network mining and analysis
A multi-resolution approach to learning with overlapping communities

Proceedings of the First Workshop on Social Media Analytics
Leveraging social media networks for classification

Data Mining and Knowledge Discovery
A tag-centric discriminative model for web objects classification

Proceedings of the 21st ACM international conference on Information and knowledge management
Competence region modelling in relational classification

ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

The presence of autocorrelation provides strong motivation for using relational techniques for learning and inference. Autocorrelation is a statistical dependency between the values of the same variable on related entities and is a nearly ubiquitous characteristic of relational data sets. Recent research has explored the use of collective inference techniques to exploit this phenomenon. These techniques achieve significant performance gains by modeling observed correlations among class labels of related instances, but the models fail to capture a frequent cause of autocorrelation---the presence of underlying groups that influence the attributes on a set of entities. We propose a latent group model (LGM) for relational data, which discovers and exploits the hidden structures responsible for the observed autocorrelation among class labels. Modeling the latent group structure improves model performance, increases inference efficiency, and enhances our understanding of the datasets. We evaluate performance on three relational classification tasks and show that LGM outperforms models that ignore latent group structure, particularly when there is little information with which to seed inference.