Learning probabilistic models of link structure

Authors:
Lisa Getoor;Nir Friedman;Daphne Koller;Benjamin Taskar
Affiliations:
Computer Science Dept. and UMIACS, University of Maryland, College Park, MD;School of Computer Sci. & Eng., Hebrew University, Jerusalem, 91904, Israel;Computer Science Dept., Stanford University, Stanford, CA;Computer Science Dept., Stanford University, Stanford, CA
Venue:
The Journal of Machine Learning Research
Year:
2003

Citing 19
Cited 78

Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
Probabilistic Horn abduction and Bayesian networks

Artificial Intelligence
Enhanced hypertext categorization using hyperlinks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Learning to extract symbolic knowledge from the World Wide Web

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Probabilistic frame-based systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
A tutorial on learning with Bayesian networks

Learning in graphical models
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Inductive Logic Programming: Techniques and Applications

Inductive Logic Programming: Techniques and Applications
Automating the Construction of Internet Portals with Machine Learning

Information Retrieval
A Study of Approaches to Hypertext Categorization

Journal of Intelligent Information Systems
Discovering Test Set Regularities in Relational Domains

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Hypertext Categorization using Hyperlink Patterns and Meta Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Probabilistic Logic Programming and Bayesian Networks

ACSC '95 Proceedings of the 1995 Asian Computing Science Conference on Algorithms, Concurrency and Knowledge
Learning Probabilistic Relational Models

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Combining Statistical and Relational Methods for Learning in Hypertext Domains

ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Learning statistical models from relational data

Learning statistical models from relational data
Probabilistic classification and clustering in relational data

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Loopy belief propagation for approximate inference: an empirical study

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Discriminative probabilistic models for relational data

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence

Link mining: a new data mining challenge

ACM SIGKDD Explorations Newsletter
Leveraging relational autocorrelation with latent group models

MRDM '05 Proceedings of the 4th international workshop on Multi-relational mining
Dirichlet enhanced relational learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning from labeled and unlabeled data on a directed graph

ICML '05 Proceedings of the 22nd international conference on Machine learning
Higher-Order Web Link Analysis Using Multilinear Algebra

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
PRL: A probabilistic relational language

Machine Learning
Link mining: a survey

ACM SIGKDD Explorations Newsletter
Sampling algorithms for pure network topologies: a study on the stability and the separability of metric embeddings

ACM SIGKDD Explorations Newsletter
The case for anomalous link discovery

ACM SIGKDD Explorations Newsletter
Linear prediction models with graph regularization for web-page categorization

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Latent linkage semantic kernels for collective classification of link data

Journal of Intelligent Information Systems
Mining constraint violations

ACM Transactions on Database Systems (TODS)
Advances in optimization and prediction techniques: Real-world applications: Thesis

AI Communications
A dynamic ontology for a dynamic reference work

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Impact of social influence in e-commerce decision making

Proceedings of the ninth international conference on Electronic commerce
Applying link-based classification to label blogs

Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Automated social hierarchy detection through email network analysis

Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
ViSAGE: A Virtual Laboratory for Simulation and Analysis of Social Group Evolution

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Structured entity identification and document categorization: two tasks with one joint model

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Using ghost edges for classification in sparsely labeled networks

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Cost-sensitive learning with conditional Markov networks

Data Mining and Knowledge Discovery
Structured machine learning: the next ten years

Machine Learning
Structure Learning of Probabilistic Relational Models from Incomplete Relational Data

ECML '07 Proceedings of the 18th European conference on Machine Learning
Exploiting shared correlations in probabilistic databases

Proceedings of the VLDB Endowment
Classifying networked entities with modularity kernels

Proceedings of the 17th ACM conference on Information and knowledge management
A methodology for in-network evaluation of integrated logical-statistical models

Proceedings of the 6th ACM conference on Embedded network sensor systems
Towards Machine Learning on the Semantic Web

Uncertainty Reasoning for the Semantic Web I
Segmentation and Automated Social Hierarchy Detection through Email Network Analysis

Advances in Web Mining and Web Usage Analysis
Applying Link-Based Classification to Label Blogs

Advances in Web Mining and Web Usage Analysis
The Time-Series Link Prediction Problem with Applications in Communication Surveillance

INFORMS Journal on Computing
Combining link and content for community detection: a discriminative approach

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
An Inductive Logic Programming Approach to Statistical Relational Learning

Proceedings of the 2005 conference on An Inductive Logic Programming Approach to Statistical Relational Learning
Ranking community answers by modeling question-answer relationships via analogical reasoning

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Learning systems of concepts with an infinite relational model

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Generative Modeling by PRISM

ICLP '09 Proceedings of the 25th International Conference on Logic Programming
Get out the vote: determining support or opposition from congressional floor-debate transcripts

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Identifying graphs from noisy and incomplete data

Proceedings of the 1st ACM SIGKDD Workshop on Knowledge Discovery from Uncertain Data
Transforming between propositions and features: bridging the gap

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Logical generative models for probabilistic reasoning about existence, roles and identity

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Data clustering with a relational push-pull model

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Reflect and correct: A misclassification prediction approach to active inference

ACM Transactions on Knowledge Discovery from Data (TKDD)
Probabilistic Relational Models with Relational Uncertainty: An Early Study in Web Page Classification

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Learning coordination classifiers

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
PrDB: managing and exploiting rich correlations in probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
Learning Link-Based Naïve Bayes Classifiers from Ontology-Extended Distributed Data

OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part II
A Survey of Statistical Network Models

Foundations and Trends® in Machine Learning
Modeling parametric web arc weight measurement

ICCSA'07 Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III
Probabilistic inductive logic programming

Probabilistic inductive logic programming
Querying graphs with uncertain predicates

Proceedings of the Eighth Workshop on Mining and Learning with Graphs
Cold start link prediction

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Web page classification: a probabilistic model with relational uncertainty

IPMU'10 Proceedings of the Computational intelligence for knowledge-based systems design, and 13th international conference on Information processing and management of uncertainty
Identifying graphs from noisy and incomplete data

ACM SIGKDD Explorations Newsletter
Leveraging label-independent features for classification in sparsely labeled networks: an empirical study

SNAKDD'08 Proceedings of the Second international conference on Advances in social network mining and analysis
Learning algorithms for link prediction based on chance constraints

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Label-dependent feature extraction in social networks for node classification

SocInfo'10 Proceedings of the Second international conference on Social informatics
Constructing the Bayesian network structure from dependencies implied in multiple relational schemas

Expert Systems with Applications: An International Journal
A method of label-dependent feature extraction in social networks

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part II
Social Network Analysis and Mining for Business Applications

ACM Transactions on Intelligent Systems and Technology (TIST)
Costco: robust content and structure constrained clustering of networked documents

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Identity matching using personal and social identity features

Information Systems Frontiers
Learning statistical models from relational data

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Simultaneous clustering: a survey

PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
Inferring Networks of Diffusion and Influence

ACM Transactions on Knowledge Discovery from Data (TKDD)
Clustering scientific literature using sparse citation graph analysis

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Fisher kernels for relational data

ECML'06 Proceedings of the 17th European conference on Machine Learning
Combining contents and citations for scientific document classification

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Logical bayesian networks and their relation to other probabilistic logical models

ILP'05 Proceedings of the 15th international conference on Inductive Logic Programming
On the combination of logical and probabilistic models for information analysis

Applied Intelligence
Friendship prediction and homophily in social media

ACM Transactions on the Web (TWEB)
Microgroup mining on TSina via network structure and user attribute

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Lifted online training of relational models with stochastic gradient methods

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Validation of network classifiers

SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Enhanced spatiotemporal relational probability trees and forests

Data Mining and Knowledge Discovery
Transforming graph data for statistical relational learning

Journal of Artificial Intelligence Research
Estimating domain-based user influence in social networks

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Scalable text and link analysis with mixed-topic link models

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining collective intelligence in diverse groups

Proceedings of the 22nd international conference on World Wide Web
KnowRob: A knowledge processing infrastructure for cognition-enabled robots

International Journal of Robotics Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most real-world data is heterogeneous and richly interconnected. Examples include the Web, hypertext, bibliometric data and social networks. In contrast, most statistical learning methods work with "flat" data representations, forcing us to convert our data into a form that loses much of the link structure. The recently introduced framework of probabilistic relational models (PRMs) embraces the object-relational nature of structured data by capturing probabilistic interactions between attributes of related entities. In this paper, we extend this framework by modeling interactions between the attributes and the link structure itself. An advantage of our approach is a unified generative model for both content and relational structure. We propose two mechanisms for representing a probabilistic distribution over link structures: reference uncertainty and existence uncertainty. We describe the appropriate conditions for using each model and present learning algorithms for each. We present experimental results showing that the learned models can be used to predict link structure and, moreover, the observed link structure can be used to provide better predictions for the attributes in the model.