Machine Learning
Enhanced hypertext categorization using hyperlinks
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Automating the Construction of Internet Portals with Machine Learning
Information Retrieval
A Study of Approaches to Hypertext Categorization
Journal of Intelligent Information Systems
Composite Kernels for Hypertext Categorisation
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Learning to Probabilistically Identify Authoritative Documents
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Document clustering based on non-negative matrix factorization
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
FOCS '01 Proceedings of the 42nd IEEE symposium on Foundations of Computer Science
Multi-label informed latent semantic indexing
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: structural re-ranking using links induced by language models
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Learning from labeled and unlabeled data on a directed graph
ICML '05 Proceedings of the 22nd international conference on Machine learning
Linear prediction models with graph regularization for web-page categorization
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Discriminative probabilistic models for relational data
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Web document clustering using hyperlink structures
Computational Statistics & Data Analysis
Learning multiple graphs for document recommendations
Proceedings of the 17th international conference on World Wide Web
Classifiers without borders: incorporating fielded text from neighboring web pages
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Relational learning via collective matrix factorization
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A Unified View of Matrix Factorization Models
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Classifying networked entities with modularity kernels
Proceedings of the 17th ACM conference on Information and knowledge management
Learning latent semantic relations from clickthrough data for query suggestion
Proceedings of the 17th ACM conference on Information and knowledge management
SoRec: social recommendation using probabilistic matrix factorization
Proceedings of the 17th ACM conference on Information and knowledge management
Probabilistic polyadic factorization and its application to personalized recommendation
Proceedings of the 17th ACM conference on Information and knowledge management
Summarization of social activity over time: people, actions and concepts in dynamic networks
Proceedings of the 17th ACM conference on Information and knowledge management
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
Effective latent space graph-based re-ranking model with global consistency
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Extracting community structure through relational hypergraphs
Proceedings of the 18th international conference on World wide web
Using Link-Based Content Analysis to Measure Document Similarity Effectively
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Heterogeneous source consensus learning via decision propagation and negotiation
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
MetaFac: community discovery via relational hypergraph factorization
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Combining link and content for community detection: a discriminative approach
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
iOLAP: A framework for analyzing the internet, social networks, and other networked data
IEEE Transactions on Multimedia - Special section on communities and media computing
Relation regularized matrix factorization
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Weighted Nonnegative Matrix Co-Tri-Factorization for Collaborative Prediction
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multi-modality in one-class classification
Proceedings of the 19th international conference on World wide web
A Bayesian framework for community detection integrating content and link
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Predicting labels for dyadic data
Data Mining and Knowledge Discovery
Mining mood-specific movie similarity with matrix factorization for context-aware recommendation
Proceedings of the Workshop on Context-Aware Movie Recommendation
Multi-modal multi-correlation person-centric news retrieval
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Adaptive combination of tag and link-based user similarity in flickr
Proceedings of the international conference on Multimedia
Directed graph learning via high-order co-linkage analysis
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
Community Discovery via Metagraph Factorization
ACM Transactions on Knowledge Discovery from Data (TKDD)
Probabilistic matrix factorization leveraging contexts for unsupervised relation extraction
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Combining file content and file relations for cloud based malware detection
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Multi-view transfer learning with a large margin approach
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Link prediction via matrix factorization
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Bayesian matrix co-factorization: variational algorithm and Cramér-Rao bound
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Temporal link prediction by integrating content and structure information
Proceedings of the 20th ACM international conference on Information and knowledge management
Towards feature selection in network
Proceedings of the 20th ACM international conference on Information and knowledge management
Discovering multirelational structure in social media streams
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
On clustering heterogeneous social media objects with outlier links
Proceedings of the fifth ACM international conference on Web search and data mining
Matrix co-factorization on compressed sensing
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Generalized latent factor models for social network analysis
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
User community discovery from multi-relational networks
Decision Support Systems
Exploiting homophily effect for trust prediction
Proceedings of the sixth ACM international conference on Web search and data mining
Connecting comments and tags: improved modeling of social tagging systems
Proceedings of the sixth ACM international conference on Web search and data mining
Transforming graph data for statistical relational learning
Journal of Artificial Intelligence Research
Document Re-ranking Using Partial Social Tagging
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Efficient community detection in large networks using content and links
Proceedings of the 22nd international conference on World Wide Web
Pre-release box-office success prediction for motion pictures
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Social trust prediction using heterogeneous networks
ACM Transactions on Knowledge Discovery from Data (TKDD)
CALA: An unsupervised URL-based web page classification system
Knowledge-Based Systems
Hi-index | 0.00 |
The world wide web contains rich textual contents that areinterconnected via complex hyperlinks. This huge database violates the assumption held by most of conventional statistical methods that each web page is considered as an independent and identical sample. It is thus difficult to apply traditional mining or learning methods for solving web mining problems, e.g., web page classification, by exploiting both the content and the link structure. The research in this direction has recently received considerable attention but are still in an early stage. Though a few methods exploit both the link structure or the content information, some of them combine the only authority information with the content information, and the others first decompose the link structure into hub and authority features, then apply them as additional document features. Being practically attractive for its great simplicity, this paper aims to design an algorithm that exploits both the content and linkage information, by carrying out a joint factorization on both the linkage adjacency matrix and the document-term matrix, and derives a new representation for web pages in a low-dimensional factor space, without explicitly separating them as content, hub or authority factors. Further analysis can be performed based on the compact representation of web pages. In the experiments, the proposed method is compared with state-of-the-art methods and demonstrates an excellent accuracy in hypertext classification on the WebKB and Cora benchmarks.