Automatic classification of data items from training samples can be improved by considering each item's neighborhood in a graph structure (e.g., neighboring documents in a hyperlink environment, or co-authors and their publications for bibliographic data entries). This paper presents a new method for graph-based classification, with particular emphasis on hyperlinked text documents but with broader applicability. Our approach is based on iterative relaxation labeling and can be combined with either Bayesian or SVM classifiers on the feature spaces of the given data items. The graph neighborhood is taken into account to exploit locality patterns while avoiding overfitting. In contrast to prior work along these lines, our approach employs a number of novel techniques: dynamically inferring the link/class pattern in the graph during the iterative relaxation labeling; judiciously pruning edges from the neighborhood graph based on node dissimilarities and node degrees; weighting the influence of edges by a distance metric between the classification labels of interest; and weighting edges by content similarity measures. Our techniques considerably improve the robustness and accuracy of the classification outcome, as shown in systematic experimental comparisons with previously published methods on three different real-world datasets.
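To make the general idea concrete, the following is a minimal sketch of relaxation labeling on a graph. It is not the method of this paper: it assumes a simple homophily-style update in which each node's class distribution is a mixture of its content-only estimate and the similarity-weighted average of its neighbors' current estimates. The function names (`prune_edges`, `relaxation_labeling`), the mixing parameter `alpha`, and the threshold `min_sim` are all hypothetical and chosen only for illustration.

```python
import numpy as np

def prune_edges(edges, weights, min_sim):
    """Drop edges whose content similarity is below min_sim (hypothetical threshold)."""
    kept = [(e, w) for e, w in zip(edges, weights) if w >= min_sim]
    return [e for e, _ in kept], [w for _, w in kept]

def relaxation_labeling(prior, edges, weights, labeled, n_iter=10, alpha=0.5):
    """Iteratively refine class probabilities using graph neighbors.

    prior   : (n, k) array, rows are class probabilities from a content-only
              classifier (e.g. Bayesian or SVM with probability outputs).
    edges   : list of undirected (i, j) node pairs.
    weights : list of edge weights, e.g. content similarity of the endpoints.
    labeled : dict mapping training-node index -> known class index.
    alpha   : assumed mixing weight between content and neighborhood evidence.
    """
    prior = np.asarray(prior, dtype=float)
    n, _ = prior.shape

    # Symmetric weighted adjacency matrix.
    W = np.zeros((n, n))
    for (i, j), w in zip(edges, weights):
        W[i, j] = w
        W[j, i] = w
    row_sum = W.sum(axis=1, keepdims=True)

    def clamp(p):
        # Training nodes keep their known label throughout.
        for node, c in labeled.items():
            p[node] = 0.0
            p[node, c] = 1.0

    probs = prior.copy()
    clamp(probs)
    for _ in range(n_iter):
        # Similarity-weighted average of the neighbors' current estimates;
        # isolated nodes fall back to their content-only estimate.
        neigh = np.where(row_sum > 0, (W @ probs) / np.maximum(row_sum, 1e-12), prior)
        probs = (1 - alpha) * prior + alpha * neigh
        probs /= probs.sum(axis=1, keepdims=True)
        clamp(probs)
    return probs
```

In this sketch, a test node whose content-only estimate slightly favors one class can be pulled toward the other when its strongly similar neighbors agree on it. Calling `prune_edges` first removes weak links so that dissimilar neighbors do not contribute at all.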