In web classification, web pages from one or more web sites are assigned to pre-defined categories according to their content. Because web pages are more than plain text documents, web classification methods have to consider other context features of web pages, such as hyperlinks and HTML tags. In this paper, we propose using Support Vector Machine (SVM) classifiers to classify web pages based on both their text and context feature sets. We evaluated our web classification method on the WebKB data set. Compared with the earlier Foil-Pilfs method on the same data set, our method performs very well. We also show that using context features, especially hyperlinks, can improve classification performance significantly.
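To make the idea of separate text and context feature sets concrete, here is a minimal sketch of how one page might be turned into a single bag of features. It is not the paper's feature definition: the choice of `<title>` and `<a>` as context sources, the `title:`/`anchor:`/`text:` prefixes, and the names `PageFeatureExtractor` and `extract_features` are all assumptions made for illustration. The resulting counts could then be vectorized and passed to any SVM implementation.

```python
import re
from collections import Counter
from html.parser import HTMLParser

TOKEN_RE = re.compile(r"[a-z0-9]+")


class PageFeatureExtractor(HTMLParser):
    """Counts words, prefixed by the HTML context they occur in (hypothetical scheme)."""

    # Assumption: only <title> and <a> are treated as context sources here.
    CONTEXT_PREFIX = {"title": "title", "a": "anchor"}

    def __init__(self):
        super().__init__()
        self.features = Counter()
        self._open = []  # stack of currently open context tags

    def handle_starttag(self, tag, attrs):
        if tag in self.CONTEXT_PREFIX:
            self._open.append(tag)

    def handle_endtag(self, tag):
        if self._open and self._open[-1] == tag:
            self._open.pop()

    def handle_data(self, data):
        # Words inside a context tag become context features; the rest is plain text.
        prefix = self.CONTEXT_PREFIX[self._open[-1]] if self._open else "text"
        for token in TOKEN_RE.findall(data.lower()):
            self.features[f"{prefix}:{token}"] += 1


def extract_features(html):
    """Return a Counter mixing text features and context features for one page."""
    parser = PageFeatureExtractor()
    parser.feed(html)
    parser.close()
    return parser.features


if __name__ == "__main__":
    page = (
        "<html><head><title>CS Department</title></head>"
        "<body><p>Welcome to the department.</p>"
        "<a href='/faculty'>Faculty list</a></body></html>"
    )
    for name, count in sorted(extract_features(page).items()):
        print(name, count)
```

Keeping the prefixes distinct means a word such as "department" in the title and the same word in the body are different dimensions, so a classifier can weight them independently.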