Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Web classification using support vector machine
Proceedings of the 4th international workshop on Web information and data management
Combining Statistical and Relational Methods for Learning in Hypertext Domains
ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Detecting geographic locations from web resources
Proceedings of the 2005 workshop on Geographic information retrieval
Fast webpage classification using URL features
Proceedings of the 14th ACM international conference on Information and knowledge management
A component model for internet-scale applications
Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering
Categorizing web search results into meaningful and stable categories using fast-feature techniques
Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Blocking objectionable web content by leveraging multiple information sources
ACM SIGKDD Explorations Newsletter
Knowing a web page by the company it keeps
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Architecture of a grid-enabled Web search engine
Information Processing and Management: an International Journal
Authors vs. readers: a comparative study of document metadata and content in the www
Proceedings of the 2007 ACM symposium on Document engineering
Automatic Recognition of News Web Pages
PAISI, PACCF and SOCO '08 Proceedings of the IEEE ISI 2008 PAISI, PACCF, and SOCO international workshops on Intelligence and Security Informatics
Web page language identification based on URLs
Proceedings of the VLDB Endowment
Can all tags be used for search?
Proceedings of the 17th ACM conference on Information and knowledge management
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Framework for building a high-quality web page collection considering page group structure
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
A Comprehensive Study of Features and Algorithms for URL-Based Topic Classification
ACM Transactions on the Web (TWEB)
ESpotter: adaptive named entity recognition for web browsing
WM'05 Proceedings of the Third Biennial conference on Professional Knowledge Management
Design and implementation of an ontology algorithm for web documents classification
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part IV
Finding patterns in an unknown graph
AI Communications - The Symposium on Combinatorial Search
Multimedia Tools and Applications
A Comprehensive Study of Techniques for URL-Based Web Page Language Classification
ACM Transactions on the Web (TWEB)
Towards automatic assessment of government web sites
Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
Hi-index | 0.00 |
Uniform resource locators (URLs), which mark the address of a resource on the World Wide Web, are often human-readable and can hint at the category of the resource. This paper explores the use of URLs for webpage categorization via a two-phase pipeline of word segmentation/expansion and classification. We quantify its performance against document-based methods, which require the retrieval of the source document.