Syntactic clustering of the Web
Selected papers from the sixth international conference on World Wide Web
Bringing order to the Web: automatically categorizing search results
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Proceedings of the 11th international conference on World Wide Web
Web montage: a dynamic personalized start page
Proceedings of the 11th international conference on World Wide Web
Interface for WordNet Enrichment with Classification Systems
DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
The VLDB Journal — The International Journal on Very Large Data Bases
ScentTrails: Integrating browsing and searching on the Web
ACM Transactions on Computer-Human Interaction (TOCHI)
THESUS: Organizing Web document collections based on link semantics
The VLDB Journal — The International Journal on Very Large Data Bases
Lexical cohesion computed by thesaural relations as an indicator of the structure of text
Computational Linguistics
What's new on the web?: the evolution of the web from a search engine perspective
Proceedings of the 13th international conference on World Wide Web
Liveclassifier: creating hierarchical text classifiers through web corpora
Proceedings of the 13th international conference on World Wide Web
Proceedings of the 13th international conference on World Wide Web
HiBO: a system for automatically organizing bookmarks
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
DirectoryRank: ordering pages in web directories
Proceedings of the 7th annual ACM international workshop on Web information and data management
International Journal of Human-Computer Studies
An Ontology-Based Focused Crawler
NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Browsing the underdeveloped Web: An experiment on the Arabic Medical Web Directory
Journal of the American Society for Information Science and Technology
ICFCA '09 Proceedings of the 7th International Conference on Formal Concept Analysis
Building a directory for the underdeveloped web: an experiment on the Arabic medical web directory
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Hi-index | 0.02 |
Web Directories provide a way of locating relevant information on the Web. Typically, Web Directories rely on humans putting in significant time and effort into finding important pages on the Web and categorizing them in the Directory. In this paper we present a way for automating the creation of a Web Directory. At a high level, our method takes as input a subject hierarchy and a collection of pages. We first leverage a variety of lexical resources from the Natural Language Processing community to enrich our hierarchy. After that, we process the pages and identify sequences of important terms, which are referred to as lexical chains. Finally, we use the lexical chains in order to decide where in the enriched subject hierarchy we should assign every page. Our experimental results with real Web data show that our method is quite promising into assisting humans during page categorization.