International Journal of Human-Computer Studies
Browsing the underdeveloped Web: An experiment on the Arabic Medical Web Directory
Journal of the American Society for Information Science and Technology
Building a directory for the underdeveloped web: an experiment on the Arabic medical web directory
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Hi-index | 0.00 |
There are enormous amount of web pages in the world. Retrieval of required information from the WWW is thus an arduous task. Different models for retrieving web pages have been used by the WWW community. One of the most widely used model is by traversing a predefined web directory hierarchy to reach a user's goal. The web directories are compiled or classified folders of web pages and are usually organized into a hierarchical structure. The classificationof web pages into proper directories and the organization of directory hierarchies are generally performed by human experts. In this work, we provide a method to apply a kind of text mining techniques on a set of web pages to automatically create web directories and organize them into hierarchies. The method is based on the self-organizing map learning algorithm and requires no human intervention during the construction of web directories and hierarchies. Theexperiments show that our method can produce comprehensible and reasonable web directories and hierarchies.