Information archiving with bookmarks: personal Web space construction and organization
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Automatic Web-Page Classification by Using Machine Learning Methods
WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
PEBL: Web Page Classification without Negative Examples
IEEE Transactions on Knowledge and Data Engineering
Hi-index | 0.00 |
Web browser bookmark files store records of web pages that the user would like to revisit. We use four methods to index and automatically classify documents referred to in 80 bookmark files, based on document title-only and full-text indexing and two clustering approaches. We evaluate the approaches by selecting a bookmark entry to classify from a bookmark file, re-creating a snapshot of the bookmark file to contain only entries created before the selected bookmark entry. The baseline algorithm is 39% accurate at rank 1 when the target category contains 7 entries. By fusing the recommendations of the 4 approaches, we reach 78.7% accuracy on average, recommending at most 3 categories.