Discovering User Interests from Web Browsing Behavior: An Application to Internet News Services
HICSS '02 Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 7 - Volume 7
Extracting noun phrases from large-scale texts: a hybrid approach and its automatic evaluation
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Finding advertising keywords on web pages
Proceedings of the 15th international conference on World Wide Web
Hi-index | 0.00 |
This paper investigates the application of data-mining techniques on a user's browsing history for the purpose of determining the user's interests. More specifically, a system is outlined that attempts to determine certain keywords that a user may or may not be interested in. This is done by first applying a term-frequency/inverse-document frequency filter to extract keywords from webpages in the user's history, after which a Self-Organizing Map (SOM) neural network is utilized to determine if these keywords are of interest to the user. Such a system could enable web-browsers to highlight areas of web pages that may be of higher interest to the user. It is found that while the system is indeed successful in identifying many keywords of user-interest, it also misclassifies many uninteresting words boasting only a 62% accuracy rate.