Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
User interactions with everyday applications as context for just-in-time information access
Proceedings of the 5th international conference on Intelligent user interfaces
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
On Applying an Image Processing Technique to Detecting Spams
ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
Spam and the ongoing battle for the inbox
Communications of the ACM - Spam and the ongoing battle for the inbox
International Journal of Web and Grid Services
Web browsing behavior recording system
KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part IV
Tag recommendation for flickr using web browsing behavior
ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part II
Hi-index | 0.01 |
In this paper a spam filtering method is proposed. We focus on user behavior that most email users browse the Web. The method reduces troublesome maintenance of the spam filter, since the filter learns from Web browsing behavior in the background. The method uses Web browsing behavior of each user to learn ham words. Ham words are picked up from browsed Web pages using TF-IDF and stored in the database called ham words list. For each received email, the method extracts keywords from the email, including Web pages of the URLs. If some keywords are in the ham words list, the email is treated as a ham. In our experiments, several spam emails which cannot be detected by a Bayesian filter are detected as spams.