Search your interests everywhere!: wikipedia-based keyphrase extraction from web browsing history

Authors:
Mitsumasa Kondo;Akimichi Tanaka;Tadasu Uchiyama
Affiliations:
NTT Corporation, Hikarinooka, Yokosuka-shi, Japan;NTT Corporation, Hikarinooka, Yokosuka-shi, Japan;NTT Corporation, Hikarinooka, Yokosuka-shi, Japan
Venue:
Proceedings of the 21st ACM conference on Hypertext and hypermedia
Year:
2010

Citing 2
Cited 1

Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Extracting key terms from noisy and multitheme documents

Proceedings of the 18th international conference on World wide web

Keeping keywords fresh: a BM25 variation for personalized keyword extraction

Proceedings of the 2nd Temporal Web Analytics Workshop

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a method that can extract user interests from the user's Web browsing history. Our method allows easy access to multiple content domains such as blogs, movies, QA sites, etc. since the user does not need to input a separate search query in each domain/site. To extract user interests, the method first extracts candidate keyphrases from the user's web browsed documents. Second, important keyphrases obtained from a link structure analysis of Wikipedia content is extracted from the main contents of web documents. This technique is based on the idea that important keyphrases in Wikipedia are important keyphrases in the real world. Finally, keyphrases contained in the documents important to the user are set in order as user interests. An experiment shows that our method offers improvements over a conventional method and can recommend interests attractive to the user.