Search your interests everywhere!: wikipedia-based keyphrase extraction from web browsing history

  • Authors:
  • Mitsumasa Kondo;Akimichi Tanaka;Tadasu Uchiyama

  • Affiliations:
  • NTT Corporation, Hikarinooka, Yokosuka-shi, Japan;NTT Corporation, Hikarinooka, Yokosuka-shi, Japan;NTT Corporation, Hikarinooka, Yokosuka-shi, Japan

  • Venue:
  • Proceedings of the 21st ACM conference on Hypertext and hypermedia
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a method that can extract user interests from the user's Web browsing history. Our method allows easy access to multiple content domains such as blogs, movies, QA sites, etc. since the user does not need to input a separate search query in each domain/site. To extract user interests, the method first extracts candidate keyphrases from the user's web browsed documents. Second, important keyphrases obtained from a link structure analysis of Wikipedia content is extracted from the main contents of web documents. This technique is based on the idea that important keyphrases in Wikipedia are important keyphrases in the real world. Finally, keyphrases contained in the documents important to the user are set in order as user interests. An experiment shows that our method offers improvements over a conventional method and can recommend interests attractive to the user.