Word weighting based on user's browsing history

Authors:
Yutaka Matsuo
Affiliations:
National Institute of Advance Industrial Science and Technology
Venue:
UM'03 Proceedings of the 9th international conference on User modeling
Year:
2003

Citing 6
Cited 2

Automatic text processing

Automatic text processing
WebMate: a personal agent for browsing and searching

AGENTS '98 Proceedings of the second international conference on Autonomous agents
WebACE: a Web agent for document categorization and exploration

AGENTS '98 Proceedings of the second international conference on Autonomous agents
The feature quantity: an information theoretic perspective of Tfidf-like measures

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Who Do You Want to Be Today? Web Personae for Personalised Information Access

AH '02 Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems
Letizia: an agent that assists web browsing

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1

Usability engineering for the adaptive web

The adaptive web
Keeping keywords fresh: a BM25 variation for personalized keyword extraction

Proceedings of the 2nd Temporal Web Analytics Workshop

Quantified Score

Hi-index	0.00

Visualization

Abstract

We developed a word-weighting algorithm based on the information access history of a user. The information access history of a user is represented as a set of words, and is considered to be a user model. We weight words in a document according to their relevancy to the user model.The relevancy is measured by the biases of co-occurrence, called IRM(Interest Relevance Measure), between a word in a document and words in the user model. We evaluate IRM through a constructed browsing support system, which monitors word occurrences on the user's browsed Web pages and highlights keywords in the current page. Our system consists of three components: a proxy server that monitors access to the Web, a frequency server that stores the frequencies of words appearing on the accessed Web pages, and a keyword extraction module.