Using shallow natural language processing in a just-in-time information retrieval assistant for bloggers

Authors:
Ang Gao;Derek Bridge
Affiliations:
Department of Computer Science, University College Cork, Ireland;Department of Computer Science, University College Cork, Ireland
Venue:
AICS'09 Proceedings of the 20th Irish conference on Artificial intelligence and cognitive science
Year:
2009

Citing 5
Cited 0

Exploring the Web with reconnaissance agents

Communications of the ACM
Query-free news search

WWW '03 Proceedings of the 12th international conference on World Wide Web
Just-in-time information retrieval

Just-in-time information retrieval
Using Physical Context for Just-in-Time Information Retrieval

IEEE Transactions on Computers
Introduction to Information Retrieval

Introduction to Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Just-In-Time Information Retrieval agents proactively retrieve information based on queries that are implicit in, and formulated from, the user's current context, such as the blogpost she is writing. This paper compares five heuristics by which queries can be extracted from a user's blogpost or other document. Four of the heuristics use shallow Natural Language Processing techniques, such as tagging and chunking. An experimental evaluation reveals that most of them perform as well as a heuristic based on term weighting. In particular, extracting noun phrases after chunking is one of the more successful heuristics and can have lower costs than term weighting. In a trial with real users, we find that relevant results have higher rank when we use implicit queries produced by this chunking heuristic than when we use explicit user-formulated queries.