Extracting relevant snippets for web navigation

Authors:
Qing Li;K. Selçuk Candan;Qi Yan
Affiliations:
Southwestern University of Finance and Economics, China;Arizona State University;Arizona State University
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Year:
2008

Citing 10
Cited 4

Subtopic structuring for full-length document access

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Passage-level evidence in document retrieval

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic text decomposition and structuring

Information Processing and Management: an International Journal
Passage retrieval revisited

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Effective ranking with arbitrary passages

Journal of the American Society for Information Science and Technology
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Passage retrieval based on language models

Proceedings of the eleventh international conference on Information and knowledge management
Text Segmentation by Topic

ECDL '97 Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries
CUTS: CUrvature-based development pattern analysis and segmentation for blogs and other Text Streams

Proceedings of the seventeenth conference on Hypertext and hypermedia

Relevant shape contour snippet extraction with metadata supported hidden Markov models

Proceedings of the ACM International Conference on Image and Video Retrieval
Editorial: Narrative-based taxonomy distillation for effective indexing of text collections

Data & Knowledge Engineering
Making results fit into 40 characters: a study in document rewriting

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Hive open research network platform

Proceedings of the 16th International Conference on Extending Database Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extracting variable length document snippets by real-time processing of documents using the query issued by the user. With this extra level of information, the returned snippets are considerably more informative. Unlike previous work on passage retrieval which relies on searching relevant segments for filtering of preoccupied passages, we focus on query-informed segmentation to extract context-aware relevant snippets with variable length. In particular, we show that, when informed through an appropriate relevance language model, curvature analysis and Hidden Markov model (HMM) based content segmentation techniques can facilitate to extract relevant document snippets.