Users of search engines have come to expect query-biased document snippets on results pages. In this paper we explore the algorithms and data structures a search engine requires to generate query-biased snippets efficiently. We begin by proposing and analysing a document compression method that reduces snippet generation time by 58% over a baseline using the zlib compression library. These experiments reveal that finding documents on secondary storage dominates the total cost of generating snippets, and so caching documents in RAM is essential for a fast snippet generation process. Using simulation, we examine snippet generation performance for RAM caches of different sizes. Finally, we propose and analyse document reordering and compaction, revealing a scheme that increases the number of document cache hits with only a marginal effect on snippet quality. This scheme effectively doubles the number of documents that can fit in a fixed-size cache.
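The interaction between compaction and cache hits can be illustrated with a small simulation. The following is a minimal sketch, not the simulation used in the paper: it assumes a byte-limited LRU cache, a toy cyclic request stream, and a hypothetical compaction step that halves every document. The function name `simulate_lru_hits` and all sizes are illustrative assumptions.

```python
from collections import OrderedDict


def simulate_lru_hits(requests, doc_sizes, capacity_bytes):
    """Count cache hits for a stream of document requests
    under a byte-limited LRU cache (an assumed policy)."""
    cache = OrderedDict()  # doc_id -> size in bytes, least recently used first
    used = 0
    hits = 0
    for doc_id in requests:
        if doc_id in cache:
            hits += 1
            cache.move_to_end(doc_id)  # mark as most recently used
            continue
        size = doc_sizes[doc_id]
        if size > capacity_bytes:
            continue  # document can never fit; it is always read from disk
        # Evict least recently used documents until the new one fits.
        while used + size > capacity_bytes:
            _, evicted_size = cache.popitem(last=False)
            used -= evicted_size
        cache[doc_id] = size
        used += size
    return hits


# Hypothetical workload: four 100-byte documents requested cyclically.
requests = [0, 1, 2, 3] * 5
full_sizes = {d: 100 for d in range(4)}
# Assumed compaction that keeps half of each document.
compact_sizes = {d: 50 for d in range(4)}

full_hits = simulate_lru_hits(requests, full_sizes, capacity_bytes=200)
compact_hits = simulate_lru_hits(requests, compact_sizes, capacity_bytes=200)
print(full_hits, compact_hits)
```

In this toy setting the 200-byte cache holds only two full documents, so the cyclic stream never hits, whereas all four compacted documents fit and every request after the first pass is a hit. Real query logs are far less adversarial, so the gain in practice depends on the request distribution.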