Stuff I've seen: a system for personal information retrieval and re-use
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Combining document representations for known-item search
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A study of smoothing methods for language models applied to information retrieval
ACM Transactions on Information Systems (TOIS)
Simple BM25 extension to multiple weighted fields
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Combining the language model and inference network approaches to retrieval
Information Processing and Management: an International Journal - Special issue: Bayesian networks and information retrieval
Connections: using context to enhance file search
Proceedings of the twentieth ACM symposium on Operating systems principles
Fast, flexible filtering with phlat
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Towards task-based personal information management evaluations
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Building simulated queries for known-item topics: an analysis using six european languages
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
On ranking techniques for desktop search
ACM Transactions on Information Systems (TOIS)
Using provenance to aid in personal file search
ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
A generative retrieval model for structured documents
Proceedings of the 17th ACM conference on Information and knowledge management
Extracting structured information from user queries with semi-supervised conditional random fields
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Building a desktop search test-bed
ECIR'07 Proceedings of the 29th European conference on IR research
Applying maximum entropy to known-item email retrieval
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Proceedings of the 19th international conference on World wide web
Ranking using multiple document types in desktop search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
What makes re-finding information difficult? a study of email re-finding
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Seeding simulated queries with user-study data for personal search evaluation
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Cognitive processes in query generation
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
Efficiency optimizations for interpolating subqueries
Proceedings of the 20th ACM international conference on Information and knowledge management
Workshop on evaluating personal search
ACM SIGIR Forum
Evaluating search in personal social media collections
Proceedings of the fifth ACM international conference on Web search and data mining
A field relevance model for structured document retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Evaluating personal information retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Generating queries from user-selected text
Proceedings of the 4th Information Interaction in Context Symposium
Towards realistic known-item topics for the ClueWeb
Proceedings of the 4th Information Interaction in Context Symposium
Understanding book search behavior on the web
Proceedings of the 21st ACM international conference on Information and knowledge management
Generating pseudo test collections for learning to rank scientific articles
CLEF'12 Proceedings of the Third international conference on Information Access Evaluation: multilinguality, multimodality, and visual analytics
Pseudo test collections for training and tuning microblog rankers
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
Desktop search is an important part of personal information management (PIM). However, research in this area has been limited by the lack of shareable test collections, making cumulative progress difficult. In this paper, we define desktop search as a semi-structured document retrieval problem and introduce a methodology to automatically build a reusable collection (the pseudo-desktop) that has many of the same properties as a real desktop collection. We then present a comprehensive evaluation of retrieval methods for semi-structured document retrieval on several pseudo-desktop collections and the TREC Enterprise collection. Our results show that a probabilistic retrieval model using the mapping relation between a query term and a document field (PRM-S) has the best performance in collections with more structure, such as email, and that the query-likelihood language model is better for other document types. We further analyze the observed differences using generated queries and suggest ways to improve PRM-S, which makes the performance gains more significant and consistent.