The information published on the web, a representation of our collective memory, is rapidly vanishing. At least 77 web archives have been developed to cope with the web's transience, but although their technology has reached a good level of maturity, the retrieval effectiveness of the search services they provide remains unsatisfactory. In this work, we propose an evaluation methodology for web archive search systems based on a list of requirements compiled from previous characterizations of web archives and their users. The methodology includes the design of a test collection and the selection of evaluation measures to support realistic and reproducible experiments. The test collection made it possible, for the first time, to measure the effectiveness of state-of-the-art IR technology employed in web archives. The results confirm the poor quality of the search results retrieved with such technology. However, we show how to combine temporal features with the regular topical features to improve search effectiveness on web archives. The test collection is available to the research community.
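As a rough illustration of combining temporal and topical features and then scoring the resulting ranking, the sketch below uses a simple weighted sum followed by nDCG. It is not the method or the measure used in this work: the `Candidate` fields, the 0.7 weight, the normalization to [0, 1], and the choice of nDCG are all assumptions made for the example.

```python
from dataclasses import dataclass
from math import log2


@dataclass
class Candidate:
    doc_id: str
    topical_score: float   # assumed: a topical score (e.g. BM25-style) scaled to [0, 1]
    temporal_score: float  # assumed: a hypothetical temporal feature scaled to [0, 1]


def combined_score(c, weight=0.7):
    """Weighted sum of topical and temporal evidence (weight is illustrative)."""
    return weight * c.topical_score + (1.0 - weight) * c.temporal_score


def rank(candidates, weight=0.7):
    """Return candidates sorted by combined score, best first."""
    return sorted(candidates, key=lambda c: combined_score(c, weight), reverse=True)


def ndcg_at_k(ranked_ids, relevance, k):
    """nDCG@k with linear gains; relevance maps doc_id -> graded judgment."""
    gains = [relevance.get(doc_id, 0) for doc_id in ranked_ids[:k]]
    dcg = sum(g / log2(i + 2) for i, g in enumerate(gains))
    ideal = sorted(relevance.values(), reverse=True)[:k]
    idcg = sum(g / log2(i + 2) for i, g in enumerate(ideal))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical data: document "b" is slightly weaker topically but stronger temporally.
candidates = [Candidate("a", 0.9, 0.1), Candidate("b", 0.8, 0.9)]
judgments = {"a": 1, "b": 2}

topical_only = [c.doc_id for c in rank(candidates, weight=1.0)]
with_temporal = [c.doc_id for c in rank(candidates, weight=0.7)]
```

On this toy input, the topical-only ranking places "a" first, while the combined ranking places "b" first and scores higher against the hypothetical judgments. In practice the weight would be tuned or learned on a test collection rather than fixed by hand.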