Search engines regularly crawl the web, taking vast snapshots of site content. Because previous crawls are not archived, however, search results pertain only to a single, recent instant in time. Search engine users are unable to request pages discussing UK politics in 2001, for example. The Internet Archive, an organization dedicated to maintaining such snapshots of the Internet, provides access to many previous web crawls but lacks a search facility. Users of the "Way Back Machine" must provide a specific URL for which they want a list of snapshots organized by date. This short paper describes Chronica, a temporal search engine that indexes Internet Archive crawl data in order to provide search results spanning user-specified time ranges. Chronica can generate graphs showing query result hit counts across a given time span, and even side-by-side comparisons of different query results. Among other uses, these graphs can track a term's popularity over time for marketing or academic research purposes.
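
The idea of searching within a user-specified time range and counting hits over time can be illustrated with a minimal sketch. It is not Chronica's actual implementation, which the abstract does not describe. The class `TemporalIndex`, its method names, and the example URLs are all hypothetical, and the sketch assumes each crawled snapshot is available as a URL, a crawl date, and plain text.

```python
from collections import defaultdict
from datetime import date


class TemporalIndex:
    """Toy inverted index: each term maps to a set of (crawl_date, url) postings.

    Hypothetical illustration only; not the data structure used by Chronica.
    """

    def __init__(self):
        self._postings = defaultdict(set)

    def add_snapshot(self, url, crawl_date, text):
        # Index every distinct lowercase token of this dated snapshot.
        for term in set(text.lower().split()):
            self._postings[term].add((crawl_date, url))

    def search(self, query, start, end):
        # Return snapshots that contain every query term and were
        # crawled within the inclusive range [start, end].
        terms = query.lower().split()
        if not terms:
            return []
        hits = set.intersection(*(self._postings.get(t, set()) for t in terms))
        return sorted(h for h in hits if start <= h[0] <= end)

    def hits_per_year(self, query, start, end):
        # Count matching snapshots per crawl year, e.g. as input for a graph.
        counts = defaultdict(int)
        for crawl_date, _url in self.search(query, start, end):
            counts[crawl_date.year] += 1
        return dict(sorted(counts.items()))


index = TemporalIndex()
index.add_snapshot("http://example.org/a", date(2001, 6, 1), "UK politics election news")
index.add_snapshot("http://example.org/a", date(2003, 6, 1), "UK sports news")
index.add_snapshot("http://example.org/b", date(2001, 9, 1), "UK politics debate")

# Snapshots discussing UK politics in 2001.
results = index.search("uk politics", date(2001, 1, 1), date(2001, 12, 31))

# Hit counts for one term across a wider span.
trend = index.hits_per_year("uk", date(2000, 1, 1), date(2004, 12, 31))
```

Storing the crawl date in each posting lets the same index answer both range-restricted queries and per-period hit counts. Running `hits_per_year` for two different queries would give the raw numbers for a side-by-side comparison.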