The case for browser provenance

  • Authors:
  • Daniel W. Margo; Margo Seltzer

  • Affiliations:
  • Harvard School of Engineering and Applied Sciences; Harvard School of Engineering and Applied Sciences

  • Venue:
  • TAPP'09 First workshop on Theory and practice of provenance
  • Year:
  • 2009

Abstract

In our increasingly networked world, web browsers are important applications. Originally an interface tool for accessing distributed documents, browsers have become ubiquitous, incorporating a significant portion of user interaction. A modern browser now also reads email, plays media, edits documents, and runs applications. Consequently, browsers process large quantities of data, and must record metadata, such as history, to help users manage their data. Most of the metadata that modern browsers record is actually provenance -- metadata that captures the causality and lineage of data obtained via the browser. We demonstrate that characterizing browser metadata as provenance and then applying techniques from the provenance research community enables new browser functionality. For example, provenance can improve both history and web search by indicating contextual and personal relationships between data items. Users can also answer complex questions about the origins of their data by querying provenance. Our initial results suggest these features are feasible to implement and could perform well in modern browsers.