Semantic document selection: historical research on collections that span multiple centuries

  • Authors:
  • Daan Odijk;Ork de Rooij;Maria-Hendrike Peetz;Toine Pieters;Maarten de Rijke;Stephen Snelders

  • Affiliations:
  • ISLA, University of Amsterdam, The Netherlands;ISLA, University of Amsterdam, The Netherlands;ISLA, University of Amsterdam, The Netherlands;Descartes Center for the History and Philosophy of the Sciences and the Humanities, Utrecht University, The Netherlands;ISLA, University of Amsterdam, The Netherlands;Descartes Center for the History and Philosophy of the Sciences and the Humanities, Utrecht University, The Netherlands

  • Venue:
  • TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The availability of digitized collections of historical data, such as newspapers, increases every day. With that, so does the wish for historians to explore these collections. Methods that are traditionally used to examine a collection do not scale up to today's collection sizes. We propose a method that combines text mining with exploratory search to provide historians with a means of interactively selecting and inspecting relevant documents from very large collections. We assess our proposal with a case study on a prototype system.