Finding a catalog: generating analytical catalog records from well-structured digital texts

  • Authors:
  • David Mimno;Alison Jones;Gregory Crane

  • Affiliations:
  • Tufts University, Medford, MA;Tufts University, Medford, MA;Tufts University, Medford, MA

  • Venue:
  • Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the criticisms library users often make of catalogs is that they rarely include information below the bibliographic level. It is generally impossible to search a catalog for the titles and subjects of particular chapters or volumes. There has been no way to add this information to catalog records without exponentially increasing the workload of catalogers. At the same time, well-structured full-text XML transcriptions of printed works are becoming increasingly available. This paper describes how existing investments in full text digitization and structural markup combined with current named-entity extraction technology can efficiently generate the detailed level of catalog data that users want, at no significant additional cost. This system is demonstrated on an existing digital collection within the Perseus Digital Library.