Content analysis of museum documentation with a transdisciplinary perspective

Authors:
Günther Goerz;Martin Scholz
Affiliations:
University of Erlangen-Nuremberg, Erlangen, Germany;University of Erlangen-Nuremberg, Erlangen, Germany
Venue:
LaTeCH-SHELT&R '09 Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education
Year:
2009

Citing 3
Cited 0

Detecting events with date and place information in unstructured text

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Text Encoding Initiative: Background and Contexts

Text Encoding Initiative: Background and Contexts
The CIDOC conceptual reference module: an ontological approach to semantic interoperability of metadata

AI Magazine

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many cases, museum documentation consists of semi-structured data records with free text fields, which usually refer to contents of other fields, in the same data record, as well as in others. Most of these references comprise of person and place names, as well as time specifications. It is, therefore, important to recognize those in the first place. We report on techniques and results of partial parsing in an ongoing project, using a large database on German goldsmith art. The texts are encoded according to the TEI guidelines and expanded by structured descriptions of named entities and time specifications. These are building blocks for event descriptions, at which the next step is aiming. The identification of named entities allows the data to be linked with various resources within the domain of cultural heritage and beyond. For the latter case, we refer to a biological database and present a solution in a transdisciplinary perspective by means of the CIDOC Conceptual Reference Model (CRM).