Usage of XSL stylesheets for the annotation of the Sámi language corpora

Authors:
Saara Huhmarniemi;Sjur N. Moshagen;Trond Trosterud
Affiliations:
University of Tromsø;Norwegian Sámi Parliament;University of Tromsø
Venue:
LAW '07 Proceedings of the Linguistic Annotation Workshop
Year:
2007

Citing 1
Cited 1

Building Minority Language Corpora by Learning to Generate Web Search Queries

Knowledge and Information Systems

Multiple level of referents in information state

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes an annotation system for Sámi language corpora, which consists of structured, running texts. The annotation of the texts is fully automatic, starting from the original documents in different formats. The texts are first extracted from the original documents preserving the original structural markup. The markup is enhanced by a document-specific XSLT script which contains document-specific formatting instructions. The overall maintenance is achieved by system-wide XSLT scripts.