Development of corpora within the CLaRK system: the BulTreeBank project experience

  • Authors:
  • Kiril Simov;Alexander Simov;Milen Kouylekov;Krasimira Ivanova;Ilko Grigorov;Hristo Ganev

  • Affiliations:
  • BulTreeBank Project, Sofia, Bulgaria;BulTreeBank Project, Sofia, Bulgaria;BulTreeBank Project, Sofia, Bulgaria;BulTreeBank Project, Sofia, Bulgaria;BulTreeBank Project, Sofia, Bulgaria;BulTreeBank Project, Sofia, Bulgaria

  • Venue:
  • EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

CLaRK is an XML-based software system for corpora development. It incorporates several technologies: XML technology; Unicode; Regular Cascaded Grammars; Constraints over XML Documents. The basic components of the system are: a tagger, a concordancer, an extractor, a grammar processor, a constraint engine.