Segmentation Charts for Czech --- Relations among Segments in Complex Sentences

Authors:
Markéta Lopatková;Tomáš Holan
Affiliations:
Charles University in Prague, Czech Republic;Charles University in Prague, Czech Republic
Venue:
LATA '09 Proceedings of the 3rd International Conference on Language and Automata Theory and Applications
Year:
2009

Citing 7
Cited 1

Partial parsing via finite-state cascades

Natural Language Engineering
Full text parsing using cascades of rules: an information extraction perspective

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Cascaded Markov Models

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Exploring the role of punctuation in parsing natural text

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Dependency parsing of Japanese spoken monologue based on clause boundaries

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
On Parallel Communicating Grammar Systems and Correctness Preserving Restarting Automata

LATA '09 Proceedings of the 3rd International Conference on Language and Automata Theory and Applications
The best of two worlds: cooperation of statistical and rule-based taggers for Czech

ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies

Annotation of sentence structure

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Syntactic analysis of natural languages is the fundamental requirement of many applied tasks. We propose a new module between morphological and syntactic analysis that aims at determining the overall structure of a sentence prior to its complete analysis. We exploit a concept of segments, easily automatically detectable and linguistically motivated units. The output of the module, so-called `segmentation chart', describes the relationship among segments, especially relations of coordination and apposition or relation of subordination. In this text we present a framework that enables us to develop and test rules for automatic identification of segmentation charts. We describe two basic experiments --- an experiment with segmentation patterns obtained from the Prague Dependency Treebank and an experiment with the segmentation rules applied to plain text. Further, we discuss the evaluation measures suitable for our task.