Deterministic part-of-speech tagging with finite-state transducers
Computational Linguistics
FACILE: Classifying Texts Integrating Pattern Matching and Information Extraction
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Ein Werkzeug zur partiellen syntaktischen Analyse deutscher Textkorpora
Natural Language Processing and Speech Technology, Results of the 3rd KONVENS Conference
An information extraction core system for real world German text processing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
GETESS: Constructing a Linguistic Search Index for an Internet Search Engine
NLDB '00 Proceedings of the 5th International Conference on Applications of Natural Language to Information Systems-Revised Papers
Robustness beyond shallowness: incremental deep parsing
Natural Language Engineering
A Generic Finite State Compiler for Tagging Rules
Machine Translation
A cascaded finite-state parser for German
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
A stochastic topological parser for German
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Experiments in German noun chunking
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Annotating topological fields and chunks: and revising POS tags at the same time
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Extracting a domain-specific ontology from a corporate intranet
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
The automatic generation of formal annotations in a multimedia indexing and searching environment
HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Topological field chunking for German
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Annotating grammatical functions for German using finite-state cascades
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Topological field parsing of German
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Hi-index | 0.00 |
We present a divide-and-conquer strategy based on finite state technology for shallow parsing of real-world German texts. In a first phase only the topological structure of a sentence (i.e., verb groups, subclauses) are determined. In a second phase the phrasal grammars are applied to the contents of the different fields of the main and sub-clauses. Shallow parsing is supported by suitably configured preprocessing, including: morphological and on-line compound analysis, efficient POS-filtering, and named entity recognition. The whole approach proved to be very useful for processing of free word order languages like German. Especially for the divide-and-conquer parsing strategy we obtained an f-measure of 87.14% on unseen data.