Effective parsing using competing CFG rules

Authors:
Miloš Jakubíček
Affiliations:
Natural Language Processing Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic
Venue:
TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Year:
2011

Citing 8
Cited 0

Parsing Schemata: A Framework for Specification and Analysis of Parsing Algorithms

Parsing Schemata: A Framework for Specification and Analysis of Parsing Algorithms
A test of the leaf-ancestor metric for parse accuracy

Natural Language Engineering
New Methods for Pruning and Ordering of Syntax Parsing Trees

TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Mining Phrases from Syntactic Analysis

TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
Improving parsing accuracy by combining diverse dependency parsers

Parsing '05 Proceedings of the Ninth International Workshop on Parsing Technology
Dependency and phrasal parsers of the Czech language: a comparison

TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Semantic role patterns and verb classes in verb valency lexicon

TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
New meta-grammar constructs in czech language parser synt

TSD'05 Proceedings of the 8th international conference on Text, Speech and Dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper a new pruning method for a rule-based parser is described that relies on separating the underlying grammar rules into several mutually competing levels. This method has been developed and exploited for Czech in the syntactic parser Synt to reduce the number of possible output derivation trees. The algorithm behind operates on a so called packed forest of trees, a compressing data structure used for internal representation of parallel analyses, and thus performs very effectively. An evaluation of its contribution has been performed on the Brno Phrasal Treebank showing that the algorithm significantly prunes the resulting tree space while preserving perspective parses.