Very high accuracy and fast dependency parsing is not a contradiction

Authors: Bernd Bohnet
Affiliations: University of Stuttgart
Venue: COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Year: 2010

Abstract

In addition to high accuracy, short parsing and training times are the most important properties of a parser. However, parsing and training times are still relatively long. To determine why, we analyzed the time usage of a dependency parser. We illustrate that the mapping of the features onto their weights in the support vector machine is the major factor in time complexity. To resolve this problem, we implemented the passive-aggressive perceptron algorithm as a Hash Kernel. The Hash Kernel substantially improves the parsing times and takes into account the features of negative examples built during training. This has led to higher accuracy. We could further increase the parsing and training speed with parallel feature extraction and a parallel parsing algorithm. We are convinced that the Hash Kernel and the parallelization can also be applied successfully to other NLP applications, such as transition-based dependency parsers, phrase structure parsers, and machine translation.
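
The idea of replacing the feature-to-weight mapping with a hash function can be illustrated with a minimal Python sketch. It is not the paper's implementation: the class name `HashKernelPA`, the table size, the use of CRC32 as the hash function, the PA-I style step size capped by `C`, and the example feature strings are all assumptions chosen for illustration. The sketch shows two points from the abstract: a weight is found by hashing the feature directly, with no dictionary lookup, and features of wrongly predicted (negative) structures receive weights as well because any feature string maps to some index.

```python
import zlib
from collections import Counter


class HashKernelPA:
    """Toy passive-aggressive learner over a hashed weight vector (illustrative only)."""

    def __init__(self, size=1 << 20, C=1.0):
        self.size = size          # number of weight slots (assumed value)
        self.C = C                # cap on the step size (PA-I variant, an assumption)
        self.w = [0.0] * size

    def index(self, feature):
        # Hash the feature string straight to a weight slot; CRC32 is a stand-in hash.
        return zlib.crc32(feature.encode("utf-8")) % self.size

    def vector(self, features):
        # Sparse vector: slot index -> count. Colliding features share a slot.
        return Counter(self.index(f) for f in features)

    def score(self, features):
        return sum(self.w[i] * c for i, c in self.vector(features).items())

    def update(self, gold_features, predicted_features, cost=1.0):
        # Difference between the gold and the (wrong) predicted feature vectors.
        diff = self.vector(gold_features)
        diff.subtract(self.vector(predicted_features))
        norm_sq = sum(v * v for v in diff.values())
        if norm_sq == 0:
            return 0.0
        # Hinge loss: the gold structure should outscore the prediction by `cost`.
        loss = cost - sum(self.w[i] * v for i, v in diff.items())
        if loss <= 0:
            return 0.0
        tau = min(self.C, loss / norm_sq)
        for i, v in diff.items():
            # Negative-example features get (negative) weight here too.
            self.w[i] += tau * v
        return tau


model = HashKernelPA(size=1 << 16)
gold = ["head=saw|dep=dog", "pos=VBD>NN"]       # hypothetical feature strings
predicted = ["head=dog|dep=saw", "pos=NN>VBD"]  # hypothetical wrong parse
model.update(gold, predicted)
print(model.score(gold) - model.score(predicted))
```

After one update the gold features outscore the predicted ones by the requested margin. The trade-off in this design is that distinct features can collide in one slot, which a larger table makes less likely.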