Empirical methods for artificial intelligence
Empirical methods for artificial intelligence
Learning to Parse Natural Language with Maximum Entropy Models
Machine Learning - Special issue on natural language learning
Discriminative Reranking for Natural Language Parsing
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
The domain dependence of parsing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Supervised grammar induction using training data with limited constituent information
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Bootstrapping statistical parsers from small datasets
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Coarse-to-fine n-best parsing and MaxEnt discriminative reranking
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Effective self-training for parsing
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Evaluating and integrating treebank parsers on a biomedical corpus
Software '05 Proceedings of the Workshop on Software
MAP adaptation of stochastic grammars
Computer Speech and Language
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Towards robust semantic role labeling
Computational Linguistics
Semi-supervised model adaptation for statistical machine translation
Machine Translation
Self-training for biomedical parsing
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Computing confidence scores for all sub parse trees
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Using Short Dependency Relations from Auto-Parsed Data for Chinese Dependency Parsing
ACM Transactions on Asian Language Information Processing (TALIP)
Learning reliable information for dependency parsing adaptation
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Automatic prediction of parser accuracy
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
IWPT '07 Proceedings of the 10th International Conference on Parsing Technologies
Adapting WSJ-trained parsers to the British National Corpus using in-domain self-training
IWPT '07 Proceedings of the 10th International Conference on Parsing Technologies
Porting a lexicalized-grammar parser to the biomedical domain
Journal of Biomedical Informatics
Exploiting heterogeneous treebanks for parsing
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Cross language dependency parsing using a bilingual lexicon
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Cross-domain dependency parsing using a deep linguistic grammar
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Improving dependency parsing with subtrees from auto-parsed data
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Parser adaptation and projection with quasi-synchronous grammar features
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Domain adaptation for conditional random fields
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Correlating natural language parser performance with statistical measures of the text
KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Automatic domain adaptation for parsing
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
"cba to check the spelling" investigating parser performance on discussion forum posts
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Faster parsing by supertagger adaptation
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Creating robust supervised classifiers via web-scale N-gram data
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Open-domain semantic role labeling by modeling word spans
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Viterbi training for PCFGs: hardness results and competitiveness of uniform initialization
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Open-domain commonsense reasoning using discourse relations from a corpus of weblog stories
FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Adaptive parameters for entity recognition with perceptron HMMs
DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Exploring representation-learning approaches to domain adaptation
DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Viterbi training improves unsupervised dependency parsing
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Improved fully unsupervised parsing with zoomed learning
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Uptraining for accurate deterministic question parsing
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Effective constituent projection across languages
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Domain adaptation by constraining inter-domain variability of latent feature representation
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Exploiting web-derived selectional preference to improve statistical dependency parsing
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Parsing natural language queries for life science knowledge
BioNLP '11 Proceedings of BioNLP 2011 Workshop
ULISSE: an unsupervised algorithm for detecting reliable dependency parses
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Assessing the practical usability of an automatically annotated corpus
LAW V '11 Proceedings of the 5th Linguistic Annotation Workshop
Cross-Domain Effects on Parse Selection for Precision Grammars
Research on Language and Computation
Multi-source transfer of delexicalized dependency parsers
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Relaxed cross-lingual projection of constituent syntax
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Training dependency parsers by jointly optimizing multiple objectives
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A word clustering approach to domain adaptation: effective parsing of biomedical texts
IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
Minimally supervised domain-adaptive parse reranking for relation extraction
IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
Comparing the use of edited and unedited text in parser self-training
IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
Data point selection for self-training
SPMRL '11 Proceedings of the Second Workshop on Statistical Parsing of Morphologically Rich Languages
EXPLOITING SUBTREES IN AUTO-PARSED DATA TO IMPROVE DEPENDENCY PARSING
Computational Intelligence
Anaphora resolution in biomedical literature: a hybrid approach
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Dependency Parsing domain adaptation using transductive SVM
ROBUS-UNSUP '12 Proceedings of the Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
Utilizing dependency language models for graph-based dependency parsing models
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Improved parsing and POS tagging using inter-sentence consistency constraints
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Unsupervised feature adaptation for cross-domain NLP with an application to compositionality grading
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Hi-index | 0.00 |
Statistical parsers trained and tested on the Penn Wall Street Journal (WSJ) treebank have shown vast improvements over the last 10 years. Much of this improvement, however, is based upon an ever-increasing number of features to be trained on (typically) the WSJ treebank data. This has led to concern that such parsers may be too finely tuned to this corpus at the expense of portability to other genres. Such worries have merit. The standard "Charniak parser" checks in at a labeled precision-recall f-measure of 89.7% on the Penn WSJ test set, but only 82.9% on the test set from the Brown treebank corpus.This paper should allay these fears. In particular, we show that the reranking parser described in Charniak and Johnson (2005) improves performance of the parser on Brown to 85.2%. Furthermore, use of the self-training techniques described in (McClosky et al., 2006) raise this to 87.8% (an error reduction of 28%) again without any use of labeled Brown data. This is remarkable since training the parser and reranker on labeled Brown data achieves only 88.4%.