Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
A stochastic parts program and noun phrase parser for unrestricted text
ANLC '88 Proceedings of the second conference on Applied natural language processing
Surface grammatical analysis for the extraction of terminological noun phrases
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
The role of lexicalization and pruning for base noun phrase grammars
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Noun phrase chunking with APL2
APL '00 Proceedings of the international conference on APL-Berlin-2000 conference
Shallow parsing with pos taggers and linguistic features
The Journal of Machine Learning Research
Shallow parsing using noisy and non-stationary training material
The Journal of Machine Learning Research
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Noun phrase recognition by system combination
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Theory refinement and Natural Language Learning
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Tagging and chunking with bigrams
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Man vs. machine: a case study in base noun phrase learning
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Incorporating compositional evidence in memory-based partial parsing
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
A unified statistical model for the identification of English baseNP
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Rule writing or annotation: cost-efficient resource usage for base noun phrase chunking
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Shallow parsing by inferencing with classifiers
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Shallow parsing as part-of-speech tagging
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Exploring evidence for shallow parsing
ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
The Smart/Empire TIPSTER IR system
TIPSTER '98 Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998
Noun phrase chunking in Hebrew: influence of lexical and morphological features
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Highly accurate error-driven method for noun phrase detection
Pattern Recognition Letters
On the role of lexical features in sequence labeling
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Statistical recognition of noun phrases in unrestricted text
IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis
Automatic partial parsing rule acquisition using decision tree induction
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A set of NP-Extraction rules for portuguese: defining, learning and pruning
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Turkish constituent chunking with morphological and contextual features
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Naxi sentence similarity calculation based on improved chunking edit-distance
International Journal of Wireless and Mobile Computing
Hi-index | 0.00 |
Finding simple, non-recursive, base noun phrases is an important subtask for many natural language processing applications. While previous empirical methods for base NP identification have been rather complex, this paper instead proposes a very simple algorithm that is tailored to the relative simplicity of the task. In particular, we present a corpus-based approach for finding base NPs by matching part-of-speech tag sequences. The training phase of the algorithm is based on two successful techniques: first the base NP grammar is read from a "treebank" corpus; then the grammar is improved by selecting rules with high "benefit" scores. Using this simple algorithm with a naive heuristic for matching rules, we achieve surprising accuracy in an evaluation on the Penn Treebank Wall Street Journal.