Learning to Parse Natural Language with Maximum Entropy Models

Authors:
Adwait Ratnaparkhi
Affiliations:
Department of Computer and Information Science, University of Pennsylvania, 200 South 33rd Street, Philadelphia, PA 19104-6389. adwait@unagi.cis.upenn.edu
Venue:
Machine Learning - Special issue on natural language learning
Year:
1999

Citing 19
Cited 82

Procedure for quantitatively comparing the syntactic coverage of English grammars

HLT '91 Proceedings of the workshop on Speech and Natural Language
Natural language understanding (2nd ed.)

Natural language understanding (2nd ed.)
A maximum entropy approach to natural language processing

Computational Linguistics
Inducing Features of Random Fields

IEEE Transactions on Pattern Analysis and Machine Intelligence
Theory of Syntactic Recognition for Natural Languages

Theory of Syntactic Recognition for Natural Languages
Generalized probabilistic LR parsing of natural language (Corpora) with unification-based grammars

Computational Linguistics - Special issue on using large corpora: I
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
Coping with ambiguity and unknown words through probabilistic models

Computational Linguistics - Special issue on using large corpora: II
A stochastic parts program and noun phrase parser for unrestricted text

ANLC '88 Proceedings of the second conference on Applied natural language processing
The domain dependence of parsing

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Three generative, lexicalised models for statistical parsing

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Learning parse and translation decisions from examples with rich context

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Towards history-based grammars: using richer models for probabilistic parsing

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Statistical decision-tree models for parsing

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
A new statistical parser based on bigram lexical dependencies

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Adaptive language modeling using the maximum entropy principle

HLT '93 Proceedings of the workshop on Human Language Technology
Decision tree parsing using a hidden derivation model

HLT '94 Proceedings of the workshop on Human Language Technology
Compilers: Principles, Techniques, and Tools (2nd Edition)

Compilers: Principles, Techniques, and Tools (2nd Edition)
Statistical parsing with a context-free grammar and word statistics

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence

Analysis of the grammatical functions between adnoun and noun phrases in Korean using Support Vector Machines

Natural Language Engineering
Do all fragments count?

Natural Language Engineering
A maximum-entropy-inspired parser

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Parsing with the shortest derivation

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Investigating GIS and smoothing for maximum entropy taggers

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Neural network probability estimation for broad coverage parsing

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Mitigating the paucity-of-data problem: exploring the effect of training corpus size on classifier performance for natural language processing

HLT '01 Proceedings of the first international conference on Human language technology research
What is the minimal set of fragments that achieves maximal parse accuracy?

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Immediate-head parsing for language models

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Edit detection and parsing for transcribed speech

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Inducing history representations for broad coverage statistical parsing

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A parsing: fast exact Viterbi parse selection

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Supervised and unsupervised PCFG adaptation to novel domains

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Identifying and tracking entity mentions in a maximum entropy framework

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Maximum Entropy Modeling: A Suitable Framework to Learn Context-Dependent Lexicon Models for Statistical Machine Translation

Machine Learning
Filtering-Ranking Perceptron Learning for Partial Parsing

Machine Learning
Bootstrapping parsers via syntactic projection across parallel texts

Natural Language Engineering
A comparison between supervised learning algorithms for word sense disambiguation

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
A robust risk minimization based named entity recognition system

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Investigating loss functions and optimization methods for discriminative learning of label sequences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
HowtogetaChineseName(Entity): segmentation and combination issues

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Creating probabilistic databases from information extraction models

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Discriminative training of a neural network statistical parser

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Incremental parsing with the perceptron algorithm

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations

ACLdemo '04 Proceedings of the ACL 2004 on Interactive poster and demonstration sessions
Online large-margin training of dependency parsers

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Data-defined kernels for parse reranking derived from probabilistic models

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Reranking and self-training for parser adaptation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A fast, accurate deterministic parser for Chinese

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Advances in discriminative parsing

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Discriminative hidden Markov modeling with long state dependence using a kNN ensemble

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Multi-lingual coreference resolution with syntactic features

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Minority vote: at-least-N voting improves recall for extracting relations

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Algorithms for deterministic incremental dependency parsing

Computational Linguistics
Information Extraction

Foundations and Trends in Databases
CRF Models for Tamil Part of Speech Tagging and Chunking

ICCPOL '09 Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy
Porting statistical parsers with data-defined kernels

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Improved large margin dependency parsing via local constraints and laplacian regularization

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Multi-lingual dependency parsing at NAIST

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Classifying chart cells for quadratic complexity context-free inference

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Efficient incremental beam-search parsing with generative and discriminative models: keynote talk

IncrementParsing '04 Proceedings of the Workshop on Incremental Parsing: Bringing Engineering and Cognition Together
A statistical constraint dependency grammar (CDG) parser

IncrementParsing '04 Proceedings of the Workshop on Incremental Parsing: Bringing Engineering and Cognition Together
Linear complexity context-free parsing pipelines via chart constraints

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Meeting TempEval-2: shallow approach for temporal tagger

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Simple training of dependency parsers via structured boosting

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Bayesian information extraction network

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Computational challenges in parsing by classification

CHSLP '06 Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
Stacked sequential learning

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
MAP adaptation of stochastic grammars

Computer Speech and Language
Efficacy of beam thresholding, unification filtering and hybrid parsing in probabilistic HPSG parsing

Parsing '05 Proceedings of the Ninth International Workshop on Parsing Technology
k-best A* parsing

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Transition-based parsing of the Chinese treebank using a global discriminative model

IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
The integration of syntactic parsing and semantic role labeling

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Structural bias in inducing representations for probabilistic natural language parsing

ICANN/ICONIP'03 Proceedings of the 2003 joint international conference on Artificial neural networks and neural information processing
Syntactic parsing with hierarchical modeling

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Comparing two approaches for the recognition of temporal expressions

KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Joint syntactic and semantic parsing of Chinese

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Profiting from mark-up: hyper-text annotations for guided parsing

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Feature selection for fluency ranking

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Constituent reordering and syntax models for English-to-Japanese statistical machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Syntax based reordering with automatically derived rules for improved statistical machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A deterministic method to predict phrase boundaries of a syntactic tree

ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing
Tree topological features for unlexicalized parsing

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
An analysis of tree topological features in classifier-based unlexicalized parsing

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Syntactic processing using the generalized perceptron and beam search

Computational Linguistics
Temporal restricted Boltzmann machines for dependency parsing

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Reversible stochastic attribute-value grammars

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Semantic role labeling using maximum entropy

CIS'04 Proceedings of the First international conference on Computational and Information Science
Iterative CKY parsing for probabilistic context-free grammars

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Deterministic dependency structure analyzer for chinese

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Event extraction as dependency parsing for BioNLP 2011

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Content-based mobile spam classification using stylistically motivated features

Pattern Recognition Letters
Mutual information independence model using kernel density estimation for segmenting and labeling sequential data

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
A correction model for word alignments

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Parsing biomedical literature

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A machine learning parser using an unlexicalized distituent model

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
P-top-k queries in a probabilistic framework from information extraction models

Computers & Mathematics with Applications
Tree representations in probabilistic models for extended named entities detection

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Finite-state chart constraints for reduced complexity context-free parsing pipelines

Computational Linguistics
Entity extraction, linking, classification, and tagging for social media: a wikipedia-based approach

Proceedings of the VLDB Endowment
Multilingual joint parsing of syntactic and semantic dependencies with a latent variable model

Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a machine learning systemfor parsing natural language thatlearns from manually parsed example sentences, andparses unseen data at state-of-the-art accuracies.Its machine learning technology, based on the maximum entropy framework, is highly reusable and not specific to the parsing problem,while the linguistic hints that it uses to learncan be specified concisely.It therefore requires a minimal amount of human effort and linguistic knowledge for its construction.In practice, the running time of the parser on a test sentence is linear with respect to the sentence length.We also demonstrate that the parser can train from other domains without modificationto the modeling framework or the linguistic hints it uses to learn.Furthermore, this paper shows that research into rescoring the top 20 parses returned by the parsermight yield accuracies dramatically higher than the state-of-the-art.