Immediate-head parsing for language models

Author: Eugene Charniak
Affiliation: Brown University, Providence, RI
Venue: ACL '01: Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics
Year: 2001

Abstract

We present two language models based on an "immediate-head" parser, our name for a parser that conditions all events below a constituent c on the head of c. While all of the most accurate statistical parsers are of the immediate-head variety, no previous grammatical language model uses this technology. The perplexities of both models improve significantly on the trigram baseline as well as on the best previous grammar-based language model. For the better of our two models, these improvements are 24% and 14%, respectively. We also suggest that improving the underlying parser should significantly improve the model's perplexity, and that even in the near term there is considerable potential for improvement in immediate-head language models.
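
For readers unfamiliar with the metric, the sketch below shows how perplexity is commonly computed from per-word probabilities and how a relative improvement over a baseline could be expressed. It is a minimal illustration, not the paper's evaluation code: the function names and the toy numbers are assumptions made for this example, and reading the percentages in the abstract as relative perplexity reductions is an interpretation, not something the abstract states.

```python
import math


def perplexity(log2_probs):
    """Perplexity of a word sequence from per-word log2 probabilities.

    Perplexity is 2 raised to the average negative log2 probability
    (the per-word cross-entropy in bits).
    """
    if not log2_probs:
        raise ValueError("need at least one word probability")
    cross_entropy = -sum(log2_probs) / len(log2_probs)
    return 2.0 ** cross_entropy


def relative_reduction(baseline_ppl, model_ppl):
    """Fraction by which model_ppl is lower than baseline_ppl."""
    if baseline_ppl <= 0:
        raise ValueError("baseline perplexity must be positive")
    return (baseline_ppl - model_ppl) / baseline_ppl


# Toy data (hypothetical, not from the paper): a model that assigns
# probability 1/8 to each of four words has perplexity 8.
toy_log2_probs = [math.log2(1 / 8)] * 4
toy_ppl = perplexity(toy_log2_probs)

# Hypothetical perplexities chosen only to illustrate the arithmetic.
toy_reduction = relative_reduction(100.0, 76.0)

print(f"toy perplexity: {toy_ppl:.1f}")
print(f"toy relative reduction: {toy_reduction:.0%}")
```

Lower perplexity means the model assigns higher probability to the held-out text, so a positive relative reduction indicates an improvement over the baseline.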