In this thesis, we apply and develop techniques and methodologies for examining lexicalized statistical parsing models as complex systems. The primary idea is to treat the “model as data”, which is not a particular method but a paradigm and a research methodology. Our argument is that lexicalized statistical parsing models have become increasingly complex and therefore require thorough scrutiny, both to achieve the scientific aim of understanding what has been built thus far and to achieve the scientific and engineering goal of using that understanding for progress. We take a particular, dominant type of parsing model and perform a macro analysis to reveal its core (and design a software engine that modularizes the periphery); crucially, we also perform a detailed analysis, which provides for the first time a window onto the efficacy of specific parameters. These analyses have not only yielded insight into the core model but have also enabled the identification of “inefficiencies” in our baseline model. Those inefficiencies can be reduced to form a more compact model, exploited to find a better-estimated model with higher accuracy, or both.