Automated conversion has allowed the development of wide-coverage corpora for a variety of grammar formalisms without the expense of manual annotation. Analysing new languages also tests formalisms, exposing their strengths and weaknesses. We present Chinese CCGbank, a 760,000-word corpus annotated with Combinatory Categorial Grammar (CCG) derivations, induced automatically from the Penn Chinese Treebank (PCTB). We design parsimonious CCG analyses for a range of Chinese syntactic constructions, and transform the PCTB trees to produce them. Our process yields a corpus of 27,759 derivations, covering 98.1% of the PCTB.