Automatic grammar generation from two different perspectives
Automatic grammar generation from two different perspectives
Facilitating treebank annotation using a statistical parser
HLT '01 Proceedings of the first international conference on Human language technology research
Statistical parsing with an automatically-extracted tree adjoining grammar
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Statistically-enhanced new word identification in a rule-based Chinese system
CLPW '00 Proceedings of the second workshop on Chinese language processing: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 12
The Penn Chinese TreeBank: Phrase structure annotation of a large corpus
Natural Language Engineering
Is it harder to parse Chinese, or the Chinese Treebank?
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks
Computational Linguistics
Combining classifiers for Chinese word segmentation
SIGHAN '02 Proceedings of the first SIGHAN workshop on Chinese language processing - Volume 18
An all-subtrees approach to unsupervised parsing
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Syntax-based alignment: supervised or unsupervised?
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Prototype-driven learning for sequence models
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Chinese Word Segmentation for Terrorism-Related Contents
PAISI, PACCF and SOCO '08 Proceedings of the IEEE ISI 2008 PAISI, PACCF, and SOCO international workshops on Intelligence and Security Informatics
Unsupervised parsing with U-DOP
CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
The NVI clustering evaluation measure
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Unsupervised induction of labeled parse trees by clustering with syntactic features
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Chinese dependency parsing with large scale automatically constructed case structures
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
TBL-improved non-deterministic segmentation and POS tagging for a Chinese parser
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Two languages are better than one (for syntactic parsing)
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A three-step deterministic parser for Chinese dependency parsing
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
A dual-layer CRFs based joint decoding method for cascaded segmentation and labeling tasks
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Annotation compatibility working group report
LAC '06 Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006
Semi-automated named entity annotation
LAW '07 Proceedings of the Linguistic Annotation Workshop
Building Chinese sense annotated corpus with the help of software tools
LAW '07 Proceedings of the Linguistic Annotation Workshop
Label correspondence learning for part-of-speech annotation transformation
Proceedings of the 18th ACM conference on Information and knowledge management
Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation
ACL-IJCNLP '09 Proceedings of the Third Linguistic Annotation Workshop
Bilingually-constrained (monolingual) shift-reduce parsing
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Refining grammars for parsing with hierarchical semantic knowledge
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Natural language grammar induction with a generative constituent-context model
Pattern Recognition
Painless unsupervised learning with features
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Bitext dependency parsing with bilingual subtree constraints
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Phylogenetic grammar induction
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
The deep re-annotation in a Chinese scientific Treebank
LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
Learning better monolingual models with unannotated bilingual text
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Heterogeneous parsing via collaborative decoding
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A comparison of unsupervised methods for part-of-speech tagging in Chinese
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A multi-domain web-based algorithm for POS tagging of unknown words
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Semi-automatically developing Chinese HPSG grammar from the Penn Chinese Treebank for deep parsing
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Automatic treebank conversion via informed decoding
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Domain-specific Chinese word segmentation using suffix tree and mutual information
Information Systems Frontiers
Better automatic treebank conversion using a feature-based approach
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Automatic Treebank Conversion via Informed Decoding - A Case Study on Chinese Treebanks
ACM Transactions on Asian Language Information Processing (TALIP)
A chinese corpus with word sense annotation
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
A lexicon-constrained character model for chinese morphological analysis
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Language Resources and Evaluation
Hi-index | 0.00 |
In this paper we address issues related to building a large-scale Chinese corpus. We try to answer four questions: (i) how to speed up annotation, (ii) how to maintain high annotation quality, (iii) for what purposes is the corpus applicable, and finally (iv) what future work we anticipate.