Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Improved Named Entity Translation and Bilingual Named Entity Extraction
ICMI '02 Proceedings of the 4th IEEE International Conference on Multimodal Interfaces
Two languages are more informative than one
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
A high-performance semi-supervised learning method for text chunking
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Mention detection crossing the language barrier
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Language specific issue and feature exploration in Chinese event extraction
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
The stages of event extraction
ARTE '06 Proceedings of the Workshop on Annotating and Reasoning about Time and Events
Data selection in semi-supervised learning for name tagging
IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Arabic named entity recognition: using features extracted from noisy data
ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
A pairwise event coreference model, feature impact and evaluation for event coreference resolution
eETTs '09 Proceedings of the Workshop on Events in Emerging Text Types
Leveraging natural language processing of clinical narratives for phenotype modeling
PIKM '10 Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management
Challenges from information extraction to information fusion
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Semi-supervised semantic pattern discovery with guidance from unsupervised pattern clusters
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A transfer approach to detecting disease reporting events in blog social media
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Employing compositional semantics and discourse consistency in Chinese event extraction
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Using compositional semantics and discourse consistency to improve Chinese trigger identification
Information Processing and Management: an International Journal
Hi-index | 0.00 |
This paper proposes a new bootstrapping framework using cross-lingual information projection. We demonstrate that this framework is particularly effective for a challenging NLP task which is situated at the end of a pipeline and thus suffers from the errors propagated from upstream processing and has low-performance baseline. Using Chinese event extraction as a case study and bitexts as a new source of information, we present three bootstrapping techniques. We first conclude that the standard mono-lingual bootstrapping approach is not so effective. Then we exploit a second approach that potentially benefits from the extra information captured by an English event extraction system and projected into Chinese. Such a cross-lingual scheme produces significant performance gain. Finally we show that the combination of mono-lingual and cross-lingual information in bootstrapping can further enhance the performance. Ultimately this new framework obtained 10.1% relative improvement in trigger labeling (F-measure) and 9.5% relative improvement in argument-labeling.