The Hierarchical Hidden Markov Model: Analysis and Applications
Machine Learning
Automatic recognition of Chinese unknown words based on roles tagging
SIGHAN '02 Proceedings of the first SIGHAN workshop on Chinese language processing - Volume 18
Log-linear models for word alignment
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Discriminative pruning of language models for Chinese word segmentation
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Unsupervised segmentation of Chinese text by use of branching entropy
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Subword-based tagging for confidence-dependent Chinese word segmentation
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Chinese word segmentation and statistical machine translation
ACM Transactions on Speech and Language Processing (TSLP)
Chinese Word Segmentation for Terrorism-Related Contents
PAISI, PACCF and SOCO '08 Proceedings of the IEEE ISI 2008 PAISI, PACCF, and SOCO international workshops on Intelligence and Security Informatics
A Joint Segmenting and Labeling Approach for Chinese Lexical Analysis
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Mobi-watchdog: you can steal, but you can't run!
Proceedings of the second ACM conference on Wireless network security
Bilingually Motivated Word Segmentation for Statistical Machine Translation
ACM Transactions on Asian Language Information Processing (TALIP)
Bayesian semi-supervised Chinese word segmentation for statistical machine translation
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Bilingually motivated domain-adapted word segmentation for statistical machine translation
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Information retrieval oriented word segmentation based on character associative strength ranking
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Hiding Information by Context-Based Synonym Substitution
IWDW '09 Proceedings of the 8th International Workshop on Digital Watermarking
Improved statistical machine translation by multiple Chinese word segmentation
StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
A delimiter-based general approach for Chinese term extraction
Journal of the American Society for Information Science and Technology
A statistical service composition approach based on hidden hierarchy Markov model
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 1
Domain ontology learning and consistency checking based on TSC approach and racer
RR'07 Proceedings of the 1st international conference on Web reasoning and rule systems
Towards knowledge extraction from weblogs and rule-based semantic querying
RuleML'07 Proceedings of the 2007 international conference on Advances in rule interchange and applications
A character-based joint model for Chinese word segmentation
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Joint tokenization and translation
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A semantic analyzer for aiding emotion recognition in Chinese
ICIC'06 Proceedings of the 2006 international conference on Intelligent computing: Part II
Treatment of quantifiers in Chinese-Japanese machine translation
ICIC'06 Proceedings of the 2006 international conference on Intelligent computing: Part II
Domain-specific Chinese word segmentation using suffix tree and mutual information
Information Systems Frontiers
SEEN: a semantic dependency analyzer for Chinese
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Rule-based translation of quantifiers for Chinese-Japanese machine translation
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Syntax-based reordering for statistical machine translation
Computer Speech and Language
The human-like emotions recognition using mutual information and semantic clues
Edutainment'11 Proceedings of the 6th international conference on E-learning and games, edutainment technologies
Keyword extraction based on sequential pattern mining
Proceedings of the Third International Conference on Internet Multimedia Computing and Service
A novel hierarchical document clustering algorithm based on a kNN connection graph
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Interactive chinese search results clustering for personalization
WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
The use of SVM for chinese new word identification
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
A question answering system on special domain and the implementation of speech interface
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
A chunking strategy towards unknown word detection in chinese word segmentation
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A lexicon-constrained character model for chinese morphological analysis
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
An adaptive approach to chinese semantic advertising
ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Expert Systems with Applications: An International Journal
Integrating Generative and Discriminative Character-Based Models for Chinese Word Segmentation
ACM Transactions on Asian Language Information Processing (TALIP)
An ontology-based approach to Chinese semantic advertising
Information Sciences: an International Journal
Evaluating indirect strategies for Chinese-Spanish statistical machine translation
Journal of Artificial Intelligence Research
Building a bilingual dictionary from a Japanese-Chinese patent corpus
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Semantic separator learning and its applications in unsupervised Chinese text parsing
Frontiers of Computer Science: Selected Publications from Chinese Universities
Tweeting under pressure: analyzing trending topics and evolving word choice on sina weibo
Proceedings of the first ACM conference on Online social networks
Hi-index | 0.00 |
This document presents the results from Inst. of Computing Tech., CAS in the ACL SIGHAN-sponsored First International Chinese Word Segmentation Bake-off. The authors introduce the unified HHMM-based frame of our Chinese lexical analyzer ICTCLAS and explain the operation of the six tracks. Then provide the evaluation results and give more analysis. Evaluation on ICTCLAS shows that its performance is competitive. Compared with other system, ICTCLAS has ranked top both in CTB and PK closed track. In PK open track, it ranks second position. ICTCLAS BIG5 version was transformed from GB version only in two days; however, it achieved well in two BIG5 closed tracks. Through the first bakeoff, we could learn more about the development in Chinese word segmentation and become more confident on our HHMM-based approach. At the same time, we really find our problems during the evaluation. The bakeoff is interesting and helpful.