Growing interest in cross-lingual and multilingual information retrieval has made it a major challenge to design accurate retrieval systems for Asian languages such as Chinese, Thai and Japanese. Word segmentation is one of the most important preprocessing steps in Chinese information processing. This paper reviews several popular word segmentation algorithms and proposes a Chinese word segmentation system built on an improved Converse Chinese dictionary and an optimized reverse maximum matching algorithm. Experiments demonstrate that the system substantially improves both segmentation accuracy and speed.
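For readers unfamiliar with reverse maximum matching, the following is a minimal sketch of the basic, unoptimized technique. It is not the system proposed in the paper: the function name `reverse_maximum_matching`, the `max_word_len` parameter, and the four-entry `toy_dictionary` are assumptions made purely for illustration, and a plain Python set stands in for the paper's dictionary structure.

```python
def reverse_maximum_matching(text, dictionary, max_word_len=4):
    """Segment `text` by scanning right to left and taking, at each step,
    the longest dictionary word that ends at the current position."""
    words = []
    end = len(text)
    while end > 0:
        # Try the longest candidate ending at `end` first, then shrink it.
        for size in range(min(max_word_len, end), 0, -1):
            candidate = text[end - size:end]
            # A single character is always accepted as a fallback,
            # so the scan is guaranteed to make progress.
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                end -= size
                break
    # Words were collected from the end of the text, so restore reading order.
    words.reverse()
    return words


# Hypothetical toy dictionary, for illustration only.
toy_dictionary = {"研究", "研究生", "生命", "起源"}

print(reverse_maximum_matching("研究生命的起源", toy_dictionary))
# ['研究', '生命', '的', '起源']
```

With this toy dictionary, a left-to-right (forward) scan would greedily take "研究生" first and leave "命" stranded, whereas the right-to-left scan recovers "研究 / 生命 / 的 / 起源". This kind of ambiguity is a common motivation for preferring the reverse direction in Chinese segmentation.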