The task of Chinese word segmentation is to split a sequence of Chinese characters into tokens so that Chinese text can be retrieved more easily by web search engines. Because the volume of Chinese literature has grown dramatically in recent years, analyzing it in a timely manner has become a major challenge for web search engines. In this paper, we investigate a new approach to high-performance Chinese information processing. We propose a CPU-GPU collaboration model for Chinese word segmentation, in which a dictionary-based word segmentation approach is designed to fit the GPU architecture. Three basic word segmentation algorithms are used to evaluate the performance of this model. In addition, we present several optimization strategies to exploit the computing power of the GPU more fully. Our experimental results show that the model achieves speedups of up to 3-fold over the corresponding CPU implementations.
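To make "dictionary-based word segmentation" concrete, here is a minimal CPU-side sketch of forward maximum matching, a common dictionary-based method. The abstract does not name its three algorithms, so the choice of this one is an assumption, and the function name `forward_max_match`, the toy dictionary, and the sample sentence are all hypothetical. The sketch does not attempt the paper's GPU mapping or its optimization strategies.

```python
def forward_max_match(text, dictionary, max_word_len=None):
    """Greedy left-to-right segmentation: at each position, take the
    longest dictionary word that starts there (illustrative sketch only)."""
    if max_word_len is None:
        # Longest dictionary entry bounds the candidate window.
        max_word_len = max((len(w) for w in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a match is found.
        for length in range(min(max_word_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            # A single character is always accepted as a fallback token.
            if length == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += length
                break
    return tokens


# Hypothetical toy dictionary and input, chosen to show greedy behaviour.
toy_dictionary = {"研究", "研究生", "生命", "起源"}
tokens = forward_max_match("研究生命起源", toy_dictionary)
print(tokens)
```

On this toy input the greedy rule picks 研究生 first and leaves 命 as a single-character token, which illustrates the kind of ambiguity a purely dictionary-based matcher can produce. Each lookup depends only on the dictionary and a local window of text, which is one reason such methods are plausible candidates for data-parallel hardware.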