Storing and retrieving word phrases
Information Processing and Management: an International Journal
Stochastic models for the distribution of index terms
Journal of Documentation
Information Processing and Management: an International Journal - Special issue on Informetrics
Usage analysis of a digital library
Proceedings of the third ACM conference on Digital libraries
Analysis of a very large web search engine query log
ACM SIGIR Forum
Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Searching the Web: the public and their queries
Journal of the American Society for Information Science and Technology
Vox populi: the public searching of the Web
Journal of the American Society for Information Science and Technology
Subject categorization of query terms for exploring Web users' search interests
Journal of the American Society for Information Science and Technology
Characteristics of question format web queries: an exploratory study
Information Processing and Management: an International Journal
A Contextual Term Suggestion Mechanism for Interactive Web Search
WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
Journal of the American Society for Information Science and Technology
Mining longitudinal web queries: trends and patterns
Journal of the American Society for Information Science and Technology
Web searching for sexual information: an exploratory study
Information Processing and Management: an International Journal
SpidersRUs: automated development of vertical search engines in different domains and languages
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Extension of Zipf's law to words and phrases
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Analysis of the query logs of a web site search engine
Journal of the American Society for Information Science and Technology
Web searching in Chinese: A study of a search engine in Hong Kong
Journal of the American Society for Information Science and Technology
Extending Zipf's law to n-grams for large corpora
Artificial Intelligence Review
Exploiting query term correlation for list caching in web search engines
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Character usage in Chinese short message service SMS: a real-world study in Mainland China
International Journal of Mobile Communications
Recent and robust query auto-completion
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
The use of non-English Web search engines has been prevalent. Given the popularity of Chinese Web searching and the unique characteristics of Chinese language, it is imperative to conduct studies with focuses on the analysis of Chinese Web search queries. In this paper, we report our research on the character usage of Chinese search logs from a Web search engine in Hong Kong. By examining the distribution of search query terms, we found that users tended to use more diversified terms and that the usage of characters in search queries was quite different from the character usage of general online information in Chinese. After studying the Zipf distribution of n-grams with different values of n, we found that the curve of unigram is the most curved one of all while the bigram curve follows the Zipf distribution best, and that the curves of n-grams with larger n (n=3-6) had similar structures with @b-values in the range of 0.66-0.86. The distribution of combined n-grams was also studied. All the analyses are performed on the data both before and after the removal of function terms and incomplete terms and similar findings are revealed. We believe the findings from this study have provided some insights into further research in non-English Web searching and will assist in the design of more effective Chinese Web search engines.