Subject content analysis of Web query terms is essential to understanding Web searching interests. Such analysis includes exploring search topics and observing how their frequency distributions change over time. To provide a basis for in-depth analysis of users' search interests on a larger scale, this article presents a query categorization approach that automatically classifies Web query terms into broad subject categories. Because a query is short and simple in structure, its intended search subject(s) are difficult to judge. Our approach, therefore, incorporates the search processes of real-world search engines to obtain highly ranked Web documents for each unknown query term. These documents are used to extract co-occurring terms and to create a feature set. An effective ranking function has also been developed to find the most appropriate categories. Three search engine logs from Taiwan were collected and tested. They contained over 5 million queries from different periods of time. The achieved performance is quite encouraging compared with that of human categorization. The experimental results demonstrate that the approach is efficient in dealing with large numbers of queries and adaptable to the dynamic Web environment. Through good integration of human and machine efforts, the frequency distributions of subject categories in response to changes in users' search interests can be systematically observed in real time. The approach has also shown potential for use in various information retrieval applications, and provides a basis for further Web searching studies.
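The pipeline described above (retrieve highly ranked documents for a query term, extract co-occurring terms as features, then rank candidate categories) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the canned `FAKE_SEARCH_RESULTS`, the `CATEGORY_PROFILES`, and all function names are hypothetical, and plain cosine similarity stands in for the article's ranking function, which the abstract does not specify.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in for the highly ranked documents a real search
# engine would return. These snippets are invented for illustration.
FAKE_SEARCH_RESULTS = {
    "nba": [
        "nba basketball playoffs schedule and team scores",
        "basketball players and nba team standings this season",
    ],
    "mp3": [
        "download mp3 music songs and albums",
        "free music player for mp3 songs",
    ],
}

# Hypothetical broad subject categories, each described by a few terms.
CATEGORY_PROFILES = {
    "Sports": "basketball football team scores season players playoffs league",
    "Entertainment": "music songs movies albums player download stars",
    "Computers": "software hardware download programming network internet",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def fake_search(query):
    """Return canned documents for a query (assumed search-engine call)."""
    return FAKE_SEARCH_RESULTS.get(query.lower(), [])


def build_feature_vector(query, search):
    """Count terms that co-occur with the query in the retrieved documents."""
    counts = Counter()
    for doc in search(query):
        counts.update(t for t in tokenize(doc) if t != query.lower())
    return counts


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_categories(query, search, profiles):
    """Score every category against the query's features, best first.

    Cosine similarity is an assumed placeholder for the ranking function.
    """
    features = build_feature_vector(query, search)
    scored = [
        (name, cosine(features, Counter(tokenize(text))))
        for name, text in profiles.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for q in ("nba", "mp3"):
        print(q, rank_categories(q, fake_search, CATEGORY_PROFILES))
```

In this toy setting the query term itself carries no category information; the category is decided only by the terms that surround it in the retrieved documents, which is the motivation the abstract gives for going through a search engine. A query with no retrieved documents scores zero against every category.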