Query log analysis has received substantial attention in recent years, and the click graph is an important technique for describing the relationship between queries and URLs. State-of-the-art approaches that model the click graph from raw click frequencies, however, neither eliminate noise nor handle heterogeneous query-URL pairs well. In this paper, we investigate and develop a novel entropy-biased framework for modeling click graphs. The intuition behind this model is that different query-URL pairs should be treated differently: common clicks on less frequent but more specific URLs are of greater value than common clicks on frequent and general URLs. Based on this intuition, we utilize the entropy information of the URLs and introduce a new concept, the inverse query frequency (IQF), to weigh the importance (discriminative ability) of a click on a certain URL. The IQF weighting scheme has not previously been explicitly explored or statistically examined for any bipartite graph in the information retrieval literature. We not only formally define and quantify this scheme, but also combine it with the click frequency and user frequency information on the click graph to obtain an effective query representation. To illustrate our methodology, we conduct experiments on the AOL query log data for query similarity analysis and query suggestion tasks. Experimental results demonstrate that our entropy-biased models yield considerable improvements in performance. Moreover, our method can also be applied to other bipartite graphs.
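To make the weighting idea concrete, the following is a minimal sketch rather than the paper's implementation. The abstract does not state the IQF formula, so the sketch assumes an IDF-style form, iqf(u) = log(|Q| / n(u)), where |Q| is the number of distinct queries and n(u) is the number of distinct queries with a click on URL u. The function names (`iqf_weights`, `query_vectors`, `cosine`), the choice of cosine similarity, and the toy click data are hypothetical, and the user frequency component is left out.

```python
import math
from collections import defaultdict


def iqf_weights(clicks):
    """Assumed IDF-style weight: iqf(u) = log(|Q| / n(u)).

    clicks: list of (query, url, click_count) triples.
    """
    queries = set()
    url_queries = defaultdict(set)
    for query, url, _ in clicks:
        queries.add(query)
        url_queries[url].add(query)
    total = len(queries)
    return {url: math.log(total / len(qs)) for url, qs in url_queries.items()}


def query_vectors(clicks):
    """Represent each query as a sparse vector of click_count * iqf(url)."""
    iqf = iqf_weights(clicks)
    vectors = defaultdict(dict)
    for query, url, count in clicks:
        vectors[query][url] = vectors[query].get(url, 0.0) + count * iqf[url]
    return dict(vectors)


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b[url] for url, w in a.items() if url in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy click log with made-up URLs: one general URL clicked for every query,
# and two specific URLs clicked for fewer queries.
clicks = [
    ("map", "maps.example.com", 10),
    ("map", "wikipedia.org", 5),
    ("street map", "maps.example.com", 8),
    ("street map", "wikipedia.org", 2),
    ("java", "wikipedia.org", 7),
    ("java", "java.example.com", 9),
]

iqf = iqf_weights(clicks)
vectors = query_vectors(clicks)
print(iqf)
print(cosine(vectors["map"], vectors["street map"]))
print(cosine(vectors["map"], vectors["java"]))
```

In this toy log the general URL is clicked for every query, so its assumed IQF weight is zero and it no longer contributes to the similarity between "map" and "java". Under raw click counts, that shared general URL would give the two queries a nonzero similarity.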