Characterizing browsing strategies in the World-Wide Web
Proceedings of the Third International World-Wide Web conference on Technology, tools and applications
The nature of statistical learning theory
The nature of statistical learning theory
Analysis of a very large web search engine query log
ACM SIGIR Forum
ACM Computing Surveys (CSUR)
Combining evidence for automatic web session identification
Information Processing and Management: an International Journal - Issues of context in information retrieval
Constrained K-means Clustering with Background Knowledge
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Learning to match and cluster large high-dimensional data sets for data integration
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Using terminological feedback for web search refinement: a log-based study
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Query chains: learning to rank from implicit feedback
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Supervised clustering with support vector machines
ICML '05 Proceedings of the 22nd international conference on Machine learning
Generating query substitutions
Proceedings of the 15th international conference on World Wide Web
Building bridges for web query classification
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
On coreference resolution performance metrics
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Investigating behavioral variability in web search
Proceedings of the 16th international conference on World Wide Web
Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs
Proceedings of the 17th ACM conference on Information and knowledge management
Learning structural SVMs with latent variables
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Locality preserving nonnegative matrix factorization
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Multitasking during Web search sessions
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Identifying task-based sessions in search engine query logs
Proceedings of the fourth ACM international conference on Web search and data mining
Modeling and analysis of cross-session search tasks
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Evaluating the effectiveness of search task trails
Proceedings of the 21st international conference on World Wide Web
Search, interrupted: understanding and predicting search task continuation
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
Search tasks, comprising a series of search queries serving the same information need, have recently been recognized as an accurate atomic unit for modeling user search intent. Most prior research in this area has focused on short-term search tasks within a single search session, and heavily depend on human annotations for supervised classification model learning. In this work, we target the identification of long-term, or cross-session, search tasks (transcending session boundaries) by investigating inter-query dependencies learned from users' searching behaviors. A semi-supervised clustering model is proposed based on the latent structural SVM framework, and a set of effective automatic annotation rules are proposed as weak supervision to release the burden of manual annotation. Experimental results based on a large-scale search log collected from Bing.com confirms the effectiveness of the proposed model in identifying cross-session search tasks and the utility of the introduced weak supervision signals. Our learned model enables a more comprehensive understanding of users' search behaviors via search logs and facilitates the development of dedicated search-engine support for long-term tasks.