Previous efforts on event detection from the web have focused primarily on web content and structure data, ignoring the rich collection of web log data. In this paper, we propose the first approach to detect events from click-through data, which is the log data of web search engines. The intuition behind event detection from click-through data is that such data is often event-driven, and each event can be represented as a set of query-page pairs that are not only semantically similar but also have similar evolution patterns over time. Given the click-through data, our proposed approach first segments it into a sequence of bipartite graphs based on a user-defined time granularity. Next, the sequence of bipartite graphs is represented as a vector-based graph, which records the semantic and evolutionary relationships between queries and pages. The vector-based graph is then transformed into its dual graph, where each node is a query-page pair that will be used to represent real-world events. The problem of event detection thus becomes the problem of clustering the dual graph of the vector-based graph. The clustering process is based on a two-phase graph cut algorithm. In the first phase, query-page pairs are clustered by semantic similarity, such that each resulting cluster corresponds to a specific topic. In the second phase, query-page pairs related to the same topic are further clustered by the similarity of their evolution patterns, such that each cluster is expected to represent a specific event under that topic. Experiments with real click-through data collected from a commercial web search engine show that the proposed approach produces high-quality results.
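The segmentation step and the notion of an evolution pattern can be illustrated with a minimal Python sketch. It is not the paper's implementation: the record layout `(timestamp, query, page)`, the integer timestamps, the function names `build_edge_vectors` and `cosine`, and the use of cosine similarity to compare evolution patterns are all assumptions made for illustration. The dual-graph construction and the two-phase graph cut are not shown.

```python
from collections import defaultdict
from math import sqrt


def build_edge_vectors(clicks, granularity):
    """Slice click records into time windows and count clicks per query-page pair.

    clicks: iterable of (timestamp, query, page) with integer timestamps
            (hypothetical record layout).
    granularity: window length, in the same unit as the timestamps.
    Returns {(query, page): [click count in window 0, window 1, ...]}.
    Each window corresponds to one bipartite query-page graph; the list
    attached to a pair is its evolution pattern across those graphs.
    """
    clicks = list(clicks)
    if not clicks:
        return {}
    start = min(t for t, _, _ in clicks)
    n_windows = (max(t for t, _, _ in clicks) - start) // granularity + 1
    vectors = defaultdict(lambda: [0] * n_windows)
    for t, query, page in clicks:
        vectors[(query, page)][(t - start) // granularity] += 1
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity of two equal-length count vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Made-up sample log, for illustration only.
sample_clicks = [
    (0, "world cup", "fifa.com"),
    (1, "world cup", "fifa.com"),
    (12, "world cup", "fifa.com"),
    (11, "football scores", "espn.com"),
    (13, "football scores", "espn.com"),
    (25, "tax forms", "irs.gov"),
]
vectors = build_edge_vectors(sample_clicks, granularity=10)
similarity = cosine(
    vectors[("world cup", "fifa.com")],
    vectors[("football scores", "espn.com")],
)
```

With a window of 10 time units, the sample log spans three windows. The pair `("world cup", "fifa.com")` gets the vector `[2, 1, 0]` and `("football scores", "espn.com")` gets `[0, 2, 0]`, so the two pairs overlap in time, while `("tax forms", "irs.gov")` is active only in the last window and has zero similarity to both.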