A cooccurrence-based thesaurus and two applications to information retrieval
Information Processing and Management: an International Journal
A System for new event detection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Simple Semantics in Topic Detection and Tracking
Information Retrieval
Text classification and named entities for new event detection
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Hot Topic Extraction Based on Timeline Analysis and Multidimensional Sentence Modeling
IEEE Transactions on Knowledge and Data Engineering
Efficient computation of personal aggregate queries on blogs
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
EuroISI '08 Proceedings of the 1st European Conference on Intelligence and Security Informatics
Methodological Review: Empirical distributional semantics: Methods and biomedical applications
Journal of Biomedical Informatics
Semantic density analysis: comparing word meaning across time and phonetic space
GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
The S-Space package: an open source package for word space models
ACLDemos '10 Proceedings of the ACL 2010 System Demonstrations
The S-Space package: an open source package for word space models
ACLDemos '10 Proceedings of the ACL 2010 System Demonstrations
Enhancing clinical concept extraction with distributional semantics
Journal of Biomedical Informatics
Connecting the dots: mass, energy, word meaning, and particle-wave duality
QI'12 Proceedings of the 6th international conference on Quantum Interaction
Hi-index | 0.00 |
Automatic event detection aims to identify novel, interesting topics as they are published online. While existing algorithms for event detection have focused on newswire releases, we examine how event detection can work on less structured corpora of blogs. The proliferation of blogs and other forms of self-published media have given rise to an ever-growing corpus of news, commentary and opinion texts. Blogs offer a major advantage for event detection as their content may be rapidly updated. However, blogs texts also pose a significant challenge in that the described events may be less easy to detect given the variety of topics, writing styles and possible author biases. We propose a new way of detecting events in this media by looking for changes in word semantics. We first outline a new algorithm that makes use of a temporally-annotated semantic space for tracking how words change semantics. Then we demonstrate how identified changes could be used to detect new events and their associated blog entries.