Do-I-Care: a collaborative Web agent
Conference Companion on Human Factors in Computing Systems
Electronic document addressing: dealing with change
ACM Computing Surveys (CSUR)
Web page change and persistence---a four-year longitudinal study
Journal of the American Society for Information Science and Technology
What are the Characteristics of Digital Genres? - Genre Theory from a Multi-Modal Perspective
HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 4 - Volume 04
Evolution of web site design patterns
ACM Transactions on Information Systems (TOIS)
Effects of web document evolution on genre classification
Proceedings of the 14th ACM international conference on Information and knowledge management
Towards light semantic processing for Question Answering
HLT-NAACL-TEXTMEANING '03 Proceedings of the HLT-NAACL 2003 workshop on Text meaning - Volume 9
System for spatio-temporal analysis of online news and blogs
Proceedings of the 15th international conference on World Wide Web
ICML '06 Proceedings of the 23rd international conference on Machine learning
Data association for topic intensity tracking
ICML '06 Proceedings of the 23rd international conference on Machine learning
A social hypertext model for finding community in blogs
Proceedings of the seventeenth conference on Hypertext and hypermedia
Topics over time: a non-Markov continuous-time model of topical trends
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
BuzzTrack: topic detection and tracking in email
Proceedings of the 12th international conference on Intelligent user interfaces
Longitudinal study of changes in blogs
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
An analysis of personal collections among users of social media
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
New structures of video collections
Proceedings of the 2012 iConference
A quantitative evaluation of techniques for detection of abnormal change events in blogs.
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Hi-index | 0.00 |
Information on the Internet, especially blog content, changes rapidly. Users of information collections, such as the blogs hosted by technorati.com, have little, if any, control over the content or frequency of these changes. However, it is important for users to be able to monitor content for deviations in the expected pattern of change. If a user is interested in political blogs and a blog switches subjects to a literary review blog, the user would want to know of this change in behavior. Since pages may change too frequently for manual inspection for "unwanted" changes, an automated approach is wanted. In this paper, we explore methods for indentifying unexpected change by using Kalman filters to model blog behavior over time. Using this model, we examine the history of several blogs and determine methods for flagging the significance of a blog's change from one time step to the next. We are able to predict large deviations in blog content, and allow user-defined sensitivity parameters to tune a statistical threshold of significance for deviation from expectation.