Leveraging microblogging big data with a modified density-based clustering approach for event awareness and topic ranking

Authors:
Chung-Hong Lee;Tzan-Feng Chien
Affiliations:
;
Venue:
Journal of Information Science
Year:
2013

Citing 27
Cited 1

Learning in the presence of concept drift and hidden contexts

Machine Learning
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Extracting significant time varying features from text

Proceedings of the eighth international conference on Information and knowledge management
Automatic generation of overview timelines

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Incremental Clustering for Mining in a Data Warehousing Environment

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Bursty and hierarchical structure in streams

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering Data Streams: Theory and Practice

IEEE Transactions on Knowledge and Data Engineering
Mining data streams: a review

ACM SIGMOD Record
2005 Special Issue: Efficient streaming text clustering

Neural Networks - 2005 Special issue: IJCNN 2005
Evolutionary clustering

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for clustering evolving data streams

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A framework for projected clustering of high dimensional data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Why we twitter: understanding microblogging usage and communities

Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Tracking clusters in evolving data streams over sliding windows

Knowledge and Information Systems
Using twitter to recommend real-time topical news

Proceedings of the third ACM conference on Recommender systems
User interests in social media sites: an exploration with micro-blogs

Proceedings of the 18th ACM conference on Information and knowledge management
Integrating web-based intelligence retrieval and decision-making from the twitter trends knowledge base

Proceedings of the 2nd ACM workshop on Social web search and mining
TwitterStand: news in tweets

Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Twitter power: Tweets as electronic word of mouth

Journal of the American Society for Information Science and Technology
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams

Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
TwitterMonitor: trend detection over the twitter stream

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
On the real-time web as a source of recommendation knowledge

Proceedings of the fourth ACM conference on Recommender systems
Addressing Concept-Evolution in Concept-Drifting Data Streams

ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Sentiment in Twitter events

Journal of the American Society for Information Science and Technology
Hip and trendy: Characterizing emerging trends on Twitter

Journal of the American Society for Information Science and Technology
BursT: a dynamic term weighting scheme for mining microblogging messages

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
Mining spatio-temporal information on microblogging streams using a density-based online clustering method

Expert Systems with Applications: An International Journal

Exploring temporal relationships between scientific and technical fronts: a case of biotechnology field

Scientometrics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Although diverse groups argue about the potential and true value benefits from social-media big data, there is no doubt that the era of big data exploitation has begun, driving the development of novel data-centric applications. Big data is notable not only because of its size, but also because of the complexity caused by its relationality to other data. In the past, owing to the limited possibilities of accessing big data, few data sources were available to allow researchers to develop advanced data-driven applications, such as monitoring of emerging real-world events. In fact, social media is greatly impacting the growth of big data; and big data is providing enterprises with the data to help them understand how to better detect marketing demands. Microblogging is a social network service capable of aggregating messages to explore facts and unknown knowledge. Nowadays, people often attempt to search for trending news and hot topics in real time from microblogging messages to satisfy their information needs. Under such a circumstance, a real demand is to find a way to allow users to organize a large number of microblogging messages into understandable events. In this work, we attempt to tackle such challenges by developing an online text-stream clustering approach using a modified density-based clustering model with collected microblogging big data. The system kernel combines three technical components, including a dynamic term weighting scheme, a neighbourhood generation algorithm and an online density-based clustering technique. After acquiring detected event topics by the system, our system provides functions for recommending top-priority event information to assist people to effectively organize emerging event data through the developed topic ranking algorithm.