NE-Rank: A Novel Graph-Based Keyphrase Extraction in Twitter

  • Authors:
  • Abdelghani Bellaachia;Mohammed Al-Dhelaan

  • Affiliations:
  • -;-

  • Venue:
  • WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The massive growth of the micro-blogging service Twitter has shed the light on the challenging problem of summarizing a collection of large number of tweets. This paper attempts to extract topical key phrases that would represent topics in tweets. Due to the informality, noise, and short length of tweets, such research is nontrivial. We tackle such challenges with extensive preprocessing approach. Followed by, introduction of new features that improve topical key phrase extraction in Twitter. We start by proposing a novel unsupervised graph-based keyword ranking method, called NE-Rank, that considers word weights in addition to edge weights when calculating the ranking. Then we introduce a new approach of leveraging hash tags when extracting key phrases. We have conducted a set of experiments showing the potential of both approaches with 16% to 39% improvement for NE-Rank and 20% improvement for hash tag enhanced extraction.