Language processing for Arabic microblog retrieval

  • Authors:
  • Kareem Darwish; Walid Magdy; Ahmed Mourad

  • Affiliations:
  • Qatar Computing Research Institute, Doha, Qatar; Qatar Computing Research Institute, Doha, Qatar; Qatar Computing Research Institute, Doha, Qatar

  • Venue:
  • Proceedings of the 21st ACM international conference on Information and knowledge management
  • Year:
  • 2012

Abstract

The use of social media has profoundly affected social and political dynamics in the Arab world. In this paper, we explore Arabic microblog retrieval. We illustrate some of the challenges associated with it, which stem mainly from the use of different Arabic dialects that vary in lexical selection, morphology, and phonetics, and that lack orthographic and spelling conventions. We present some of the processing required for effective retrieval, such as improved letter normalization, elongated word handling, stopword removal, and stemming.
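
To make the preprocessing steps named in the abstract more concrete, the following is a minimal Python sketch of letter normalization, elongated word handling, and stopword removal. It is not the authors' implementation: the mapping table, the rule of collapsing runs of three or more identical characters, and the function names `normalize_token` and `preprocess` are all assumptions based on common Arabic text normalization practice. Stemming is left out because it needs a dedicated stemmer.

```python
import re

# Assumed letter normalization table (a common convention, not taken from the paper).
_LETTER_MAP = str.maketrans({
    "\u0622": "\u0627",  # alef with madda       -> bare alef
    "\u0623": "\u0627",  # alef with hamza above -> bare alef
    "\u0625": "\u0627",  # alef with hamza below -> bare alef
    "\u0649": "\u064A",  # alef maqsura          -> ya
    "\u0629": "\u0647",  # ta marbuta            -> ha
})

# Short-vowel diacritics (U+064B to U+0652) and the tatweel stretching mark (U+0640).
_DIACRITICS_AND_TATWEEL = re.compile("[\u064B-\u0652\u0640]")

# Assumed elongation rule: three or more copies of a character are treated as emphasis.
_ELONGATION = re.compile(r"(.)\1{2,}")


def normalize_token(token: str) -> str:
    """Strip diacritics and tatweel, unify letter variants, collapse elongation."""
    token = _DIACRITICS_AND_TATWEEL.sub("", token)
    token = token.translate(_LETTER_MAP)
    return _ELONGATION.sub(r"\1", token)


def preprocess(text: str, stopwords=frozenset()) -> list:
    """Whitespace-tokenize, normalize each token, and drop stopwords.

    The stopword set is assumed to be in normalized form already.
    """
    tokens = (normalize_token(t) for t in text.split())
    return [t for t in tokens if t and t not in stopwords]
```

Under these assumptions, an elongated token such as "لاااا" is reduced to "لا", and "أحمد" and "احمد" map to the same index term. Collapsing only runs of three or more keeps legitimately doubled letters intact, at the cost of missing two-letter elongations.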