Assas-Band, an affix-exception-list based Urdu stemmer

  • Authors:
  • Qurat-ul-Ain Akram;Asma Naseer;Sarmad Hussain

  • Affiliations:
  • NUCES, Pakistan;NUCES, Pakistan;NUCES, Pakistan

  • Venue:
  • ALR7 Proceedings of the 7th Workshop on Asian Language Resources
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Both Inflectional and derivational morphology lead to multiple surface forms of a word. Stemming reduces these forms back to its stem or root, and is a very useful tool for many applications. There has not been any work reported on Urdu stemming. The current work develops an Urdu stemmer or Assas-Band and improves the performance using more precise affix based exception lists, instead of the conventional lexical lookup employed for developing stemmers in other languages. Testing shows an accuracy of 91.2%. Further enhancements are also suggested.