Experiments with English-Persian text retrieval

  • Authors:
  • Abolfazl AleAhmad;Hadi Amiri;Masoud Rahgozar;Farhad Oroumchian

  • Affiliations:
  • University of Tehran, Tehran, Iran;University of Tehran, Tehran, Iran;University of Tehran, Tehran, Iran;University of Wollongong in Dubai, Dubai, Uae

  • Venue:
  • Proceedings of the 2nd ACM workshop on Improving non english web searching
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the number of non-English documents is increasing dramatically on the web nowadays, the study and design of information retrieval systems for these languages is very important. The Persian language is the official language of Iran, Afghanistan and Tajikistan and is also spoken in some other countries in the Middle East, so there are significant amount of Persian documents available on the web. In this study, we will present and compare our English-Persian cross language text retrieval experiments on Hamshahri text collection. Also, we will present Combinatorial Translation Probability (CTP) calculation method for query translation that estimates translation probabilities based on the collection itself.