On the evaluation and improvement of Arabic WordNet coverage and usability

  • Authors:
  • Lahsen Abouenour;Karim Bouzoubaa;Paolo Rosso

  • Affiliations:
  • Mohammadia School of Engineers, Mohammed V University-Agdal, Rabat, Morocco;Mohammadia School of Engineers, Mohammed V University-Agdal, Rabat, Morocco;Natural Language Engineering Lab., ELiRF, Universitat Politècnica de València, Valencia, Spain

  • Venue:
  • Language Resources and Evaluation
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Built on the basis of the methods developed for Princeton WordNet and EuroWordNet, Arabic WordNet (AWN) has been an interesting project which combines WordNet structure compliance with Arabic particularities. In this paper, some AWN shortcomings related to coverage and usability are addressed. The use of AWN in question/answering (Q/A) helped us to deeply evaluate the resource from an experience-based perspective. Accordingly, an enrichment of AWN was built by semi-automatically extending its content. Indeed, existing approaches and/or resources developed for other languages were adapted and used for AWN. The experiments conducted in Arabic Q/A have shown an improvement of both AWN coverage as well as usability. Concerning coverage, a great amount of named entities extracted from YAGO were connected with corresponding AWN synsets. Also, a significant number of new verbs and nouns (including Broken Plural forms) were added. In terms of usability, thanks to the use of AWN, the performance for the AWN-based Q/A application registered an overall improvement with respect to the following three measures: accuracy (+9.27聽% improvement), mean reciprocal rank (+3.6 improvement) and number of answered questions (+12.79聽% improvement).