PanLex and LEXTRACT: translating all words of all languages of the world

  • Authors:
  • Timothy Baldwin; Jonathan Pool; Susan M. Colowick

  • Affiliations:
  • University of Melbourne; Utilika Foundation; Utilika Foundation

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations
  • Year:
  • 2010

Abstract

PanLex is a lemmatic translation resource that combines a large number of translation dictionaries and other translingual lexical resources. It currently covers 1353 language varieties and 12M expressions, but aims to cover all languages and up to 350M expressions. This paper describes the resource and its current applications, as well as lextract, a new effort to expand the coverage of PanLex via semi-automatic dictionary scraping.