An alternative approach to tagging
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
To help safeguard the Amazigh heritage from the threat of disappearance, it seems opportune to equip the language with the means it needs to meet the challenges of access to New Information and Communication Technologies (ICT). In this context, and with a view to building tools and linguistic resources for the automatic processing of Amazigh, we have undertaken to develop a module for automatic lexical analysis of Amazigh that can recognize lexical units in texts. To achieve this goal, we first formalized the Amazigh vocabulary, namely nouns, verbs and particles. This work began with the formalization of two of these categories, nouns and particles, by building a dictionary named "EDicAm" (Electronic Dictionary for Amazigh), in which each entry is associated with linguistic information such as its lexical category and its semantic distributional classes.
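As a rough illustration of the kind of lookup such a dictionary could support, the following minimal sketch models an entry with a lexical category and an optional semantic class, then tags the tokens of a text against a toy lexicon. It is not the EDicAm format or the authors' module: the `Entry` fields, the `analyze` function, the two sample entries and their labels are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Entry:
    """One dictionary entry (hypothetical layout, not the EDicAm schema)."""
    lemma: str
    category: str                         # e.g. "noun" or "particle"
    semantic_class: Optional[str] = None  # illustrative label only


# Toy lexicon with placeholder entries in Latin transliteration;
# these are assumptions for the example, not data taken from EDicAm.
LEXICON = {
    "argaz": Entry("argaz", "noun", "human"),
    "d": Entry("d", "particle"),
}


def analyze(text: str) -> List[Tuple[str, Optional[Entry]]]:
    """Split text into word tokens and look each one up in the lexicon.

    Tokens without an entry are paired with None.
    """
    tokens = re.findall(r"\w+", text.lower())
    return [(token, LEXICON.get(token)) for token in tokens]


if __name__ == "__main__":
    for token, entry in analyze("argaz d xyz"):
        label = entry.category if entry else "unknown"
        print(f"{token}: {label}")
```

A real analyzer would also need to handle Tifinagh script, inflected forms and the verb category, none of which this sketch attempts.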