A lemmatization method for Mongolian and its application to indexing for information retrieval
Information Processing and Management: an International Journal
Lemmatization of Polish person names
ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Spejd: A Shallow Processing and Morphological Disambiguation Tool
Human Language Technology. Challenges of the Information Society
A morphosyntactic Brill Tagger for inflectional languages
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Automatic identification of legal terms in czech law texts
Semantic Processing of Legal Texts
Polish language processing chains for multilingual information systems
NLDB'12 Proceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems
Hi-index | 0.00 |
While morphological analysers and taggers usually assign lemmata to wordforms, those tools focus on single words. For some tasks a tool that lemmatises (and thus normalises) whole phrases would be more appropriate. The paper presents, discusses and evaluates a set of tools to lemmatise nominal groups, based on a shallow grammar for Polish. The tools reach an overall success rate of over 58%, and almost 83% on the nominal groups that are correctly recognised by the grammar. The approach should be portable to other languages, especially those morphologically rich.