An alternative approach to tagging

Authors:
Max Silberztein
Affiliations:
LASELDI, Université de Franche-Comté, Besançon, France
Venue:
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Year:
2007

Citing 4
Cited 6

Natural language processing for information retrieval

Communications of the ACM
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
A stochastic parts program and noun phrase parser for unrestricted text

ANLC '88 Proceedings of the second conference on Applied natural language processing
Semi-automatic recognition of noun modifier relationships

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1

Exploiting noun phrases and semantic relationships for text document clustering

Information Sciences: an International Journal
Towards an electronic dictionary of Tamajaq language in Niger

AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
Data-driven approach based on semantic roles for recognizing temporal expressions and events in Chinese

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
OpenLogos machine translation: philosophy, model, resources and customization

Machine Translation
Towards a cascade of morpho-syntactic tools for arabic natural language processing

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Finite state morphology for amazigh language

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

NooJ is a linguistic development environment that allows users to construct large formalised dictionaries and grammars and use these resources to build robust NLP applications. NooJ's approach to the formalisation of natural languages is bottom-up: linguists start by formalising basic phenomena such as spelling and morphology, and then formalise higher and higher linguistic levels, moving up towards the sentence level. NooJ provides parsers that operate in cascade at each individual level of the formalisation: tokenizers, morphological analysers, simple and compound terms indexers, disambiguation tools, syntactic parsers, named entities annotators and semantic analysers. This architecture requires NooJ's parsers to communicate via a Text Annotation Structure that stores both correct results and erroneous hypotheses (to be deleted later).