Partial parsing via finite-state cascades
Natural Language Engineering
Outilex, a linguistic platform for text processing
COLING-ACL '06 Proceedings of the COLING/ACL on Interactive presentation sessions
Hi-index | 0.00 |
Language is full of multiword unit expressions that form basic semantic units. The identification of these structures limits the combinatorial complexity induced by lexical ambiguity. In this paper, we detail an experiment that largely integrates these notions in a finite-state procedure of segmentation into super-chunks, preliminary to a parser. We show that the chunker, developped for French, reaches 92.9% precision and 98.7% recall.