Knowledge-free discovery of domain-specific multiword units

Authors:
Axel-Cyrille Ngonga Ngomo
Affiliations:
Institute of Computer Sciences, Leipzig, Germany
Venue:
Proceedings of the 2008 ACM symposium on Applied computing
Year:
2008

Abstract

The discovery of multiword units is one of the key steps in the preprocessing of raw text. In this paper, we propose a knowledge-free approach for the discovery of such entities. It not only outperforms state-of-the-art approaches but is also fully unsupervised. Furthermore, it does not require setting any threshold, making it suitable for use by non-experts. The proposed approach is evaluated against five other metrics on a medical corpus.