A Very Large Database of Collocations and Semantic Links

Authors:
Igor A. Bolshakov;Alexander F. Gelbukh
Affiliations:
-;-
Venue:
NLDB '00 Proceedings of the 5th International Conference on Applications of Natural Language to Information Systems-Revised Papers
Year:
2000

Citing 5
Cited 5

Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Multifunction thesaurus for Russian word processing

ANLC '94 Proceedings of the fourth conference on Applied natural language processing
Automatic learning for semantic collocation

ANLC '92 Proceedings of the third conference on Applied natural language processing
Large scale collocation data and their application to Japanese word processor technology

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Acquisition of lexical information: from a large textual Italian corpus

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3

On Semantic Classification of Modifiers

NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
Heuristics-Based Replenishment of Collocation Databases

PorTAL '02 Proceedings of the Third International Conference on Advances in Natural Language Processing
Dictionary-Based Method for Coherence Maintenance in Man-Machine Dialogue with Indirect Antecedents and Ellipses

TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue
A method of linguistic steganography based on collocationally-verified synonymy

IH'04 Proceedings of the 6th international conference on Information Hiding
DILUCT: an open-source spanish dependency parser based on rules, heuristics, and selectional preferences

NLDB'06 Proceedings of the 11th international conference on Applications of Natural Language to Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A computational system manages a very large database of collocations (word combinations) and semantic links. The collocations are related (in the meaning of a dependency grammar) word pairs, joint immediately or through prepositions. Synonyms, antonyms, subclasses, superclasses, etc. represent semantic relations and form a thesaurus. The structure of the system is universal, so that its language-dependent parts are easily adjustable to any specific language (English, Spanish, Russian, etc.). Inference rules for prediction of highly probable new collocations automatically enrich the database at runtime. The inference is assisted by the available thesaurus links. The aim of the system is word processing, foreign language learning, parse filtering, and lexical disambiguation.