Foundations of statistical natural language processing
Foundations of statistical natural language processing
Using Electronic Texts for an Annotated Corpus Building
ENC '03 Proceedings of the 4th Mexican International Conference on Computer Science
A method of linguistic steganography based on collocationally-verified synonymy
IH'04 Proceedings of the 6th international conference on Information Hiding
An experiment in detection and correction of malapropisms through the web
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Hi-index | 0.00 |
Stable coordinate pairs (SCP) like comentarios y sugerencias ‘comments and suggestions’ or sano y salvo ‘safe and sound’ are rather frequent in texts in Spanish, though there are only few thousands of them in language. We characterize SCPs statistically by a numerical Stable Connection Index and reveal its unimodal distribution. We also propose lexical, morphologic, syntactic, and semantic categories for SCP structural description — for both a whole SCP and its components. It is argued that database containing a set of categorized SCPs facilitates several tasks of automatic NLP.. The research is based on a set of ca. 2200 Spanish coordinate pairs.