Foundations of statistical natural language processing
Foundations of statistical natural language processing
Methoden zum qualitativen Vergleich von Signifikanzmaßen zur Kollokationsidentifikation
KONVENS 2000 / Sprachkommunikation, Vorträge der gemeinsamen Veranstaltung 5. Konferenz zur Verarbeitung natürlicher Sprache (KONVENS), 6. ITG-Fachtagung "Sprachkommunikation"
Extracting the lowest-frequency words: pitfalls and possibilities
Computational Linguistics
Accurate methods for the statistics of surprise and coincidence
Computational Linguistics - Special issue on using large corpora: I
Word association norms, mutual information, and lexicography
ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
Experiments on candidate data for collocation extraction
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Wordform- and class-based prediction of the components of German nominal compounds in an AAC system
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Finding new terminology in very large corpora
Proceedings of the 3rd international conference on Knowledge capture
Acquiring collocations for lexical choice between near-synonyms
ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
A nonparametric method for extraction of candidate phrasal terms
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Significance tests for the evaluation of ranking methods
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Collocation extraction based on modifiability statistics
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Paradigmatic modifiability statistics for the extraction of complex multi-word terms
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Unsupervised Multilingual Sentence Boundary Detection
Computational Linguistics
Combining association measures for collocation extraction
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Reviewing and Evaluating Automatic Term Recognition Techniques
GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Unsupervised type and token identification of idiomatic expressions
Computational Linguistics
Word lookup on the basis of associations: from an idea to a roadmap
ElectricDict '04 Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries
Multilingual collocation extraction: issues and solutions
MLRI '06 Proceedings of the Workshop on Multilingual Language Resources and Interoperability
Automatic identification of non-compositional multi-word expressions using latent semantic analysis
MWE '06 Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties
A measure of syntactic flexibility for automatically identifying multiword expressions in corpora
MWE '07 Proceedings of the Workshop on a Broader Perspective on Multiword Expressions
In Search of Semantic Compositionality in Vector Spaces
ICCS '09 Proceedings of the 17th International Conference on Conceptual Structures: Conceptual Structures: Leveraging Semantic Technologies
An extensive empirical study of collocation extraction methods
ACLstudent '05 Proceedings of the ACL Student Research Workshop
Mining linguistic cues for query expansion: applications to drug interaction search
Proceedings of the 18th ACM conference on Information and knowledge management
Product feature categorization with multilevel latent semantic association
Proceedings of the 18th ACM conference on Information and knowledge management
Using small random samples for the manual evaluation of statistical association measures
Computer Speech and Language
A re-examination of lexical association measures
MWE '09 Proceedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation and Applications
TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
Web-based model for disambiguation of prepositional phrase usage
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Various criteria of collocation cohesion in internet: comparison of resolving power
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Finding domain specific collocations and concordances on the web
MCTLLL '09 Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning
Identifying multi-word expressions by leveraging morphological and syntactic idiosyncrasy
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Finding the storyteller: automatic spoiler tagging using linguistic cues
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Collocation extraction in Turkish texts using statistical methods
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
A machine learning approach to relational noun mining in German
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
An n-gram frequency database reference to handle MWE extraction in NLP applications
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Unsupervised learning of p NP p word combinations
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Towards the automatic learning of idiomatic prepositional phrases
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
A cascaded classification approach to semantic head recognition
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Relative compositionality of multi-word expressions: a study of verb-noun (v-n) collocations
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Massive biomedical term discovery
DS'05 Proceedings of the 8th international conference on Discovery Science
A lexical database of portuguese multiword expressions
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Corpus-Based acquisition of support verb constructions for portuguese
PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
Automatic construction and enrichment of informal ontologies: A survey
Programming and Computing Software
Hi-index | 0.00 |
This paper presents methods for a qualitative, unbiased comparison of lexical association measures and the results we have obtained for adjective-noun pairs and preposition-noun-verb triples extracted from German corpora. In our approach, we compare the entire list of candidates, sorted according to the particular measures, to a reference set of manually identified "true positives". We also show how estimates for the very large number of hapaxlegomena and double occurrences can be inferred from random samples.