Unsupervised learning by probabilistic latent semantic analysis
Machine Learning
Two biomedical sublanguages: a description based on the theories of Zellig Harris
Journal of Biomedical Informatics - Special issue: Sublanguage
Automatic verb classification based on statistical distributions of argument structure
Computational Linguistics
Automatic extraction of subcategorization from corpora
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Using a probabilistic class-based lexicon for lexical ambiguity resolution
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Clustering polysemic subcategorization frame distributions semantically
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Spectral clustering for German verbs
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
The choice of features for classification of verbs in biomedical texts
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Automatic fine-grained semantic classification for domain adaptation
STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
Unsupervised and constrained Dirichlet process mixture models for verb clustering
GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
Verb class discovery from rich syntactic data
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Active learning for constrained Dirichlet process mixture models
GEMS '10 Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics
Pattern-based synonym and antonym extraction
Proceedings of the 48th Annual Southeast Regional Conference
Learning syntactic verb frames using graphical models
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Hi-index | 0.00 |
Lexical classes, when tailored to the application and domain in question, can provide an effective means to deal with a number of natural language processing (NLP) tasks. While manual construction of such classes is difficult, recent research shows that it is possible to automatically induce verb classes from cross-domain corpora with promising accuracy. We report a novel experiment where similar technology is applied to the important, challenging domain of biomedicine. We show that the resulting classification, acquired from a corpus of biomedical journal articles, is highly accurate and strongly domain-specific. It can be used to aid BIO-NLP directly or as useful material for investigating the syntax and semantics of verbs in biomedical texts.