Foundations of statistical natural language processing
Foundations of statistical natural language processing
Learning dictionaries for information extraction by multi-level bootstrapping
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Snowball: extracting relations from large plain-text collections
DL '00 Proceedings of the fifth ACM conference on Digital libraries
A probabilistic model of information retrieval: development and comparative experiments Part 2
Information Processing and Management: an International Journal
Accurate methods for the statistics of surprise and coincidence
Computational Linguistics - Special issue on using large corpora: I
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Word-sense disambiguation using statistical models of Roget's categories trained on large corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity
Computational Linguistics
A bootstrapping method for learning semantic lexicons using extraction pattern contexts
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A general framework for distributional similarity
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Unsupervised named-entity extraction from the Web: An experimental study
Artificial Intelligence
Directional distributional similarity for lexical expansion
ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Automatic set instance extraction using the web
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Web-scale distributional similarity and entity set expansion
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Inducing fine-grained semantic classes via hierarchical and collective classification
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Hi-index | 0.00 |
The paper describes a semi-supervised approach to extracting multiword units that belong to a specific semantic class of entities. The approach uses a small set of seed words representing the target class, and calculates distributional similarity between the candidate and seed words. We adapt a well-known document ranking function, BM25, to the task of calculating similarity between vectors of context features representing seed words and candidate words, and perform a systematic comparison to a number of distributional similarity measures. We then introduce a method for ranking multiword units by the likelihood of belonging to the target semantic class. The task used for evaluation is extraction of restaurant dish names from the corpus of 157,865 restaurant reviews.