Placing search in context: the concept revisited
Proceedings of the 10th international conference on World Wide Web
Automatic labeling of semantic roles
Computational Linguistics
Optimizing search engines using clickthrough data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
The Journal of Machine Learning Research
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Introduction to Information Retrieval
Introduction to Information Retrieval
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Efficient methods for topic model inference on streaming document collections
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Semi-supervised semantic role labeling
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Probase: a probabilistic taxonomy for text understanding
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Short text conceptualization using a probabilistic knowledgebase
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Improving word representations via global context and multiple word prototypes
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Understanding tables on the web
ER'12 Proceedings of the 31st international conference on Conceptual Modeling
Hi-index | 0.00 |
Conceptualization seeks to map a short text (i.e., a word or a phrase) to a set of concepts as a mechanism of understanding text. Most of prior research in conceptualization uses human-crafted knowledge bases that map instances to concepts. Such approaches to conceptualization have the limitation that the mappings are not context sensitive. To overcome this limitation, we propose a framework in which we harness the power of a probabilistic topic model which inherently captures the semantic relations between words. By combining latent Dirichlet allocation, a widely used topic model with Probase, a large-scale probabilistic knowledge base, we develop a corpus-based framework for context-dependent conceptualization. Through this simple but powerful framework, we improve conceptualization and enable a wide range of applications that rely on semantic understanding of short texts, including frame element prediction, word similarity in context, ad-query similarity, and query similarity.