A statistical approach to machine translation
Computational Linguistics
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A general language model for information retrieval (poster abstract)
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
A study of smoothing methods for language models applied to information retrieval
ACM Transactions on Information Systems (TOIS)
Efficiently linking text documents with relevant structured information
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Collective entity resolution in relational data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Extracting product features and opinions from reviews
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Eliminating fuzzy duplicates in data warehouses
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Statistical Language Models for Information Retrieval A Critical Review
Foundations and Trends in Information Retrieval
Foundations and Trends in Databases
Mining opinion features in customer reviews
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Topic identification for fine-grained opinion analysis
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
For a few dollars less: identifying review pages sans human labels
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Collecting evaluative expressions for opinion extraction
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Object matching in tweets with spatial models
Proceedings of the fifth ACM international conference on Web search and data mining
A simple word trigger method for social tag suggestion
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Associating structured records to text documents
Proceedings of the 21st international conference companion on World Wide Web
Hi-index | 0.00 |
We develop a generic method for the review matching problem, which is to match unstructured text reviews to a list of objects, where each object has a set of attributes. To this end, we propose a translation model for generating reviews from a structured description of objects. We develop an EM-based method to estimate the model parameters and use this model to find, given a review, the object most likely to be the topic of the review. We conduct extensive experiments on two large-scale datasets: a collection of restaurant reviews from Yelp and a collection of movie reviews from IMDb. The experiments show that our translation model-based method is superior to traditional tf-idf based methods as well as a recent mixture model-based method for the review matching problem.