Search algorithms incorporating some form of topic model have a long history in information retrieval. For example, cluster-based retrieval has been studied since the 1960s and has recently produced good results in the language modeling framework. Latent Dirichlet Allocation (LDA), an approach to building topic models based on a formal generative model of documents, is heavily cited in the machine learning literature, but its feasibility and effectiveness in information retrieval are mostly unknown. In this paper, we study how to use LDA efficiently to improve ad hoc retrieval. We propose an LDA-based document model within the language modeling framework and evaluate it on several TREC collections. Gibbs sampling is employed to conduct approximate inference in LDA, and its computational complexity is analyzed. We show that improvements over retrieval using cluster-based models can be obtained with reasonable efficiency.
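To make the idea of an LDA-based document model more concrete, here is a minimal sketch of one way such a model could be scored in a query-likelihood setting. The abstract does not give the formula, so everything below is an assumption for illustration: the linear interpolation between a Dirichlet-smoothed unigram estimate and a topic-based estimate, the parameter names `mu` and `lam`, the function names, and the toy topic distributions (which in practice would come from an inference procedure such as Gibbs sampling rather than being written by hand).

```python
import math
from collections import Counter


def lda_word_prob(word, doc_topic, topic_word):
    """Topic-based estimate: sum over topics z of P(word | z) * P(z | doc)."""
    return sum(p_z * topic_word[z].get(word, 0.0) for z, p_z in enumerate(doc_topic))


def doc_word_prob(word, doc_tokens, coll_prob, doc_topic, topic_word, mu=10.0, lam=0.7):
    """Hypothetical document model: interpolate a Dirichlet-smoothed unigram
    estimate with the topic-based estimate (assumed form, not from the source)."""
    counts = Counter(doc_tokens)
    p_dir = (counts[word] + mu * coll_prob.get(word, 0.0)) / (len(doc_tokens) + mu)
    p_lda = lda_word_prob(word, doc_topic, topic_word)
    return lam * p_dir + (1.0 - lam) * p_lda


def query_log_likelihood(query, doc_tokens, coll_prob, doc_topic, topic_word, mu=10.0, lam=0.7):
    """Log-probability of the query terms under the interpolated document model."""
    total = 0.0
    for word in query:
        p = doc_word_prob(word, doc_tokens, coll_prob, doc_topic, topic_word, mu, lam)
        if p <= 0.0:
            return float("-inf")  # term unseen in both the collection and the topics
        total += math.log(p)
    return total


# Toy collection (illustrative data only).
docs = {
    "A": ["car", "engine", "car"],
    "B": ["apple", "banana", "apple"],
    "C": ["automobile", "engine"],
}
coll_counts = Counter(tok for toks in docs.values() for tok in toks)
coll_total = sum(coll_counts.values())
coll_prob = {w: c / coll_total for w, c in coll_counts.items()}

# Hand-written stand-ins for inferred P(word | topic) and P(topic | doc).
topic_word = [
    {"car": 0.4, "automobile": 0.3, "engine": 0.3},  # topic 0: vehicles
    {"apple": 0.6, "banana": 0.4},                   # topic 1: fruit
]
doc_topic = {"A": [0.9, 0.1], "B": [0.1, 0.9], "C": [0.95, 0.05]}

query = ["automobile"]
scores = {
    name: query_log_likelihood(query, docs[name], coll_prob, doc_topic[name], topic_word)
    for name in ("A", "B")
}
```

In this toy setup neither document A nor document B contains the query term, so a purely smoothed unigram model cannot separate them. The topic component raises the score of A, whose topic mixture favors the vehicle topic, which is the kind of effect a topic-based document model is meant to provide.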