Panorama: a semantic-aware application search framework

Authors:
Di Jiang;Jan Vosecky;Kenneth Wai-Ting Leung;Wilfred Ng
Affiliations:
The Hong Kong University of Science and Technology, Kowloon, Hong Kong;The Hong Kong University of Science and Technology, Kowloon, Hong Kong;The Hong Kong University of Science and Technology, Kowloon, Hong Kong;The Hong Kong University of Science and Technology, Kowloon, Hong Kong
Venue:
Proceedings of the 16th International Conference on Extending Database Technology
Year:
2013

Citing 20
Cited 1

Automatic Text Summarization Using a Machine Learning Approach

SBIA '02 Proceedings of the 16th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Latent dirichlet allocation

The Journal of Machine Learning Research
Web metasearch: rank vs. score based rank aggregation methods

Proceedings of the 2003 ACM symposium on Applied computing
Efficient query evaluation using a two-level retrieval process

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
The author-topic model for authors and documents

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
A system for query-specific document summarization

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Efficient document retrieval in main memory

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Graphical Models, Exponential Families, and Variational Inference

Foundations and Trends® in Machine Learning
A study of global inference algorithms in multi-document summarization

ECIR'07 Proceedings of the 29th European conference on IR research
Using BM25F for semantic search

Proceedings of the 3rd International Semantic Search Workshop
Information Retrieval: Implementing and Evaluating Search Engines

Information Retrieval: Implementing and Evaluating Search Engines
Multi-document summarization via the minimum dominating set

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Geographical topic discovery and comparison

Proceedings of the 20th international conference on World wide web
A class of submodular functions for document summarization

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Faster top-k document retrieval using block-max indexes

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Structured index organizations for high-throughput text querying

SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
Optimized top-k processing with global page scores on block-max indexes

Proceedings of the fifth ACM international conference on Web search and data mining
Plink-LDA: using link as prior information in topic modeling

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
G-WSTD: a framework for geographic web search topic discovery

Proceedings of the 21st ACM international conference on Information and knowledge management

Dynamic multi-faceted topic discovery in twitter

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Third-party applications (or commonly referred to the apps) proliferate on the web and mobile platforms in recent years. The tremendous amount of available apps in app market-places suggests the necessity of designing effective app search engines. However, existing app search engines typically ignore the latent semantics in the app corpus and thus usually fail to provide high-quality app snippets and effective app rankings. In this paper, we present a novel framework named Panorama to provide independent search results for Android apps with semantic awareness. We first propose the App Topic Model (ATM) to discover the latent semantics from the app corpus. Based on the discovered semantics, we tackle two central challenges that are faced by current app search engines: (1) how to generate concise and informative snippets for apps and (2) how to rank apps effectively with respect to search queries. To handle the first challenge, we propose several new metrics for measuring the quality of the sentences in app description and develop a greedy algorithm with fixed probability guarantee of near-optimal performance for app snippet generation. To handle the second challenge, we propose a variety of new features for app ranking and also design a new type of inverted index to support efficient Top-k app retrieval. We conduct extensive experiments on a large-scale data collection of Android apps and build an app search engine prototype for human-based performance evaluation. The proposed framework demonstrates superior performance against several strong baselines with respect to different metrics.