An efficient algorithm for topic ranking and modeling topic evolution

Authors:
Kumar Shubhankar;Aditya Pratap Singh;Vikram Pudi
Affiliations:
Center for Data Engineering, International Institute of Information Technology, Hyderabad, India;Center for Data Engineering, International Institute of Information Technology, Hyderabad, India;Center for Data Engineering, International Institute of Information Technology, Hyderabad, India
Venue:
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Year:
2011

Citing 12
Cited 2

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Efficient mining of association rules using closed itemset lattices

Information Systems
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Frequent term-based text clustering

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Probabilistic author-topic models for information discovery

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A Maximal Frequent Itemset Approach for Web Document Clustering

CIT '04 Proceedings of the The Fourth International Conference on Computer and Information Technology
Discovering evolutionary theme patterns from text: an exploration of temporal text mining

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Detecting research topics via the correlation between graphs and texts

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Topic Detection by Clustering Keywords

DEXA '08 Proceedings of the 2008 19th International Conference on Database and Expert Systems Application
Toward Theme Development Analysis with Topic Clustering

ICACTE '08 Proceedings of the 2008 International Conference on Advanced Computer Theory and Engineering

Historical analysis of legal opinions with a sparse mixed-effects latent variable model

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Evolution of Author's Topic in Authorship Network

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we introduce a novel and efficient approach to detect and rank topics in a large corpus of research papers. With rapidly growing size of academic literature, the problem of topic detection and topic ranking has become a challenging task. We present a unique approach that uses closed frequent keyword-set to form topics. We devise a modified time independent PageRank algorithm that assigns an authoritative score to each topic by considering the sub-graph in which the topic appears, producing a ranked list of topics. The use of citation network and the introduction of time invariance in the topic ranking algorithm reveal very interesting results. Our approach also provides a clustering technique for the research papers using topics as similarity measure. We extend our algorithms to study various aspects of topic evolution which gives interesting insight into trends in research areas over time. Our algorithms also detect hot topics and landmark topics over the years. We test our algorithms on the DBLP dataset and show that our algorithms are fast, effective and scalable.