Estimating number of citations using author reputation

Authors:
Carlos Castillo;Debora Donato;Aristides Gionis
Affiliations:
Yahoo! Research Barcelona, Barcelona, Catalunya, Spain;Yahoo! Research Barcelona, Barcelona, Catalunya, Spain;Yahoo! Research Barcelona, Barcelona, Catalunya, Spain
Venue:
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Year:
2007

Citing 11
Cited 6

Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Web Structure, Dynamics and Page Quality

SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
The link prediction problem for social networks

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Overview of the 2003 KDD Cup

ACM SIGKDD Explorations Newsletter
Structure and evolution of blogspace

Communications of the ACM - The Blogosphere
Page quality: in search of an unbiased web ranking

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Graphs over time: densification laws, shrinking diameters and possible explanations

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
A probabilistic approach to spatiotemporal theme pattern mining on weblogs

Proceedings of the 15th international conference on World Wide Web
Temporal Analysis of the Wikigraph

WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
The eigenrumor algorithm for calculating contributions in cyberspace communities

Trusting Agents for Trusting Electronic Societies

On the relationship between novelty and popularity of user-generated content

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Bibliometric positioning of scientific manufacturing journals: a comparative analysis

Scientometrics
Proposals for evaluating the regularity of a scientist's research output

Scientometrics
To better stand on the shoulder of giants

Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
On the Relationship between Novelty and Popularity of User-Generated Content

ACM Transactions on Intelligent Systems and Technology (TIST)
Which links should I use?: a variogram-based selection of relationship measures for prediction of node attributes in temporal multigraphs

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

We study the problem of predicting the popularity of items in a dynamic environment in which authors post continuously new items and provide feedback on existing items. This problem can be applied to predict popularity of blog posts, rank photographs in a photo-sharing system, or predict the citations of a scientific article using author information and monitoring the items of interest for a short period of time after their creation. As a case study, we show how to estimate the number of citations for an academic paper using information about past articles written by the same author(s) of the paper. If we use only the citation information over a short period of time, we obtain a predicted value that has a correlation of r = 0.57 with the actual value. This is our baseline prediction. Our best-performing system can improve that prediction by adding features extracted from the past publishing history of its authors, increasing the correlation between the actual and the predicted values to r = 0.81.