Importance Sampling for a Monte Carlo Matrix Multiplication Algorithm, with Application to Information Retrieval

  • Authors:
  • Sylvester Eriksson-Bique;Mary Solbrig;Michael Stefanelli;Sarah Warkentin;Ralph Abbey;Ilse C. F. Ipsen

  • Affiliations:
  • sylvester.eriksson-bique@helsinki.fi;solbrigm@reed.edu;stefane3@tcnj.edu;swarkentin@hmc.edu;rwabbey@ncsu.edu and ipsen@math.ncsu.edu;-

  • Venue:
  • SIAM Journal on Scientific Computing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We perform importance sampling for a randomized matrix multiplication algorithm by Drineas, Kannan, and Mahoney and derive probabilities that minimize the expected value (with regard to the distributions of the matrix elements) of the variance. We compare these optimized probabilities with uniform probabilities and derive conditions under which the actual variance of the optimized probabilities is lower. Numerical experiments with query matching in information retrieval applications illustrate that the optimized probabilities produce more accurate matchings than the uniform probabilities and that they can also be computed efficiently.