A scalable approach to column-based low-rank matrix approximation

  • Authors:
  • Yifan Pi;Haoruo Peng;Shuchang Zhou;Zhihua Zhang

  • Affiliations:
  • Institute for Theoretical Computer Science, IIIS, Tsinghua University, Beijing, China;Department of Computer Science & Technology, Tsinghua University, Beijing, China;State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China and Google Inc., Beijing, China;College of Computer Science & Technology, Zhejiang University, Hangzhou, China

  • Venue:
  • IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we address the column-based low-rank matrix approximation problem using a novel parallel approach. Our approach is based on the divide-and-combine idea. We first perform column selection on submatrices of an original data matrix in parallel, and then combine the selected columns into the final output. Our approach enjoys a theoretical relative-error upper bound. In addition, our column-based low-rank approximation partitions data in a deterministic way and makes no assumptions about matrix coherence. Compared with other traditional methods, our approach is scalable on large-scale matrices. Finally, experiments on both simulated and real world data show that our approach is both efficient and effective.