Robust distributed orthogonalization based on randomized aggregation

  • Authors:
  • Wilfried N. Gansterer;Gerhard Niederbrucker;Hana Straková;Stefan Schulze Grotthoff

  • Affiliations:
  • University of Vienna, Vienna, Austria;University of Vienna, Vienna, Austria;University of Vienna, Vienna, Austria;University of Vienna, Vienna, Austria

  • Venue:
  • Proceedings of the second workshop on Scalable algorithms for large-scale systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The construction of distributed algorithms for matrix computations built on top of distributed data aggregation algorithms with randomized communication schedules is investigated. For this purpose, a new aggregation algorithm for summing or averaging distributed values, the push-flow algorithm, is developed, which achieves superior resilience properties with respect to node failures compared to existing aggregation methods. On a hypercube topology it asymptotically requires the same number of iterations as the optimal all-to-all reduction operation and it scales well with the number of nodes. Orthogonalization is studied as a prototypical matrix computation task. A new fault tolerant distributed orthogonalization method (rdmGS), which can produce accurate results even in the presence of node failures, is built on top of distributed data aggregation algorithms.