How Hadoop Clusters Break

  • Authors:
  • Ariel Rabkin;Randy Katz

  • Affiliations:
  • Princeton University;University of California, Berkeley

  • Venue:
  • IEEE Software
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article describes an examination of a sample of several hundred support tickets for the Hadoop ecosystem, a widely used group of big data storage and processing systems; a taxonomy of errors and how they are addressed by supporters; and the misconfigurations that are the dominant cause of failures. Some design "antipatterns" and missing platform features contribute to these problems. Developers can use various methods to build more robust distributed systems, thereby helping users and administrators prevent some of these rough edges.