Compiling machine learning algorithms with SystemML

  • Authors:
  • M. Boehm;D. Burdick;A. Evfimievski;B. Reinwald;P. Sen;S. Tatikonda;Y. Tian

  • Affiliations:
  • IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA

  • Venue:
  • Proceedings of the 4th annual Symposium on Cloud Computing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analytics on big data range from passenger volume prediction in transportation to customer satisfaction in automotive diagnostic systems, and from correlation analysis in social media data to log analysis in manufacturing. Expressing and running these analytics for varying data characteristics and at scale is challenging. To address these challenges, SystemML implements a declarative, high-level language using an R-like syntax extended with machine-learning-specific constructs, that is compiled to a MapReduce runtime [2]. The language is rich enough to express a wide class of statistical, predictive modeling and machine learning algorithms (Fig. 1). We chose robust algorithms that scale to large, and potentially sparse data with many features.