Open-source machine learning: R meets Weka

  • Authors:
  • Kurt Hornik;Christian Buchta;Achim Zeileis

  • Affiliations:
  • Wirtschaftsuniversität Wien, Department of Statistics and Mathematics, Augasse 2-6, 1090, Vienna, Austria;Wirtschaftsuniversität Wien, Institute for Tourism and Leisure Studies, Augasse 2-6, 1090, Vienna, Austria;Wirtschaftsuniversität Wien, Department of Statistics and Mathematics, Augasse 2-6, 1090, Vienna, Austria

  • Venue:
  • Computational Statistics - Proceedings of DSC 2007
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Two of the prime open-source environments available for machine/statistical learning in data mining and knowledge discovery are the software packages Weka and R which have emerged from the machine learning and statistics communities, respectively. To make the different sets of tools from both environments available in a single unified system, an R package RWeka is suggested which interfaces Weka’s functionality to R. With only a thin layer of (mostly R) code, a set of general interface generators is provided which can set up interface functions with the usual “R look and feel”, re-using Weka’s standardized interface of learner classes (including classifiers, clusterers, associators, filters, loaders, savers, and stemmers) with associated methods.