Experiment databases

  • Authors:
  • Joaquin Vanschoren;Hendrik Blockeel;Bernhard Pfahringer;Geoffrey Holmes

  • Affiliations:
  • LIACS, Universiteit Leiden, Leiden, The Netherlands 2333CA and Dept. of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium 3001;LIACS, Universiteit Leiden, Leiden, The Netherlands 2333CA and Dept. of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium 3001;Dept. of Computer Science, The University of Waikato, Hamilton, New Zealand 3240;Dept. of Computer Science, The University of Waikato, Hamilton, New Zealand 3240

  • Venue:
  • Machine Learning
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Thousands of machine learning research papers contain extensive experimental comparisons. However, the details of those experiments are often lost after publication, making it impossible to reuse these experiments in further research, or reproduce them to verify the claims made. In this paper, we present a collaboration framework designed to easily share machine learning experiments with the community, and automatically organize them in public databases. This enables immediate reuse of experiments for subsequent, possibly much broader investigation and offers faster and more thorough analysis based on a large set of varied results. We describe how we designed such an experiment database, currently holding over 650,000 classification experiments, and demonstrate its use by answering a wide range of interesting research questions and by verifying a number of recent studies.