Using finite experiments to study asymptotic performance

  • Authors:
  • Catherine McGeoch;Peter Sanders;Rudolf Fleischer;Paul R. Cohen;Doina Precup

  • Affiliations:
  • Amherst College, Amherst, MA;Max-Planck-Institut für Informatik, Saarbrücken, Germany;The Hong Kong University of Science and Technology, Hong Kong;University of Massachussetts, Amherst, MA;University of Massachussetts, Amherst, MA

  • Venue:
  • Experimental algorithmics
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the analysis of algorithms we are interested in obtaining closed form expressions for algorithmic complexity, or at least asymptotic expressions in O(ċ)-notation. It is often possible to use experimental results to make significant progress towards this goal, although there are fundamental reasons why we cannot guarantee to obtain such expressions from experiments alone. This paper investigates two approaches relating to problems of developing theoretical analyses based on experimental data.We first consider the scientific method, which views experimentation as part of a cycle alternating with theoretical analysis. This approach has been very successful in the natural sciences. Besides supplying preliminary ideas for theoretical analysis, experiments can test falsifiable hypotheses obtained by incomplete theoretical analysis. Asymptotic behavior can also sometimes be deduced from stronger hypotheses which have been induced from experiments. As long as complete mathematical analyses remains elusive, well tested hypotheses may have to take their place. Several examples are given where average complexity can be tested experimentally so that support for hypotheses is quite strong.A second question is how to approach systematically the problem of inferring asymptotic bounds from experimental data. Five heuristic rules for "empirical curve bounding" are presented, ogether with analytical results guaranteeing correctness for certain families of functions. Experimental evaluations of the correctness and tightness of bounds obtained by the rules for several constructed functions and real datasets are described.