The Unreasonable Effectiveness of Data

  • Authors: Alon Halevy; Peter Norvig; Fernando Pereira
  • Affiliations: Google; Google; Google
  • Venue: IEEE Intelligent Systems
  • Year: 2009

Abstract

Problems that involve interacting with humans, such as natural language understanding, have not proven to be solvable by concise, neat formulas like F = ma. Instead, the best approach appears to be to embrace the complexity of the domain and address it by harnessing the power of data: if other humans engage in the tasks and generate large amounts of unlabeled, noisy data, new algorithms can be used to build high-quality models from the data.