Google web 1T 5-grams made easy (but not for the computer)

  • Authors:
  • Stefan Evert

  • Affiliations:
  • University of Osnabrück, Germany

  • Venue:
  • WAC-6 '10 Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces Web1T5-Easy, a simple indexing solution that allows interactive searches of the Web 1T 5-gram database and a derived database of quasi-collocations. The latter is validated against co-occurrence data from the BNC and ukWaC on the automatic identification of non-compositional VPC.