Comparing corpora using frequency profiling

  • Authors: Paul Rayson; Roger Garside
  • Affiliations: Lancaster University, Lancaster, UK (both authors)
  • Venue: WCC '00 Proceedings of the workshop on Comparing corpora - Volume 9
  • Year: 2000

Abstract

This paper describes a method of comparing corpora which uses frequency profiling. The method can be used to discover key words that differentiate one corpus from another. When applied to annotated corpora, it can also discover key grammatical or word-sense categories. It offers a quick way to find the differences between corpora, and is shown to have applications in the study of social differentiation in the use of English vocabulary, the profiling of learner English, and document analysis in the software engineering process.
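
As a rough illustration of frequency profiling, the sketch below builds a word-frequency list for each of two corpora and ranks words by how strongly their frequencies differ. The abstract does not name the scoring statistic, so the log-likelihood ratio used here is an assumption, and the function names `log_likelihood` and `key_words` are hypothetical. The same comparison could be run over part-of-speech or semantic tags instead of word forms.

```python
import math
from collections import Counter


def log_likelihood(a, b, c, d):
    """Log-likelihood ratio for one item (assumed statistic, see lead-in).

    a, b: observed frequency of the item in corpus 1 and corpus 2
    c, d: total number of tokens in corpus 1 and corpus 2
    """
    # Expected frequencies if the item were equally likely in both corpora.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)
    ll = 0.0
    # A zero observed count contributes nothing (limit of x * ln(x) is 0).
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll


def key_words(tokens1, tokens2, top=10):
    """Rank items by how strongly their frequency differs between corpora."""
    f1, f2 = Counter(tokens1), Counter(tokens2)
    n1, n2 = len(tokens1), len(tokens2)
    scored = [
        (log_likelihood(f1[w], f2[w], n1, n2), w)
        for w in set(f1) | set(f2)
    ]
    # Highest score first; ties broken alphabetically for stable output.
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [(w, ll) for ll, w in scored[:top]]
```

A high score only says that an item's frequency differs between the two corpora. Which corpus overuses it can be read off by comparing the relative frequencies `f1[w] / n1` and `f2[w] / n2`.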