Determining Stopping Criteria in the Generation of Web-Derived Language Models

  • Authors:
  • Gary A. Monroe; David R. Mikesell; James C. French

  • Venue:
  • Determining Stopping Criteria in the Generation of Web-Derived Language Models
  • Year:
  • 2000

Abstract

In this work, we present a small-scale evaluation of two query-based sampling techniques for building language models, using a database composed of World Wide Web documents. We propose a metric for determining when to stop sampling a given Web database, and we compare this new metric with metrics used in previous work to measure the fidelity of sampled language models.