Type/token-taken informetrics

  • Authors:
  • Leo Egghe

  • Affiliations:
  • LUC, Universitaire Campus, B-3590 Diepenbeek, Belgium and U/A, Universiteitsplein 1, B-2610 Wilrijk, Belgium

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Type/Token-Taken informetrics is a new part of informetrics that studies the use of items rather than the items itself. Here, items are the objects that are produced by the sources (e.g., journals producing articles, authors producing papers, etc.). In linguistics a source is also called a type (e.g., a word), and an item a token (e.g., the use of words in texts). In informetrics, types that occur often, for example, in a database will also be requested often, for example, in information retrieval. The relative use of these occurrences will be higher than their relative occurrences itself; hence, the name Type/ Token-Taken informetrics. This article studies the frequency distribution of Type/Token-Taken informetrics, starting from the one of Type/Token informetrics (i.e., source-item relationships). We are also studying the average number µ* of item uses in Type/Token-Taken informetrics and compare this with the classical average number µ in Type/Token informetrics. We show that µ* ≥ µ always, and that µ* is an increasing function of µ. A method is presented to actually calculate µ* from µ, and a given α, which is the exponent in Lotka's frequency distribution of Type/Token informetrics. We leave open the problem of developing non-Lotkaian Type/Token-Taken informetrics.