Type/token-taken informetrics

Authors:
Leo Egghe
Affiliations:
LUC, Universitaire Campus, B-3590 Diepenbeek, Belgium and U/A, Universiteitsplein 1, B-2610 Wilrijk, Belgium
Venue:
Journal of the American Society for Information Science and Technology
Year:
2003

Citing 6
Cited 5

“Sources of information on specific subjects”

Journal of Information Science - Lecture notes in computer science, No. 207
Synchronous versus diachronous methods in the measurement of obsolescence by citation studies

Journal of Information Science
The duality of informetric systems with applications to the empirical laws

Journal of Information Science
A bibliometric study of Nieuwenhuysen's bibliography of microcomputer software for online information and documentation work

Journal of Information Science
The duality of informatic systems with applications to the empirical laws

The duality of informatic systems with applications to the empirical laws
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval

"Type/Token-taken" informetrics: some comments and further examples

Journal of the American Society for Information Science and Technology
Symmetry and other transformation features of Lorenz/Leimkuhler representations of informetric data

Information Processing and Management: an International Journal - Special issue: Infometrics
Symmetry and other transformation features of Lorenz/Leimkuhler representations of informetric data

Information Processing and Management: an International Journal - Special issue: Infometrics
The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications

Mathematical and Computer Modelling: An International Journal
On the formation of the concept of informetrics (Review)

Scientific and Technical Information Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Type/Token-Taken informetrics is a new part of informetrics that studies the use of items rather than the items itself. Here, items are the objects that are produced by the sources (e.g., journals producing articles, authors producing papers, etc.). In linguistics a source is also called a type (e.g., a word), and an item a token (e.g., the use of words in texts). In informetrics, types that occur often, for example, in a database will also be requested often, for example, in information retrieval. The relative use of these occurrences will be higher than their relative occurrences itself; hence, the name Type/ Token-Taken informetrics. This article studies the frequency distribution of Type/Token-Taken informetrics, starting from the one of Type/Token informetrics (i.e., source-item relationships). We are also studying the average number µ* of item uses in Type/Token-Taken informetrics and compare this with the classical average number µ in Type/Token informetrics. We show that µ* ≥ µ always, and that µ* is an increasing function of µ. A method is presented to actually calculate µ* from µ, and a given α, which is the exponent in Lotka's frequency distribution of Type/Token informetrics. We leave open the problem of developing non-Lotkaian Type/Token-Taken informetrics.