Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Supervised term weighting for automated text categorization
Proceedings of the 2003 ACM symposium on Applied computing
Automatic Documentation and Mathematical Linguistics
The interpretation of Bradford's law in terms of geometric progression
Automatic Documentation and Mathematical Linguistics
Hi-index | 0.00 |
A classification of lexicographic resources is proposed to support automatic textanalysis systems. Four types of dictionaries are distinguished and described, namely, terminological, terminological-statistical, thesauri, and ontologies. Methods for dictionary generation are divided into static and dynamic, as well as linear and stepwise. Methods for weighting terms are divided into intertextual and intratextual. The features of the TF*IDF algorithm are considered in detail. Two techniques for dictionary generation are described.