Using correlation dimension for analysing text data

  • Authors:
  • Ilkka Kivimäki;Krista Lagus;Ilari T. Nieminen;Jaakko J. Väyrynen;Timo Honkela

  • Affiliations:
  • Adaptive Informatics Research Centre, Aalto University School of Science and Technology;Adaptive Informatics Research Centre, Aalto University School of Science and Technology;Adaptive Informatics Research Centre, Aalto University School of Science and Technology;Adaptive Informatics Research Centre, Aalto University School of Science and Technology;Adaptive Informatics Research Centre, Aalto University School of Science and Technology

  • Venue:
  • ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.02

Visualization

Abstract

In this article, we study the scale-dependent dimensionality properties and overall structure of text data with a method that measures correlation dimension in different scales. As experimental results, we present the analysis of text data sets with the Reuters and Europarl corpora, which are also compared to artificially generated point sets. A comparison is also made with speech data. The results reflect some of the typical properties of the data and the use of our method in improving various data analysis applications is discussed.