Kml: A package to cluster longitudinal data

  • Authors:
  • Christophe Genolini;Bruno Falissard

  • Affiliations:
  • Inserm, U669, Paris, France and Univ Paris-Sud and Univ Paris Descartes, UMR-S0669, Paris, France and Modal'X,Univ Paris-Nanterre, UMR-S0669, Paris, France;Inserm, U669, Paris, France and Univ Paris-Sud and Univ Paris Descartes, UMR-S0669, Paris, France and AP-HP, Hôpital de Bicêtre, Service de Psychiatrie, Le Kremlin-Bicêtre, France

  • Venue:
  • Computer Methods and Programs in Biomedicine
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cohort studies are becoming essential tools in epidemiological research. In these studies, measurements are not restricted to single variables but can be seen as trajectories. Thus, an important question concerns the existence of homogeneous patient trajectories. KmL is an R package providing an implementation of k-means designed to work specifically on longitudinal data. It provides several different techniques for dealing with missing values in trajectories (classical ones like linear interpolation or LOCF but also new ones like copyMean). It can run k-means with distances specifically designed for longitudinal data (like Frechet distance or any user-defined distance). Its graphical interface helps the user to choose the appropriate number of clusters when classic criteria are not efficient. It also provides an easy way to export graphical representations of the mean trajectories resulting from the clustering. Finally, it runs the algorithm several times, using various kinds of starting conditions and/or numbers of clusters to be sought, thus sparing the user a lot of manual re-sampling.