Soft Uncoupling of Markov chains for Permeable Language Distinction: A New Algorithm

  • Authors:
  • Richard Nock;Pascal Vaillant;Frank Nielsen;Claudia Henry

  • Affiliations:
  • Université des Antilles et de la Guyane, Schœlcher, Martinique (France);Université des Antilles et de la Guyane, Schœlcher, Martinique (France);Sony CS Labs, Tokyo, Japan;Université des Antilles et de la Guyane, Schœlcher, Martinique (France)

  • Venue:
  • Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Without prior knowledge, distinguishing different languages may be a hard task, especially when their borders are permeable. We develop an extension of spectral clustering ---a powerful unsupervised classification toolbox ---that is shown to resolve accurately the task of soft language distinction. At the heart of our approach, we replace the usual hard membership assignment of spectral clustering by a soft, probabilistic assignment, which also presents the advantage to bypass a well-known complexity bottleneck of the method. Experiments with a readily available system display the potential of the method, which brings a visually appealing soft distinction of languages that may define altogether a whole corpus.