Maximal phrases based analysis for prototyping online discussion forums postings

  • Authors:
  • Gaston Burek;Dale Gerdemann

  • Affiliations:
  • Tuebingen University, Tuebingen, Germany;Tuebingen University, Tuebingen, Germany

  • Venue:
  • AdaptLRTtoND '09 Proceedings of the Workshop on Adaptation of Language Resources and Technology to New Domains
  • Year:
  • 2009

Abstract

Chat texts produced in an educational environment are categorized and rated in order to position (or place) the learner with respect to a learning program (appropriate courses, textbooks, etc.). The difficulty is that the texts are short and informal. A standard LSA/vector-space model is therefore combined with techniques suited to short texts: the approach uses phrases rather than words in the term-document matrix, and a nonparametric permutation test is used to determine the prototypical documents of each category.
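
The two ideas named in the abstract, phrase-based document vectors and a permutation test for prototypicality, can be illustrated with a minimal sketch. This is not the authors' implementation, and it rests on several assumptions: word n-grams stand in for maximal phrases, the LSA dimensionality-reduction step is omitted, raw cosine similarity is used, and the function names (`phrase_counts`, `cosine`, `permutation_p_value`, `prototype_scores`) and the toy postings are hypothetical. A posting is treated as more prototypical of its category when its similarity to same-category postings exceeds its similarity to other postings by more than random relabeling would explain.

```python
import math
import random
from collections import Counter
from statistics import fmean


def phrase_counts(text, n_max=2):
    """Count word n-grams up to n_max (assumed stand-in for maximal phrases)."""
    tokens = text.lower().split()
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse phrase-count vectors."""
    dot = sum(v * b[k] for k, v in a.items() if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def permutation_p_value(in_sims, out_sims, n_perm=2000, seed=0):
    """One-sided permutation test on mean(in_sims) - mean(out_sims).

    Both lists must be non-empty. Labels are shuffled n_perm times and the
    p-value is the share of shuffles with a difference at least as large.
    """
    rng = random.Random(seed)
    observed = fmean(in_sims) - fmean(out_sims)
    pooled = list(in_sims) + list(out_sims)
    k = len(in_sims)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = fmean(pooled[:k]) - fmean(pooled[k:])
        if diff >= observed - 1e-12:  # tolerance for floating-point ties
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one correction avoids p = 0


def prototype_scores(docs, labels, category):
    """Map each document index in `category` to its permutation p-value."""
    vecs = [phrase_counts(d) for d in docs]
    scores = {}
    for i, label in enumerate(labels):
        if label != category:
            continue
        in_sims = [cosine(vecs[i], vecs[j]) for j, other in enumerate(labels)
                   if other == category and j != i]
        out_sims = [cosine(vecs[i], vecs[j]) for j, other in enumerate(labels)
                    if other != category]
        scores[i] = permutation_p_value(in_sims, out_sims)
    return scores


# Hypothetical toy postings, for illustration only.
docs = [
    "how do i solve this equation",
    "solve this equation with algebra",
    "i cannot solve this equation",
    "what time is lunch today",
    "is lunch served today",
]
labels = ["math", "math", "math", "offtopic", "offtopic"]
scores = prototype_scores(docs, labels, "math")
best = min(scores, key=scores.get)  # lowest p-value = most prototypical here
```

With so few postings the p-values are coarse, so the sketch only shows the mechanics. A fuller version would apply a truncated SVD to the phrase-document matrix before computing similarities, as a standard LSA model does.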