Identifying writers' background by comparing personal sense thesauri

  • Authors:
  • Polina Panicheva;John Cardiff;Paolo Rosso

  • Affiliations:
  • Institute of Technology Tallaght, Dublin, Ireland and Natural Language Engineering Lab, ELiRF, Universidad Politécnica de Valencia, Spain;Institute of Technology Tallaght, Dublin, Ireland;Natural Language Engineering Lab, ELiRF, Universidad Politécnica de Valencia, Spain

  • Venue:
  • NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
  • Year:
  • 2010

Abstract

Analysis of blog posts is an important and growing research area in which both objective and subjective characteristics of a writer are detected. Words carry a meaning that is common across the language and is reflected in their usage. A second component of word meaning, "personal sense", is not inherent in the language but differs from person to person: it reflects the meaning of words in terms of unique personal experience and carries the writer's personal characteristics. In our research, word meaning techniques are applied to represent the personal sense of words in texts by different authors. Personalized concept structures are constructed and used to infer the authors' perspective from text. Various notions of context, combined with different thesaurus similarity scales, are applied to confirm that, from a certain perspective and with some restrictions, similarity between personalized thesauri can correspond to similarity in the authors' occupations.
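The abstract does not say which notions of context or which similarity scales were used, so the following is only a rough sketch of the general idea, not the authors' method. It assumes a fixed co-occurrence window as the "context", cosine similarity between co-occurrence vectors to pick each word's nearest neighbours, and Jaccard overlap of those neighbour sets to compare two writers' thesauri. All function names and parameters (`build_thesaurus`, `thesaurus_similarity`, `window`, `top_n`) are hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def build_thesaurus(sentences, top_n=3, window=2):
    """Assumed 'personal thesaurus': each word's top_n most similar words."""
    vectors = cooccurrence_vectors(sentences, window)
    thesaurus = {}
    for word, vec in vectors.items():
        scored = [(cosine(vec, other), w) for w, other in vectors.items() if w != word]
        scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best first, ties by word
        thesaurus[word] = {w for score, w in scored[:top_n] if score > 0}
    return thesaurus


def thesaurus_similarity(t1, t2):
    """Mean Jaccard overlap of neighbour sets over words both thesauri contain."""
    shared = set(t1) & set(t2)
    if not shared:
        return 0.0
    total = 0.0
    for word in shared:
        union = t1[word] | t2[word]
        # Two empty neighbour sets contribute 0.0 (a conservative choice).
        total += len(t1[word] & t2[word]) / len(union) if union else 0.0
    return total / len(shared)
```

In use, one thesaurus would be built per author from that author's tokenized posts, and pairs of authors would be scored with `thesaurus_similarity`; whether higher scores line up with shared occupation is the empirical question the paper addresses, and this sketch makes no claim about it.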