Corpora building and processing

  • Authors:
  • Marija Brkic;Maja Matetic;Igor Jugo

  • Affiliations:
  • Department of Informatics, University of Rijeka, Croatia;Department of Informatics, University of Rijeka, Croatia;Department of Informatics, University of Rijeka, Croatia

  • Venue:
  • HSI'09 Proceedings of the 2nd conference on Human System Interactions
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Creativity is a basic feature of a language. Therefore, it is perfectly possible to create a completely new context that has never occurred before. This feature allows us to express our ideas, thoughts, knowledge and fears, but it also complicates the idea of human-machine communication. Since it became obvious that natural languages cannot be formalized and described as a whole, the idea of combining linguistic knowledge and corpora has arisen. The combination of these techniques has proven to give the best results and our research is based on that notion. Since data sparsity poses a huge problem, this work presents a practical solution in overcoming data sparsity problem and gives a detailed account of an advanced data processing technique.