Genre analysis of structured e-mails for corpus profiling

  • Authors:
  • Malcolm Clark;Ian Ruthven;Patrik O'Brian Holt

  • Affiliations:
  • School of Computing, The Robert Gordon University, Aberdeen;Department of Computer and Information Sciences, University of Strathclyde, Glasgow;School of Computing, The Robert Gordon University, Aberdeen

  • Venue:
  • IRSG'08 Proceedings of the 2008 BCS-IRSG conference on Corpus Profiling
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper reports on our approach to the analysis of genre recognition using eyetracking. We focused on a collection of different types of email which could represent different datasets, such as, mailing lists for calls for papers, newsletters, etc. We found that genre analysis based on purpose, form and layout features is potentially effective for identifying the characteristics of these datasets and we have highlighted some of the new important features of genres. The results from a pilot study showed a clear effect, with an interaction between the email texts and the visual cues or features perceived and also the strategies employed for the processing of the texts. We found, in our small sample, that readers can determine the purpose and form of genres and that during this process some readers do skim the shape of the emails (form).