User-based identification of Web genres

  • Authors:
  • Mark A. Rosso

  • Affiliations:
  • School of Business, North Carolina Central University, 122 Willis Commerce Bldg., Durham, NC 27707

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This research explores the use of genre as a document descriptorin order to improve the effectiveness of Web searching. A majorissue to be resolved is the identification of what documentcategories should be used as genres. As genre is a kind of folktypology, document categories must enjoy widespread recognition bytheir intended user groups in order to qualify as genres. Threeuser studies were conducted to develop a genre palette and showthat it is recognizable to users. (Palette is a term used to denotea classification, attributable to Karlgren, Bretan, Dewe, Hallberg,and Wolkert, 1998.) To simplify the users' classification task, itwas decided to focus on Web pages from the edu domain. The firststudy was a survey of user terminology for Web pages. Threeparticipants separated 100 Web page printouts into stacks accordingto genre, assigning names and definitions to each genre. The secondstudy aimed to refine the resulting set of 48 (often conceptuallyand lexically similar) genre names and definitions into a smallerpalette of user-preferred terminology. Ten participants classifiedthe same 100 Web pages. A set of five principles for creating agenre palette from individuals' sortings was developed, and thelist of 48 was trimmed to 18 genres. The third study aimed to showthat users would agree on the genres of Web pages when choosingfrom the genre palette. In an online experiment in which 257participants categorized a new set of 55 pages using the 18 genres,on average, over 70% agreed on the genre of each page. Suggestionsfor improving the genre palette and future directions for the workare discussed. © 2008 Wiley Periodicals, Inc.