Exploring features for the automatic identification of user goals in web search

  • Authors:
  • Mauro Rojas Herrera;Edleno Silva de Moura;Marco Cristo;Thomaz Philippe Silva;Altigran Soares da Silva

  • Affiliations:
  • Department of Computer Science, Federal University of Amazonas Manaus, Brazil;Department of Computer Science, Federal University of Amazonas Manaus, Brazil;FUCAPI-Technological and Research Foundation, Manaus, Brazil;Department of Computer Science, Federal University of Amazonas Manaus, Brazil;Department of Computer Science, Federal University of Amazonas Manaus, Brazil

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Queries submitted to search engines can be classified according to the user goals into three distinct categories: navigational, informational, and transactional. Such classification may be useful, for instance, as additional information for advertisement selection algorithms and for search engine ranking functions, among other possible applications. This paper presents a study about the impact of using several features extracted from the document collection and query logs on the task of automatically identifying the users' goals behind their queries. We propose the use of new features not previously reported in literature and study their impact on the quality of the query classification task. Further, we study the impact of each feature on different web collections, showing that the choice of the best set of features may change according to the target collection. The results obtained indicate the new proposed set of features improves the quality of the classification task when compared to previous proposals. We report experiments with two web collections where we were able to obtain 82.5% and 77.67% of overall accuracy when classifying queries according to the three distinct user goals studied.