Scanning World Wide Web documents with the vector space model

  • Authors:
  • Cheryl Aasheim; Gary J. Koehler

  • Affiliations:
  • Department of Information Technology, Georgia Southern University, Statesboro, GA; Department of Decision and Information Sciences, Warrington College of Business Administration, University of Florida, Gainesville, FL

  • Venue:
  • Decision Support Systems
  • Year:
  • 2006

Abstract

The vector space model from information retrieval is combined with discriminant analysis to build an automated system that scans the World Wide Web environment for signals of interest to an organization. The vector space model converts text-based information into numerical vectors, which then serve as inputs to the discriminant analysis. We illustrate the methodology with news articles about a predefined, randomly selected set of stocks. The test is whether these articles carry predictive signals of two kinds: whether a stock's return will rise or fall relative to the market in the target period following the report, and whether the stock's trading volume will rise or fall.
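
The two-stage pipeline described in the abstract (text to vectors, then vectors to a discriminant rule) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation. The abstract does not specify the term weighting or the discriminant variant, so the sketch assumes smoothed TF-IDF weights with length normalization and a two-class linear discriminant with a pooled diagonal covariance and equal class priors, which is a simplification of full discriminant analysis. The headlines, labels, and all function names are hypothetical.

```python
import math
from collections import Counter

def tokenize(text):
    # Lowercase and split on non-letters; a real system would also stem and drop stop words.
    return "".join(c.lower() if c.isalpha() else " " for c in text).split()

def fit_tfidf(docs):
    """Build a vocabulary and smoothed IDF weights from the training documents."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n = len(docs)
    df = Counter(t for toks in tokenized for t in set(toks))  # document frequency
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1.0 for t in vocab}
    return vocab, idf

def vectorize(text, vocab, idf):
    """Map one document to a unit-length TF-IDF vector; unseen terms are ignored."""
    tf = Counter(tokenize(text))
    vec = [tf[t] * idf[t] for t in vocab]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def fit_diagonal_lda(X, y, eps=1e-2):
    """Two-class linear discriminant, assuming a pooled diagonal covariance
    and equal priors. eps regularizes features with near-zero variance."""
    dims = len(X[0])
    groups = {c: [x for x, lab in zip(X, y) if lab == c] for c in (0, 1)}
    means = {c: [sum(col) / len(rows) for col in zip(*rows)]
             for c, rows in groups.items()}
    w, b = [], 0.0
    for j in range(dims):
        ss = sum((x[j] - means[c][j]) ** 2
                 for c, rows in groups.items() for x in rows)
        var = ss / max(len(X) - 2, 1) + eps
        wj = (means[1][j] - means[0][j]) / var
        w.append(wj)
        b -= wj * (means[1][j] + means[0][j]) / 2  # boundary at the class midpoint
    return w, b

def predict(x, w, b):
    # 1 = signal class (e.g. return rises relative to market), 0 = otherwise.
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b > 0 else 0

# Hypothetical toy data: invented headlines with invented labels.
train_docs = [
    "Company beats earnings forecast and raises guidance",
    "Record profit and strong sales growth reported",
    "Company misses earnings forecast and cuts guidance",
    "Weak sales and profit warning issued",
]
train_labels = [1, 1, 0, 0]

vocab, idf = fit_tfidf(train_docs)
X = [vectorize(d, vocab, idf) for d in train_docs]
w, b = fit_diagonal_lda(X, train_labels)

for headline in ["Strong earnings growth and record sales",
                 "Profit warning issued after weak sales"]:
    print(headline, "->", predict(vectorize(headline, vocab, idf), w, b))
```

The diagonal covariance keeps the sketch free of matrix inversion. With realistic vocabulary sizes, a full pooled covariance would typically need regularization or dimensionality reduction before a discriminant function could be estimated reliably.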