CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications

  • Authors:
  • Kurt D. Bollacker;Steve Lawrence;C. Lee Giles

  • Affiliations:
  • University of Texas at Austin Austin, TX and NEC Research Institute Princeton, NJ;NEC Research Institute Princeton, NJ;UMIACS, University of Maryland, College Park, MD and NEC Research Institute Princeton, NJ

  • Venue:
  • AGENTS '98 Proceedings of the second international conference on Autonomous agents
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

Research papers available on the World Wide Web (WWW or Web) areoften poorly organized, often exist in forms opaque to searchengines (e.g. Postscript), and increase in quantity daily.Significant amounts of time and effort are typically needed inorder to find interesting and relevant publications on the Web. Wehave developed a Web based information agent that assists the userin the process of performing a scientific literature search. Givena set of keywords, the agent uses Web search engines and heuristicsto locate and download papers. The papers are parsed in order toextract information features such as the abstract and individuallyidentified citations. The agents Web interface can be used to findrelevant papers in the database using keyword searches, or bynavigating the links between papers formed by the citations. Linksto both citing and cited publications can be followed. In additionto simple browsing and keyword searches, the agent can find paperswhich are similar to a given paper using word information and byanalyzing common citations made by the papers.