An architecture to automatically store and update MEDLINE data for text mining

  • Authors:
  • Sirisha Kanda; Venu Dasigi

  • Affiliations:
  • Southern Polytechnic State University, Marietta, GA; Southern Polytechnic State University, Marietta, GA

  • Venue:
  • Proceedings of the 43rd annual Southeast regional conference - Volume 1
  • Year:
  • 2005

Abstract

This paper describes an architecture used for gene clustering based on functional keyword associations from MEDLINE abstracts. The architecture stores word and document statistics and supports incremental updates of those statistics whenever a new batch of MEDLINE data becomes available. The data are stored in a form that is ready to use without further calculation, which reduces the time needed to run a query. The system is also intended to serve as a test bed for experimenting with new algorithms.
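
The idea of keeping word and document statistics that are updated batch by batch can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which statistics are stored or how, so the class name `CorpusStats`, the method `add_batch`, the choice of per-document term counts and document frequencies, the simple tokenizer, and the sample identifiers are all assumptions made for illustration.

```python
from collections import Counter
import re


class CorpusStats:
    """Hypothetical store of running word and document statistics."""

    def __init__(self):
        self.num_docs = 0          # number of abstracts indexed so far
        self.doc_freq = Counter()  # word -> number of abstracts containing it
        self.term_freq = {}        # doc id -> Counter of word counts in that abstract

    def add_batch(self, batch):
        """Fold a batch of {doc_id: abstract_text} into the stored statistics."""
        for doc_id, text in batch.items():
            if doc_id in self.term_freq:
                continue  # already indexed; skip so counts are not doubled
            # Assumed tokenizer: lowercase runs of letters and digits.
            counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
            self.term_freq[doc_id] = counts
            self.doc_freq.update(counts.keys())  # each word counted once per abstract
            self.num_docs += 1


# Made-up sample data; the identifiers are placeholders, not real MEDLINE records.
stats = CorpusStats()
stats.add_batch({"doc1": "gene expression in yeast", "doc2": "yeast cell cycle"})
stats.add_batch({"doc3": "gene clustering by keyword"})
```

Because each batch only touches the counters for the abstracts it contains, earlier batches never need to be reprocessed, and a query can read the stored counts directly. A persistent version would presumably keep these tables in a database rather than in memory.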