MedFMI-SiR: a powerful DBMS solution for large-scale medical image retrieval

  • Authors:
  • Daniel S. Kaster;Pedro H. Bugatti;Marcelo Ponciano-Silva;Agma J. M. Traina;Paulo M. A. Marques;Antonio C. Santos;Caetano Traina, Jr.

  • Affiliations:
  • Department of Computer Science, University of Londrina, Londrina, PR, Brazil and Department of Computer Science, University of São Paulo, São Carlos, SP, Brazil;Department of Computer Science, University of São Paulo, São Carlos, SP, Brazil;Department of Computer Science, University of São Paulo, São Carlos, SP, Brazil;Department of Computer Science, University of São Paulo, São Carlos, SP, Brazil;Department of Internal Medicine, RPMS/University of São Paulo, Brazil;Department of Internal Medicine, RPMS/University of São Paulo, Brazil;Department of Computer Science, University of São Paulo, São Carlos, SP, Brazil

  • Venue:
  • ITBAM'11: Proceedings of the Second International Conference on Information Technology in Bio- and Medical Informatics
  • Year:
  • 2011

Abstract

Medical systems increasingly demand methods to deal with the large number of images generated daily. Therefore, the development of fast and scalable applications to store and retrieve images in large repositories has become an important concern. Moreover, it is necessary to handle textual and content-based queries over such data, combining DICOM image metadata with the images' visual patterns. While DBMSs have been extensively used to manage applications' textual information, content-based processing tasks usually rely on specific solutions. Most of these solutions target relatively small and controlled datasets, making them infeasible in real medical environments that deal with voluminous databases. Moreover, since in existing systems content-based retrieval is detached from the DBMS, queries integrating content- and metadata-based predicates are executed in isolation, and their results are joined in additional steps. This approach precludes many optimizations that an integrated retrieval engine could employ. In this paper we describe the MedFMI-SiR system, which handles medical data by integrating textual information, such as DICOM tags, and intrinsic image features in the retrieval process. The goal of our approach is to provide a subsystem that can be shared by many complex data applications, such as data analysis and mining tools, providing fast and reliable content-based access over large sets of images. We present experiments showing that MedFMI-SiR is a fast and scalable solution, able to quickly answer integrated content- and metadata-based queries over a terabyte-sized database with more than 10 million medical images from a large clinical hospital.