Enhancing graph database indexing by suffix tree structure

  • Authors:
  • Vincenzo Bonnici;Alfredo Ferro;Rosalba Giugno;Alfredo Pulvirenti;Dennis Shasha

  • Affiliations:
  • Dipartimento di Matematica ed Informatica, Università di Catania, Catania, Italy;Dipartimento di Matematica ed Informatica, Università di Catania, Catania, Italy;Dipartimento di Matematica ed Informatica, Università di Catania, Catania, Italy;Dipartimento di Matematica ed Informatica, Università di Catania, Catania, Italy;Courant Institute of Mathematical Sciences, New York University, New York

  • Venue:
  • PRIB'10 Proceedings of the 5th IAPR international conference on Pattern recognition in bioinformatics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Biomedical and chemical databases are large and rapidly growing in size. Graphs naturally model such kinds of data. To fully exploit the wealth of information in these graph databases, scientists require systems that search for all occurrences of a query graph. To deal efficiently with graph searching, advanced methods for indexing, representation and matching of graphs have been proposed. This paper presents GraphGrepSX. The system implements efficient graph searching algorithms together with an advanced filtering technique. GraphGrepSX is compared with SING, GraphFind, CTree and GCoding. Experiments show that GraphGrepSX outperforms the compared systems on a very large collection of molecular data. In particular, it reduces the size and the time for the construction of large database index and outperforms the most popular systems.