Comparing Different Sparse Matrix Storage Structures as Index Structure for Arabic Text Collection

  • Authors:
  • Basel Bani-Ismail;Ghassan Kanaan

  • Affiliations:
  • Department of Computer Science, Sultan Qaboos University, Muscat, Oman;Department of Computer Science, Amman Arab University, Amman, Jordan

  • Venue:
  • International Journal of Information Retrieval Research
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the authors' study they evaluate and compare the storage efficiency of different sparse matrix storage structures as index structure for Arabic text collection and their corresponding sparse matrix-vector multiplication algorithms to perform query processing in any Information Retrieval IR system. The study covers six sparse matrix storage structures including the Coordinate Storage COO, Compressed Sparse Row CSR, Compressed Sparse Column CSC, Block Coordinate BCO, Block Sparse Row BSR, and Block Sparse Column BSC. Evaluation depends on the storage space requirements for each storage structure and the efficiency of the query processing algorithm. The experimental results demonstrate that CSR is more efficient in terms of storage space requirements and query processing time than the other sparse matrix storage structures. The results also show that CSR requires the least amount of disk space and performs the best in terms of query processing time compared with the other point entry storage structures COO, CSC. The results demonstrate that BSR requires the least amount of disk space and performs the best in terms of query processing time compared with the other block entry storage structures BCO, BSC.