Partitioned posting files: a parallel inverted file structure for information retrieval
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction: parallel processing and information retrieval
Information Processing and Management: an International Journal - Special issue on parallel processing and information retrieval
Stress-testing general purpose digital library software
ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
How to Build a Digital Library, Second Edition
How to Build a Digital Library, Second Edition
Hi-index | 0.00 |
As very large digital library collections become more commonplace, software tools must adapt appropriately. This paper reports on an evolution of the Greenstone Digital Library software to support parallel processing during the collection building phase. A series of experiments were conducted to first establish a basic speed-up factor, and then deconstruct the parallelisation process to understand the execution profile of the application. Several bottlenecks were identified and resolved to further improve the performance. The adaptation of Greenstone confirms that the build phase is indeed a suitable candidate for parallelisation; and suggests that parallelisation of processing is a new avenue for exploration in emerging digital library architectures.