SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
An Information Retrieval Approach for Automatically Constructing Software Libraries
IEEE Transactions on Software Engineering
Proceedings of the 11th international conference on World Wide Web
Recovering Traceability Links between Code and Documentation
IEEE Transactions on Software Engineering
Recovering documentation-to-source-code traceability links using latent semantic indexing
Proceedings of the 25th International Conference on Software Engineering
Richer File System Metadata Using Links and Attributes
MSST '05 Proceedings of the 22nd IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies
Ranking Significance of Software Components Based on Use Relations
IEEE Transactions on Software Engineering
Toward a search architecture for software components: Research Articles
Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
A cooperative classification mechanism for search and retrieval software components
Proceedings of the 2007 ACM symposium on Applied computing
Finding similar files in a large file system
WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference
SEC+: an enhanced search engine for component-based software development
ACM SIGSOFT Software Engineering Notes
Evaluating the Software Architecture Competence of Organizations
WICSA '08 Proceedings of the Seventh Working IEEE/IFIP Conference on Software Architecture (WICSA 2008)
Searching and navigating petabyte-scale file systems based on facets
PDSW '07 Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
The Claremont report on database research
ACM SIGMOD Record
Effective Keyword Search for Software Resources Installed in Large-Scale Grid Infrastructures
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Minersoft: Software retrieval in grid and cloud computing infrastructures
ACM Transactions on Internet Technology (TOIT)
Automated Tagging for the Retrieval of Software Resources in Grid and Cloud Infrastructures
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Hi-index | 0.00 |
Grid infrastructures are in operation around the world, federating an impressive collection of computational resources and a wide variety of application software. In this context, it is important to establish advanced software discovery services that could help end-users locate software components suitable to their needs. In this paper, we present the design, architecture and implementation of an open-source keyword-based paradigm for the search of software resources in Grid infrastructures, called Minersoft. A key goal of Minersoft is to annotate automatically all the software resources with keyword-rich metadata. Using advanced Information Retrieval techniques, we locate software resources with respect to users queries. Experiments were conducted in EGEE, one of the largest Grid production services currently in operation. Results showed that Minersoft successfully crawled 12.3 million valid files (620 GB size) and sustained, in most sites, high crawling rates.