MUDABlue: An Automatic Categorization System for Open Source Repositories
APSEC '04 Proceedings of the 11th Asia-Pacific Software Engineering Conference
Open Source communities typically use a software repository to archive software projects along with their source code, mailing list discussions, documentation, bug reports, and so forth. For example, SourceForge currently hosts over seventy thousand Open Source software systems. Because such repositories hold so much information, they offer numerous opportunities for sharing it among projects. For example, one would like to find the set of projects that are related or similar to each other, so that the project groups can collaborate and share their work. With thousands of projects in a typical repository, however, manually locating related projects can be difficult. Hence, we propose MUDABlue, a tool that automatically categorizes software systems. MUDABlue has three major features: (1) it relies on no information other than the source code, (2) it determines category sets automatically, and (3) it allows a software system to be a member of multiple categories. MUDABlue has a Web interface that visualizes the determined categories, which eases browsing of a software repository. We show the effectiveness of MUDABlue's categorization capability by comparing its generated categories with those of other existing research tools.
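To make the three properties concrete, the following is a minimal sketch of source-only, multi-category grouping. It is not MUDABlue's algorithm, which the abstract does not describe. It assumes, purely for illustration, that each identifier shared by at least two systems forms one candidate category. The names `KEYWORDS`, `identifiers`, `categorize`, and the three toy systems are all hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical stop list: language keywords that carry no domain meaning.
KEYWORDS = {"int", "void", "return", "if", "else", "for", "while", "char"}


def identifiers(source: str) -> set[str]:
    """Return the lower-cased identifiers found in a piece of source code."""
    tokens = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", source)
    return {t.lower() for t in tokens} - KEYWORDS


def categorize(systems: dict[str, str], min_members: int = 2) -> dict[str, set[str]]:
    """Group systems by shared identifiers (an illustrative assumption).

    Input is source code only. Categories are not predefined: every
    identifier used by at least `min_members` systems becomes a category.
    A system appears in as many categories as it has shared identifiers.
    """
    members: dict[str, set[str]] = defaultdict(set)
    for name, source in systems.items():
        for ident in identifiers(source):
            members[ident].add(name)
    return {
        ident: names
        for ident, names in members.items()
        if len(names) >= min_members
    }


# Toy repository: three systems given as raw source text.
systems = {
    "editor": "void open_file(char *path) { render_text(path); }",
    "viewer": "void render_text(char *path) { decode_image(path); }",
    "converter": "int decode_image(char *src) { return 0; }",
}

categories = categorize(systems)
for label in sorted(categories):
    print(label, sorted(categories[label]))
```

In this toy data the `viewer` system falls into several categories at once, which corresponds to property (3). A realistic tool would need a far more robust notion of similarity than exact identifier matches.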