Algorithms for clustering data
Algorithms for clustering data
Automating the assignment of submitted manuscripts to reviewers
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Inside a software design team: knowledge acquisition, sharing, and integration
Communications of the ACM
Identifying objects using cluster and concept analysis
Proceedings of the 21st international conference on Software engineering
Supporting program comprehension using semantic and structural information
ICSE '01 Proceedings of the 23rd International Conference on Software Engineering
Approaches to clustering for program comprehension and remodularization
Advances in software engineering
Recovering Traceability Links between Code and Documentation
IEEE Transactions on Software Engineering
X-means: Extending K-means with Efficient Estimation of the Number of Clusters
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Automatic Software Clustering via Latent Semantic Analysis
ASE '99 Proceedings of the 14th IEEE international conference on Automated software engineering
Using Clustering Algorithms in Legacy Systems Remodularization
WCRE '97 Proceedings of the Fourth Working Conference on Reverse Engineering (WCRE '97)
ICSM '01 Proceedings of the IEEE International Conference on Software Maintenance (ICSM'01)
Identification of High-Level Concept Clones in Source Code
Proceedings of the 16th IEEE international conference on Automated software engineering
ICTAI '00 Proceedings of the 12th IEEE International Conference on Tools with Artificial Intelligence
The Journal of Machine Learning Research
A brief survey of program slicing
ACM SIGSOFT Software Engineering Notes
The Conceptual Cohesion of Classes
ICSM '05 Proceedings of the 21st IEEE International Conference on Software Maintenance
Comparison of Clustering Algorithms in the Context of Software Evolution
ICSM '05 Proceedings of the 21st IEEE International Conference on Software Maintenance
Advancing Candidate Link Generation for Requirements Tracing: The Study of Methods
IEEE Transactions on Software Engineering
Using Version Information in Architectural Clustering - A Case Study
CSMR '06 Proceedings of the Conference on Software Maintenance and Reengineering
3D visualization for concept location in source code
Proceedings of the 28th international conference on Software engineering
ICSM '06 Proceedings of the 22nd IEEE International Conference on Software Maintenance
The Conceptual Coupling Metrics for Object-Oriented Systems
ICSM '06 Proceedings of the 22nd IEEE International Conference on Software Maintenance
Semantic clustering: Identifying topics in source code
Information and Software Technology
Clustering large software systems at multiple layers
Information and Software Technology
IEEE Transactions on Software Engineering
Combining Formal Concept Analysis with Information Retrieval for Concept Location in Source Code
ICPC '07 Proceedings of the 15th IEEE International Conference on Program Comprehension
ACM Transactions on Software Engineering and Methodology (TOSEM)
Mining business topics in source code using latent dirichlet allocation
ISEC '08 Proceedings of the 1st India software engineering conference
Defining clusters from a hierarchical cluster tree
Bioinformatics
Introduction to Information Retrieval
Introduction to Information Retrieval
A theory of aspects as latent topics
Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
Journal of Software Maintenance and Evolution: Research and Practice - Special Issue on the 12th Conference on Software Maintenance and Reengineering (CSMR 2008)
Towards Recovering Architectural Concepts Using Latent Semantic Indexing
CSMR '08 Proceedings of the 2008 12th European Conference on Software Maintenance and Reengineering
Seeing the forest for the trees
Bioinformatics
Mining source code to automatically split identifiers for software analysis
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Using Latent Dirichlet Allocation for automatic categorization of software
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Using Cluster Analysis to Improve the Design of Component Interfaces
ASE '08 Proceedings of the 2008 23rd IEEE/ACM International Conference on Automated Software Engineering
Survey of clustering algorithms
IEEE Transactions on Neural Networks
Hi-index | 0.00 |
Latent Semantic Indexing (LSI) is a standard approach for extracting and representing the meaning of words in a large set of documents. Recently it has been shown that it is also useful for identifying concerns in source code. The tree cutting strategy plays an important role in obtaining the clusters, which identify the concerns. In this contribution the authors compare two tree cutting strategies: the Dynamic Hybrid cut and the commonly used fixed height threshold. Two case studies have been performed on the source code of Philips Healthcare to compare the results using both approaches. While some of the settings are particular to the Philips-case, the results show that applying a dynamic threshold, implemented by the Dynamic Hybrid cut, is an improvement over the fixed height threshold in the detection of clusters representing relevant concerns. This makes the approach as a whole more usable in practice.