The vocabulary problem in human-system communication
Communications of the ACM
An Information Retrieval Approach to Concept Location in Source Code
WCRE '04 Proceedings of the 11th Working Conference on Reverse Engineering
Advancing Candidate Link Generation for Requirements Tracing: The Study of Methods
IEEE Transactions on Software Engineering
Incremental Approach and User Feedbacks: a Silver Bullet for Traceability Recovery
ICSM '06 Proceedings of the 22nd IEEE International Conference on Software Maintenance
Using natural language program analysis to locate and understand action-oriented concerns
Proceedings of the 6th international conference on Aspect-oriented software development
Partial Domain Comprehension in Software Evolution and Maintenance
ICPC '08 Proceedings of the 2008 The 16th IEEE International Conference on Program Comprehension
Identifying Word Relations in Software: A Comparative Study of Semantic Similarity Tools
ICPC '08 Proceedings of the 2008 The 16th IEEE International Conference on Program Comprehension
Automatically capturing source code context of NL-queries for software maintenance and reuse
ICSE '09 Proceedings of the 31st International Conference on Software Engineering
Exploring reductions for long web queries
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Estimating the Query Difficulty for Information Retrieval
Estimating the Query Difficulty for Information Retrieval
Towards mining replacement queries for hard-to-retrieve traces
Proceedings of the IEEE/ACM international conference on Automated software engineering
Improving verbose queries using subset distribution
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
A Survey of Automatic Query Expansion in Information Retrieval
ACM Computing Surveys (CSUR)
Evaluating the specificity of text retrieval queries to support software engineering tasks
Proceedings of the 34th International Conference on Software Engineering
Automatic query performance assessment during the retrieval of software artifacts
Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
Query quality prediction and reformulation for source code search: the refoqus tool
Proceedings of the 2013 International Conference on Software Engineering
Query quality prediction and reformulation for source code search: the refoqus tool
Proceedings of the 2013 International Conference on Software Engineering
Hi-index | 0.00 |
There are more than twenty distinct software engineering tasks addressed with text retrieval (TR) techniques, such as, traceability link recovery, feature location, refactoring, reuse, etc. A common issue with all TR applications is that the results of the retrieval depend largely on the quality of the query. When a query performs poorly, it has to be reformulated and this is a difficult task for someone who had trouble writing a good query in the first place. We propose a recommender (called Refoqus) based on machine learning, which is trained with a sample of queries and relevant results. Then, for a given query, it automatically recommends a reformulation strategy that should improve its performance, based on the properties of the query. We evaluated Refoqus empirically against four baseline approaches that are used in natural language document retrieval. The data used for the evaluation corresponds to changes from five open source systems in Java and C++ and it is used in the context of TR-based concept location in source code. Refoqus outperformed the baselines and its recommendations lead to query performance improvement or preservation in 84% of the cases (in average).