Tag recommendation in software information sites

Authors:
Xin Xia;David Lo;Xinyu Wang;Bo Zhou
Affiliations:
Zhejiang University, China;Singapore Management University, Singapore;Zhejiang University, China;Zhejiang University, China
Venue:
Proceedings of the 10th Working Conference on Mining Software Repositories
Year:
2013

References:

Probabilistic latent semantic indexing. Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Data Mining: Concepts and Techniques
HT06, tagging paper, taxonomy, Flickr, academic article, to read. Proceedings of the seventeenth conference on Hypertext and hypermedia
ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition
Flickr tag recommendation based on collective knowledge. Proceedings of the 17th international conference on World Wide Web
Random k-Labelsets: An Ensemble Method for Multilabel Classification. ECML '07 Proceedings of the 18th European conference on Machine Learning
How tagging helps bridge the gap between social and technical aspects in software development. ICSE '09 Proceedings of the 31st International Conference on Software Engineering
Learning from Imbalanced Data. IEEE Transactions on Knowledge and Data Engineering
Modern Information Retrieval
Multi-label learning by exploiting label dependency. Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Social media for software engineering. Proceedings of the FSE/SDP workshop on Future of software engineering research
The impact of social media on software engineering practices and tools. Proceedings of the FSE/SDP workshop on Future of software engineering research
Fuzzy set approach for automatic tagging in evolving software. ICSM '10 Proceedings of the 2010 IEEE International Conference on Software Maintenance
Mining Collaboration Patterns from a Large Developer Network. WCRE '10 Proceedings of the 2010 17th Working Conference on Reverse Engineering
Towards understanding twitter use in software engineering: preliminary findings, ongoing challenges and future questions. Proceedings of the 2nd International Workshop on Web 2.0 for Software Engineering
How do developers blog?: an exploratory study. Proceedings of the 8th Working Conference on Mining Software Repositories
MULAN: A Java Library for Multi-Label Learning. The Journal of Machine Learning Research
Using tag recommendations to homogenize folksonomies in microblogging environments. SocInfo'11 Proceedings of the Third international conference on Social informatics
Classifier chains for multi-label classification. Machine Learning
Recommending People in Developers' Collaboration Network. WCRE '11 Proceedings of the 2011 18th Working Conference on Reverse Engineering
Understanding a developer social network and its evolution. ICSM '11 Proceedings of the 2011 27th IEEE International Conference on Software Maintenance
Work Item Tagging: Communicating Concerns in Collaborative Software Development. IEEE Transactions on Software Engineering
Finding relevant answers in software forums. ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
Semi-automatically extracting FAQs to improve accessibility of software development knowledge. Proceedings of the 34th International Conference on Software Engineering
Observatory of trends in software related microblogs. Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
Automatic classification of software related microblogs. ICSM '12 Proceedings of the 2012 IEEE International Conference on Software Maintenance (ICSM)
Detecting similar applications with collaborative tagging. ICSM '12 Proceedings of the 2012 IEEE International Conference on Software Maintenance (ICSM)
Inferring semantically related software terms and their taxonomy by leveraging collaborative tagging. ICSM '12 Proceedings of the 2012 IEEE International Conference on Software Maintenance (ICSM)

Abstract

Software engineers use a variety of online media to search for and keep informed of new and interesting technologies, and to learn from and help one another. We refer to online media that help software engineers improve their performance in software development, maintenance, and testing as software information sites. Tags are common in software information sites, and many sites allow users to tag objects with their own words. Users increasingly use tags to describe the most important features of their posted content or projects. In this paper, we propose TagCombine, an automatic tag recommendation method that analyzes objects in software information sites. TagCombine has three components: (1) a multi-label ranking component, which treats tag recommendation as a multi-label learning problem; (2) a similarity-based ranking component, which recommends tags from similar objects; and (3) a tag-term based ranking component, which considers the relationship between terms and tags and recommends tags after analyzing the terms in the objects. We evaluate TagCombine on two software information sites, StackOverflow and Freecode, which contain 47,668 and 39,231 text documents and 437 and 243 tags, respectively. Experimental results show that on StackOverflow, TagCombine achieves recall@5 and recall@10 scores of 0.5964 and 0.7239, respectively; on Freecode, it achieves recall@5 and recall@10 scores of 0.6391 and 0.7773, respectively. Moreover, averaged over the StackOverflow and Freecode results, TagCombine improves on TagRec, proposed by Al-Kofahi et al., by 22.65% and 14.95%, and on the tag recommendation method proposed by Zangerle et al. by 18.5% and 7.35%, for recall@5 and recall@10, respectively.
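
The abstract reports recall@5 and recall@10 without defining them. The sketch below assumes the common definition: for each object, the fraction of its true tags that appear among the top k recommended tags, averaged over all objects. The function names and the sample tags are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Set


def recall_at_k(ranked_tags: Sequence[str], true_tags: Set[str], k: int) -> float:
    """Fraction of an object's true tags found in the top-k recommendations.

    Assumed definition: |top-k recommended ∩ true tags| / |true tags|.
    """
    if not true_tags:
        raise ValueError("true_tags must not be empty")
    top_k = set(ranked_tags[:k])
    return len(top_k & true_tags) / len(true_tags)


def mean_recall_at_k(
    ranked_lists: Sequence[Sequence[str]],
    true_tag_sets: Sequence[Set[str]],
    k: int,
) -> float:
    """Average recall@k over a collection of objects."""
    scores = [recall_at_k(r, t, k) for r, t in zip(ranked_lists, true_tag_sets)]
    return sum(scores) / len(scores)


# Hypothetical example: one object with three true tags.
ranked = ["java", "android", "eclipse", "xml", "swing"]
truth = {"java", "xml", "maven"}
score = recall_at_k(ranked, truth, k=5)  # two of the three true tags are in the top 5
```

Under this definition, a recall@5 of 0.5964 would mean that, on average, roughly 60% of an object's true tags appear among the five recommended tags.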