A Comparative Study of Correlation Measurements for Searching Similar Tags
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Proceedings of the Second ACM International Conference on Web Search and Data Mining
RATC: A Robust Automated Tag Clustering Technique
EC-Web 2009 Proceedings of the 10th International Conference on E-Commerce and Web Technologies
A Neighborhood Search Method for Link-Based Tag Clustering
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Improving the Clustering of Blogosphere with a Self-term Enriching Technique
TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
Clustering Blog Posts Using Tags and Relations in the Blogosphere
ICISE '09 Proceedings of the 2009 First IEEE International Conference on Information Science and Engineering
Blog classification using tags: an empirical study
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Hi-index | 0.00 |
Similarity calculation is a key step in the process of clustering. Because most tagged resources on the Internet lack text information, traditional similarity measures cannot obtain good results. We propose the STAC measure to solve the problem of calculating the similarity between tagged resources. In the calculation of STAC, the similarity between tags is calculated using tag co-occurrence information, and the similarity between tagged resources is calculated based on tag comparison. Experiments show the clustering results of tagged resources using STAC is significantly better than using other traditional metrics such as the Euclidean distance and Jaccard coefficient.