Slovak Blog Clustering Enhanced by Mining the Web Comments
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Hierarchically clustered technical blogs
Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Finding keywords in blogs: Efficient keyword extraction in blog mining via user behaviors
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Web content clustering is very important part of topic detection and tracking issue. In our paper we focus on pre-processing phase of web content clustering. We focus on blog articles published in Slovak language. We evaluate the impact of different data pre-processing methods on success of blog clustering. We found out that applying various text data manipulation techniques in preprocessing can improve the quality of clusters. The quality of clusters is measured by traditional clustering metrics like precision, recall and F-measure.