The Virt-UAM (Virtual Worlds at Universidad Autónoma de Madrid) platform supports the design and implementation of virtual spaces in which a set of avatars can be intensively monitored using tools managed by an administrator. In a virtual world, users can move and interact with one another with a high degree of freedom. Their movements, interactions, and any other information related to the avatars' conversations can be stored, so this data is available for processing and analysis to obtain user behavioural patterns. Document clustering techniques have been widely applied to automatically organize a document corpus into clusters of similar documents. Topic detection can be considered a special case of document clustering; therefore, these techniques can be applied to textual chat to detect clusters in the data and then extract the conversation topics. The Mahout(TM) machine learning library is an Apache(TM) project whose main goal is to build scalable machine learning libraries. It provides a set of ready-to-use algorithms for data mining and information retrieval. This paper presents a practical application of some of the clustering algorithms available in Mahout in a virtual world-based scenario. These algorithms have been applied to extract topics from the clusters obtained from the text messages. Finally, a comparative study of the document clustering algorithms used is presented.
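The idea of treating topic detection as document clustering can be illustrated with a minimal sketch. It is not the paper's pipeline and does not use the Mahout API: it is a plain-Python stand-in that assumes TF-IDF weighting, cosine similarity, and k-means with hand-picked seed messages. All function names (`tokenize`, `tfidf_vectors`, `kmeans`, `top_terms`) and the sample chat messages are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a chat message and keep alphabetic tokens longer than two letters."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if len(w) > 2]


def normalize(vec):
    """Scale a sparse vector (dict) to unit length; an all-zero vector is copied unchanged."""
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else dict(vec)


def tfidf_vectors(messages):
    """Turn each message into a unit-length TF-IDF vector (idf = ln(n/df) + 1, an assumed variant)."""
    counts = [Counter(tokenize(m)) for m in messages]
    n = len(counts)
    df = Counter(term for c in counts for term in c)
    return [
        normalize({t: tf * (math.log(n / df[t]) + 1.0) for t, tf in c.items()})
        for c in counts
    ]


def cosine(a, b):
    """Dot product of two unit-length sparse vectors, i.e. their cosine similarity."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def kmeans(vectors, seeds, max_iter=20):
    """k-means on cosine similarity; `seeds` are indices of the initial centroid messages."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            max(range(len(centroids)), key=lambda k: cosine(v, centroids[k]))
            for v in vectors
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for k in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                total = Counter()
                for v in members:
                    for t, w in v.items():
                        total[t] += w
                # Sum of members rescaled to unit length (same direction as the mean).
                centroids[k] = normalize(total)
    return labels, centroids


def top_terms(centroid, n=2):
    """Use the n highest-weighted centroid terms as a rough topic label."""
    return sorted(centroid, key=centroid.get, reverse=True)[:n]


# Hypothetical chat messages, for illustration only.
messages = [
    "quest dragon sword",
    "dragon sword battle",
    "exam homework lecture",
    "lecture homework class",
]
labels, centroids = kmeans(tfidf_vectors(messages), seeds=[0, 2])
for k, centroid in enumerate(centroids):
    print(k, top_terms(centroid))
```

Fixing the seeds keeps the sketch deterministic. A real run over chat logs would need a principled initialization and a way to choose the number of clusters, both of which are outside this sketch.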