Combining multiple clusterings of chemical structures using cumulative voting-based aggregation algorithm

Authors:
Faisal Saeed;Naomie Salim;Ammar Abdo;Hamza Hentabli
Affiliations:
Faculty of Computing, Universiti Teknologi Malaysia, Malaysia, Information Technology Department, Sanhan Community College, Sana'a, Yemen;Faculty of Computing, Universiti Teknologi Malaysia, Malaysia;Computer Science Department, Hodeidah University, Hodeidah, Yemen, LIFL UMR CNRS 8022, Université Lille 1 and INRIA Lille Nord Europe, Villeneuve d'Ascq cedex, France;Faculty of Computing, Universiti Teknologi Malaysia, Malaysia
Venue:
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part II
Year:
2013

Abstract

The use of consensus clustering methods in chemoinformatics is motivated by the success of consensus scoring (data fusion) in virtual screening and by the ability of consensus clustering to improve the robustness, novelty, consistency and stability of individual clusterings in other areas. In this paper, the Cumulative Voting-based Aggregation Algorithm (CVAA) is examined for combining multiple clusterings of chemical structures. The effectiveness of the clusterings was evaluated by the extent to which compounds belonging to the same activity class were clustered together. The results were then compared with those of other consensus clustering methods and of Ward's method. The MDL Drug Data Report (MDDR) database was used for the experiments, and the results were obtained by combining multiple clusterings produced with different distance measures. The experiments show that the voting-based consensus method can efficiently improve the effectiveness of chemical structure clusterings.
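
The idea of voting-based aggregation can be illustrated with a minimal sketch. It is not the authors' implementation of CVAA: it assumes hard partitions given as label lists, takes one of them as a fixed reference, and lets every cluster of every other partition spread a total vote of 1 over the reference clusters in proportion to their overlap. The function names `one_hot` and `cumulative_voting_consensus` and the toy labels are hypothetical, and any further steps of the published algorithm (for example, reducing the result to a target number of clusters) are omitted.

```python
import numpy as np


def one_hot(labels):
    """Return an n x k membership matrix for a hard partition."""
    labels = np.asarray(labels)
    clusters, idx = np.unique(labels, return_inverse=True)
    U = np.zeros((labels.size, clusters.size))
    U[np.arange(labels.size), idx] = 1.0
    return U


def cumulative_voting_consensus(partitions, reference_index=0):
    """Aggregate partitions by voting against a fixed reference partition.

    Assumption: the reference is simply one of the input partitions.
    Returns the consensus labels (indices of reference clusters) and the
    n x k0 matrix of averaged votes.
    """
    U0 = one_hot(partitions[reference_index])
    votes = np.zeros_like(U0)
    for labels in partitions:
        Ui = one_hot(labels)
        # Overlap counts between clusters of this partition and the reference.
        contingency = Ui.T @ U0
        # Each cluster distributes a total vote of 1 over reference clusters.
        W = contingency / contingency.sum(axis=1, keepdims=True)
        # Every object inherits the votes of the cluster it belongs to.
        votes += Ui @ W
    votes /= len(partitions)
    return votes.argmax(axis=1), votes


# Toy example: three clusterings of six compounds (labels are arbitrary).
partitions = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 2, 2],
]
consensus, votes = cumulative_voting_consensus(partitions)
print(consensus)
```

Because the votes are expressed over the reference clusters, the input partitions may use different label names and different numbers of clusters, which is the situation that arises when the same compounds are clustered with several distance measures.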