Mining uncertain data for constrained frequent sets
IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
Towards mobility-based clustering
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Evaluating the distance between two uncertain categorical objects
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Feature selection with mutual information for uncertain data
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Distance-based feature selection on classification of uncertain objects
AI'11 Proceedings of the 24th international conference on Advances in Artificial Intelligence
Improving classification accuracy on uncertain data by considering multiple subclasses
AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
UV-diagram: a voronoi diagram for uncertain spatial databases
The VLDB Journal — The International Journal on Very Large Data Bases
Hi-index | 0.00 |
We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdf). We show that the UK-means algorithm, which generalises the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (ED) between objects and cluster representatives. For arbitrary pdf's, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculation. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previous known in the literature. We conduct experiments to evaluate the effectiveness of our pruning techniques and to show that our techniques significantly outperform previous methods.