Knowledge discovery in databases terminology
Advances in knowledge discovery and data mining
CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
CACTUS—clustering categorical data using summaries
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Data mining: concepts and techniques
Data mining: concepts and techniques
A robust and scalable clustering algorithm for mixed type attributes in large database environment
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Data Mining: Concepts, Models, Methods and Algorithms
Data Mining: Concepts, Models, Methods and Algorithms
COOLCAT: an entropy-based algorithm for categorical clustering
Proceedings of the eleventh international conference on Information and knowledge management
Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values
Data Mining and Knowledge Discovery
Efficient and Effective Clustering Methods for Spatial Data Mining
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
SSD '95 Proceedings of the 4th International Symposium on Advances in Spatial Databases
ROCK: A Robust Clustering Algorithm for Categorical Attributes
ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Catching the Trend: A Framework for Clustering Concept-Drifting Categorical Data
IEEE Transactions on Knowledge and Data Engineering
Least squares quantization in PCM
IEEE Transactions on Information Theory
Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Hi-index | 0.00 |
Identification of useful clusters in large datasets has attracted considerable interest in clustering process. Since data in the World Wide Web is increasing exponentially that affects on clustering accuracy and decision making, change in the concept between every cluster occurs named concept drift. The new data must be assigned to any one of generated clusters called data labeling. To say that data labeling was performed well the clusters must be efficient. Selecting initial cluster center (centroid) is the key factor that has high affection in generating effective clusters. So we are proposing approaches. Ore previous work was concentrated on finding minimum point that act as initial cluster centroid for single variable functions. This paper introduces a two methods simplex and graphical methods to select optimal cluster centroid for the linear functions of two variables and concludes with preferable method for functions of same nature. After finding initial cluster centroid we can then apply any existing clustering algorithm.