In search of optimal clusters using genetic algorithms
Pattern Recognition Letters
Concept decompositions for large sparse text data using clustering
Machine Learning
A Document Clustering Method Based on Hierarchical Algorithm with Model Clustering
AINAW '08 Proceedings of the 22nd International Conference on Advanced Information Networking and Applications - Workshops
An Improved Fuzzy Clustering Method for Text Mining
NSWCTC '10 Proceedings of the 2010 Second International Conference on Networks Security, Wireless Communications and Trusted Computing - Volume 01
Low-complexity fuzzy relational clustering algorithms for Web mining
IEEE Transactions on Fuzzy Systems
OntoClippy: A User-Friendly Ontology Design and Creation Methodology
International Journal of Intelligent Information Technologies
Construction of Domain Ontologies: Sourcing the World Wide Web
International Journal of Intelligent Information Technologies
International Journal of Intelligent Information Technologies
Deontic Logic Based Ontology Alignment Technique for E-Learning
International Journal of Intelligent Information Technologies
Hi-index | 0.00 |
The increase in the number of documents has aggravated the difficulty of classifying those documents according to specific needs. Clustering analysis in a distributed environment is a thrust area in artificial intelligence and data mining. Its fundamental task is to utilize characters to compute the degree of related corresponding relationship between objects and to accomplish automatic classification without earlier knowledge. Document clustering utilizes clustering technique to gather the documents of high resemblance collectively by computing the documents resemblance. Recent studies have shown that ontologies are useful in improving the performance of document clustering. Ontology is concerned with the conceptualization of a domain into an individual identifiable format and machine-readable format containing entities, attributes, relationships, and axioms. By analyzing types of techniques for document clustering, a better clustering technique depending on Genetic Algorithm GA is determined. Non-Dominated Ranked Genetic Algorithm NRGA is used in this paper for clustering, which has the capability of providing a better classification result. The experiment is conducted in 20 newsgroups data set for evaluating the proposed technique. The result shows that the proposed approach is very effective in clustering the documents in the distributed environment.