Searching for similar documents plays a crucial role in document management. This paper aims to develop a fast, high-quality method for finding similar documents in large document collections, based on fuzzy clustering. To meet these requirements, a two-layer structure is proposed. Conventional approaches determine document similarity by word-by-word comparison; the proposed method instead passes documents through the two-layer structure. In this system, predefined fuzzy clusters are used to extract feature vectors of related documents, and similarity is estimated from these vectors using a proposed distance-based similarity measure. Empirical results show that the proposed system, with its new similarity measure, performs better than conventional similarity measurement systems.
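As a rough illustration of the idea, the sketch below turns each document into a vector of memberships in predefined fuzzy clusters and then compares documents by the distance between those vectors. The abstract does not give the formulas, so everything here is an assumption: the membership rule is the standard fuzzy c-means one, the similarity is a simple 1 / (1 + distance) mapping, and the names `fuzzy_memberships`, `similarity`, and the fuzzifier `m` are hypothetical. It is not the paper's actual measure.

```python
import math


def fuzzy_memberships(doc_vec, centers, m=2.0):
    """Membership of one document in each predefined cluster.

    Assumption: standard fuzzy c-means membership with fuzzifier m > 1;
    the abstract does not specify the rule actually used.
    """
    dists = [math.dist(doc_vec, c) for c in centers]
    # A document lying exactly on a center belongs fully to that cluster.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        total = sum(hits)
        return [h / total for h in hits]
    power = 2.0 / (m - 1.0)
    inverse = [d ** -power for d in dists]
    total = sum(inverse)
    return [v / total for v in inverse]


def similarity(u, v):
    """Distance-based similarity in (0, 1].

    Assumption: 1 / (1 + Euclidean distance) stands in for the
    distance-based measure the abstract mentions but does not define.
    """
    return 1.0 / (1.0 + math.dist(u, v))


# Hypothetical cluster centers and document vectors (e.g. term weights).
centers = [[1.0, 0.0], [0.0, 1.0]]
doc_a = fuzzy_memberships([0.9, 0.1], centers)
doc_b = fuzzy_memberships([0.8, 0.2], centers)
doc_c = fuzzy_memberships([0.1, 0.9], centers)

print(similarity(doc_a, doc_b))  # both documents lean toward the first cluster
print(similarity(doc_a, doc_c))  # documents lean toward different clusters
```

Comparing short membership vectors rather than full word lists is what would make such a scheme cheaper than word-by-word comparison, since the vector length equals the number of clusters rather than the vocabulary size.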