Algorithms for clustering data
Algorithms for clustering data
Recent trends in hierarchic document clustering: a critical review
Information Processing and Management: an International Journal
Probabilistic and genetic algorithms in document retrieval
Communications of the ACM
Adaptation in natural and artificial systems
Adaptation in natural and artificial systems
Scatter/Gather: a cluster-based approach to browsing large document collections
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Genetic algorithms + data structures = evolution programs (2nd, extended ed.)
Genetic algorithms + data structures = evolution programs (2nd, extended ed.)
Journal of the American Society for Information Science
Reexamining the cluster hypothesis: scatter/gather on retrieval results
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
On the merits of building categorization systems by supervised clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Grouper: a dynamic clustering interface to Web search results
WWW '99 Proceedings of the eighth international conference on World Wide Web
ACM Computing Surveys (CSUR)
Applying genetic algorithms to query optimization in document retrieval
Information Processing and Management: an International Journal
Communications of the ACM
Evaluating document clustering for interactive information retrieval
Proceedings of the tenth international conference on Information and knowledge management
Practical Handbook of Genetic Algorithms
Practical Handbook of Genetic Algorithms
Genetic Algorithms in Search, Optimization and Machine Learning
Genetic Algorithms in Search, Optimization and Machine Learning
Information Retrieval
Genetic Approach to Query Space Exploration
Information Retrieval
Web Search Using a Genetic Algorithm
IEEE Internet Computing
Hierarchically Classifying Documents Using Very Few Words
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Query Optimization in Information Retrieval Using Genetic Algorithms
Proceedings of the 5th International Conference on Genetic Algorithms
A New Study on Using HTML Structures to Improve Retrieval
ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Using the structure of HTML documents to improve retrieval
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Vague knowledge search in the design for outsourcing using fuzzy decision tree
Computers and Operations Research
Using genetic algorithms to evolve a population of topical queries
Information Processing and Management: an International Journal
Query Recommendation for Improving Search Engine Results
International Journal of Information Retrieval Research
State-of-the-art review on relevance of genetic algorithm to internet web search
Applied Computational Intelligence and Soft Computing
Hi-index | 0.01 |
The problem of obtaining relevant results in web searching has been tackled with several approaches. Although very effective techniques are currently used by the most popular search engines when no a priori knowledge on the user's desires beside the search keywords is available, in different settings it is conceivable to design search methods that operate on a thematic database of web pages that refer to a common body of knowledge or to specific sets of users. We have considered such premises to design and develop a search method that deploys data mining and optimization techniques to provide a more significant and restricted set of pages as the final result of a user search. We adopt a vectorization method based on search context and user profile to apply clustering techniques that are then refined by a specially designed genetic algorithm. In this paper we describe the method, its implementation, the algorithms applied, and discuss some experiments that has been run on test sets of web pages.