Probabilistic and genetic algorithms in document retrieval
Communications of the ACM
Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Class-based n-gram models of natural language
Computational Linguistics
Query expansion using local and global document analysis
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
An association-based method for automatic indexing with a controlled vocabulary
Journal of the American Society for Information Science
Effective Data Mining Using Neural Networks
IEEE Transactions on Knowledge and Data Engineering
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
COMPSAC '98 Proceedings of the 22nd International Computer Software and Applications Conference
Feature Selection Using Association Word Mining for Classification
DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
Visualizing Association Rules for Text Mining
INFOVIS '99 Proceedings of the 1999 IEEE Symposium on Information Visualization
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Improving system performance in case-based iterative optimization through knowledge filtering
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Optimal associative neighbor mining using attributes for ubiquitous recommendation systems
FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems
Hi-index | 0.00 |
Query expansion in knowledge based on information retrieval system requires knowledge base being considered semantic relations between words. Since Apriori algorithm extracts association word without taking user preference into account, recall is improved but accuracy is reduced. This paper shows how to establish optimized association word knowledge base with improved accuracy only including association word that users prefer among association words being considered semantic relations between words. Toward this end, web documents related to computer are classified into eight classes, and nouns are extracted from web document of each class. Association word is extracted from nouns through Apriori algorithm, and association word that users do not favor is excluded from knowledge base through genetic algorithm.