Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Incremental relevance feedback
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Measuring the informativeness of a retrieval process
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
The effect of adding relevance information in a relevance feedback environment
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory
The nature of statistical learning theory
Evaluating and optimizing autonomous text classification systems
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Information storage and retrieval
Information storage and retrieval
Journal of the American Society for Information Science - Special topic issue on the history of documentation and information science: part II
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Boosting and Rocchio applied to text filtering
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Less is More: Active Learning with Support Vector Machines
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Support Vector Machine Active Learning with Application sto Text Classification
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Estimation of Dependences Based on Empirical Data: Springer Series in Statistics (Springer Series in Statistics)
Support vector machines for spam categorization
IEEE Transactions on Neural Networks
Applications of Support Vector Machines for Pattern Recognition: A Survey
SVM '02 Proceedings of the First International Workshop on Pattern Recognition with Support Vector Machines
Genetic algorithms in relevance feedback: a second test and new contributions
Information Processing and Management: an International Journal
Filtering search results using an optimal set of terms identified by an artificial neural network
Information Processing and Management: an International Journal
Interactive relevance feedback mechanism for image retrieval using rough set
Knowledge-Based Systems
Content personalization and adaptation for three-screen services
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
An innovative analyser for multi-classifier e-mail classification based on grey list analysis
Journal of Network and Computer Applications
Learning to Rank for Information Retrieval
Foundations and Trends in Information Retrieval
Filtering search results using an optimal set of terms identified by an artificial neural network
Information Processing and Management: an International Journal
Architecture of adaptive spam filtering based on machine learning algorithms
ICA3PP'07 Proceedings of the 7th international conference on Algorithms and architectures for parallel processing
A multi-tier phishing detection and filtering approach
Journal of Network and Computer Applications
Hi-index | 0.00 |
We compare support vector machines (SVMs) to Rocchio, Ide regular and Ide dec-hi algorithms in information retrieval (IR) of text documents using relevancy feedback. It is assumed a preliminary search finds a set of documents that the user marks as relevant or not and then feedback iterations commence. Particular attention is paid to IR searches where the number of relevant documents in the database is low and the preliminary set of documents used to start the search has few relevant documents. Experiments show that if inverse document frequency (IDF) weighting is not used because one is unwilling to pay the time penalty needed to obtain these features, then SVMs are better whether using term-frequency (TF) or binary weighting. SVM performance is marginally better than Ide dec-hi if TF-IDF weighting is used and there is a reasonable number of relevant documents found in the preliminary search. If the preliminary search is so poor that one has to search through many documents to find at least one relevant document, then SVM is preferred.