Feature selection for ranking

Authors:
Xiubo Geng;Tie-Yan Liu;Tao Qin;Hang Li
Affiliations:
Microsoft Research Asia, Beijing, China and Institute of Computing Technology, Beijing, China;Microsoft Research Asia, Beijing, China;Microsoft Research Asia, Beijing, China and Tsinghua University, Beijing, China;Microsoft Research Asia, Beijing, China
Venue:
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2007

Citing 23
Cited 41

OHSUMED: an interactive retrieval evaluation and new large test collection for research

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Selection of relevant features and examples in machine learning

Artificial Intelligence - Special issue on relevance
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Making large-scale support vector machine learning practical

Advances in kernel methods
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Modern Information Retrieval

Modern Information Retrieval
Cumulated gain-based evaluation of IR techniques

ACM Transactions on Information Systems (TOIS)
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Feature Selection for Unbalanced Class Distribution and Naive Bayes

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
The concept of relevance in IR

Journal of the American Society for Information Science and Technology
An introduction to variable and feature selection

The Journal of Machine Learning Research
An extensive empirical study of feature selection metrics for text classification

The Journal of Machine Learning Research
Discriminative models for information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Feature selection, L1 vs. L2 regularization, and rotational invariance

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Exploiting the hierarchical structure for link analysis

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A study of relevance propagation for web search

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Learning to rank using gradient descent

ICML '05 Proceedings of the 22nd international conference on Machine learning
TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing)

TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing)
Adapting ranking SVM to document retrieval

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Pattern Recognition, Third Edition

Pattern Recognition, Third Edition
Input feature selection for classification problems

IEEE Transactions on Neural Networks

Learning to rank relational objects and its application to web search

Proceedings of the 17th international conference on World Wide Web
Learning to rank with partially-labeled data

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
On profiling blogs with representative entries

Proceedings of the second workshop on Analytics for noisy unstructured text data
Locally Adaptive Neighborhood Selection for Collaborative Filtering Recommendations

AH '08 Proceedings of the 5th international conference on Adaptive Hypermedia and Adaptive Web-Based Systems
Scalable Feature Selection for Multi-class Problems

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Representative entry selection for profiling blogs

Proceedings of the 17th ACM conference on Information and knowledge management
Comparison of Feature Construction Methods for Video Relevance Prediction

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Multi-facet Rating of Product Reviews

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Preferential text classification: learning algorithms and evaluation measures

Information Retrieval
Learning to Rank for Information Retrieval

Foundations and Trends in Information Retrieval
Rank Aggregation Based Text Feature Selection

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Efficient feature weighting methods for ranking

Proceedings of the 18th ACM conference on Information and knowledge management
Feature selection for ranking using boosted trees

Proceedings of the 18th ACM conference on Information and knowledge management
Enabling multi-level relevance feedback on pubmed by integrating rank learning into DBMS

Proceedings of the third international workshop on Data and text mining in bioinformatics
Weighted Rank Correlation in Information Retrieval Evaluation

AIRS '09 Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology
Online reranking via ordinal informative concepts for context fusion in concept detection and video search

IEEE Transactions on Circuits and Systems for Video Technology
Hierarchical feature selection for ranking

Proceedings of the 19th international conference on World wide web
An axiomatic approach to exploit term dependencies in language model

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Robust observation selection for intrusion detection

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Extracting temporal signatures for comprehending systems biology models

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Ranking under temporal constraints

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Rank learning for factoid question answering with linguistic and semantic constraints

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Feature selection under learning to rank model for multimedia retrieve

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
TAKES: a fast method to select features in the kernel space

Proceedings of the 20th ACM international conference on Information and knowledge management
Learning to rank results in relational keyword search

Proceedings of the 20th ACM international conference on Information and knowledge management
Learning location naming from user check-in histories

Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Association rule-based feature selection method for Alzheimer's disease diagnosis

Expert Systems with Applications: An International Journal
Visual interactive failure analysis: supporting users in information retrieval evaluation

Proceedings of the 4th Information Interaction in Context Symposium
Mammographic parenchymal texture analysis for estrogen-receptor subtype specific breast cancer risk estimation

IWDM'12 Proceedings of the 11th international conference on Breast Imaging
Feature selection for link prediction

Proceedings of the 5th Ph.D. workshop on Information and knowledge
User guided entity similarity search using meta-path selection in heterogeneous information networks

Proceedings of the 21st ACM international conference on Information and knowledge management
On the usefulness of query features for learning to rank

Proceedings of the 21st ACM international conference on Information and knowledge management
A zipf-like distant supervision approach for multi-document summarization using wikinews articles

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
Can social features help learning to rank youtube videos?

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
A comprehensive study on learning to rank for content-based image retrieval

Signal Processing
Ordinal regularized manifold feature extraction for image ranking

Signal Processing
Truncated power method for sparse eigenvalue problems

The Journal of Machine Learning Research
Learning relatedness measures for entity linking

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
A computer vision framework for finger-tapping evaluation in Parkinson's disease

Artificial Intelligence in Medicine
Feature selection for ordinal text classification

Neural Computation
Predicting community preference of comments on the Social Web

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ranking is a very important topic in information retrieval. While algorithms for learning ranking models have been intensively studied, this is not the case for feature selection, despite of its importance. The reality is that many feature selection methods used in classification are directly applied to ranking. We argue that because of the striking differences between ranking and classification, it is better to develop different feature selection methods for ranking. To this end, we propose a new feature selection method in this paper. Specifically, for each feature we use its value to rank the training instances, and define the ranking accuracy in terms of a performance measure or a loss function as the importance of the feature. We also define the correlation between the ranking results of two features as the similarity between them. Based on the definitions, we formulate the feature selection issue as an optimization problem, for which it is to find the features with maximum total importance scores and minimum total similarity scores. We also demonstrate how to solve the optimization problem in an efficient way. We have tested the effectiveness of our feature selection method on two information retrieval datasets and with two ranking models. Experimental results show that our method can outperform traditional feature selection methods for the ranking task.