Incorporating prior domain knowledge into a kernel based feature selection algorithm

  • Authors:
  • Ting Yu;Simeon J. Simoff;Donald Stokes

  • Affiliations:
  • The Faculty of Information Technology and School of Accounting, University of Technology, Sydney, Broadway, NSW, Australia and Capital Markets Cooperative Research Centre, Australia;The Faculty of Information Technology and School of Accounting, University of Technology, Sydney, Broadway, NSW, Australia and Capital Markets Cooperative Research Centre, Australia;The Faculty of Information Technology and School of Accounting, University of Technology, Sydney, Broadway, NSW, Australia and Capital Markets Cooperative Research Centre, Australia

  • Venue:
  • PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a new method of incorporating prior domain knowledge into a kernel based feature selection algorithm. The proposed feature selection algorithm combines the Fast Correlation-Based Filter (FCBF) and the kernel methods in order to uncover an optimal subset of features for the support vector regression. In the proposed algorithm, the Kernel Canonical Correlation Analysis (KCCA) is employed as a measurement of mutual information between feature candidates. Domain knowledge in forms of constraints is used to guide the tuning of the KCCA. In the second experiments, the audit quality research carried by Yang Li and Donald Stokes [1] provides the domain knowledge, and the result extends the original subset of features.