Decision forest for classification of gene expression data

  • Authors:
  • Jianping Huang; Hong Fang; Xiaohui Fan

  • Affiliations:
  • Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, 388 YuHangTang Road, Hangzhou 310058, China and National Center for Toxicological Research, U.S. Food ...; Z-Tech Corporation, An ICF International Company at NCTR/FDA, 3900 NCTR Road, Jefferson, AR 72079, USA; Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, 388 YuHangTang Road, Hangzhou 310058, China

  • Venue:
  • Computers in Biology and Medicine
  • Year:
  • 2010

Abstract

This study proposes an improved decision forest (IDF) with an integrated graphical user interface. On four gene expression data sets, the IDF not only outperforms the original decision forest but is also superior or comparable to other state-of-the-art machine learning methods, especially on high-dimensional data. With a built-in feature selection (FS) mechanism and fewer parameters to tune, it can be trained more efficiently than methods such as support vector machines, and it can be built with far fewer trees than other popular tree-based ensemble methods. Moreover, it suffers less from the curse of dimensionality.