Ensembles of Cascading Trees

  • Authors:
  • Jinyan Li;Huiqing Liu

  • Affiliations:
  • -;-

  • Venue:
  • ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a new method, called CS4, to constructcommittees of decision trees for classification. The methodconsiders different top-ranked features as the root nodes ofmember trees. This idea is particularly suitable for dealingwith high-dimensional bio-medical data as top-ranked featuresin this type of data usually possess similar merits forclassification. To make a decision, the committee combinesthe power of individual trees in a weighted manner. UnlikeBagging or Boosting which uses bootstrapped trainingdata, our method builds all the member trees of a committeeusing exactly the same set of training data. We have testedthese ideas on UCI data sets as well as recent bio-medicaldata sets of gene expression or proteomic profiles that areusually described by more than 10,000 features. All the experimentalresults show that our method is efficient and thatthe classification performance are superior to C4.5 familyalgorithms.