IEEE Internet Computing
Contextual classifier ensembles
BIS'07 Proceedings of the 10th international conference on Business information systems
Hi-index | 0.00 |
Error-Correcting Output Codes (ECOC) are commonly used to decompose a multi-category problem into many dichotomies. Therefore, the text categorisation task is performed by an ensemble of binary classifiers instead of a single monolithic classifier. The ensemble performance largely depends on the characteristics of the decomposition. We propose a decomposition approach where both the categories and the classifiers are well separated in order to maximise the decision boundaries and minimise correlated predictions. We apply this design to the El Mundo corpus (newspaper articles in Spanish) and the well-known ModApté split of the Reuters-21578 corpus. The results using ensembles are favourably compared to those using a monolithic classifier.