Predictive model of insolvency risk for Australian corporations

  • Authors:
  • Rohan A. Baxter;Mark Gawler;Russell Ang

  • Affiliations:
  • Australian Taxation Office, ACT;Australian Taxation Office, ACT;Australian Taxation Office, ACT

  • Venue:
  • AusDM '07 Proceedings of the sixth Australasian conference on Data mining and analytics - Volume 70
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper describes the development of a predictive model for corporate insolvency risk in Australia. The model building methodology is empirical with out-of-sample future year test sets. The regression method used is logistic regression after pre-processing by quantisation of interval (or numeric) attributes. We show that logistic regression matches the performance of ensemble methods, such as random forests and ada boost, provided that pre-processing and variable selection is performed. A distinctive feature of the insolvency risk model described in this paper is its breadth; since we are using income tax return data we are able to risk score one million companies across all industries, all corporation types (public, private) and all sizes, as measured either by assets or number of employees. This is an application paper that uses standard credit scoring methodology on a new data source. The contribution is to demonstrate that insolvency risk can be estimated using income tax return data. The corporate insolvency prediction model is still in development and so we welcome suggestions for improvements on the current methodology.