Discovery of exogenous variables in data with more variables than observations

  • Authors:
  • Yasuhiro Sogawa;Shohei Shimizu;Aapo Hyvärinen;Takashi Washio;Teppei Shimamura;Seiya Imoto

  • Affiliations:
  • The Institute of Scientific and Industrial Research, Osaka University, Japan;The Institute of Scientific and Industrial Research, Osaka University, Japan;Dept. Comp. Sci. and Dept. Math. and Stat., University of Helsinki, Finland;The Institute of Scientific and Industrial Research, Osaka University, Japan;Human Genome Center, Institute of Medical Science, University of Tokyo, Japan;Human Genome Center, Institute of Medical Science, University of Tokyo, Japan

  • Venue:
  • ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part I
  • Year:
  • 2010

Abstract

Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations. However, modern datasets, including gene expression data, increase the need for high-dimensional causal modeling in challenging situations with orders of magnitude more variables than observations. In this paper, we propose a method to find exogenous variables in a linear non-Gaussian causal model. The method requires much smaller sample sizes than conventional methods and works even when there are orders of magnitude more variables than observations. Exogenous variables work as triggers that activate causal chains in the model, and their identification leads to more efficient experimental designs and a better understanding of the causal mechanism. We present experiments with artificial data and real-world gene expression data to evaluate the method.
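
To make the terms in the abstract concrete, the following is a minimal sketch of a linear non-Gaussian causal model, written as x = Bx + e, in which an exogenous variable is one with no parents (an all-zero row of B), so it equals its own noise term. This is an illustration only and not the estimation method proposed in the paper: the function names `simulate_linear_non_gaussian` and `exogenous_indices`, the toy matrix `B`, and the choice of Laplace noise are all assumptions made for this example, and here the exogenous variables are read off the known structure rather than discovered from data.

```python
import numpy as np


def simulate_linear_non_gaussian(B, n_samples, rng):
    """Draw samples from x = B x + e with Laplace (non-Gaussian) noise.

    B[i, j] is the direct effect of variable j on variable i.
    Returns (x, e), each of shape (n_variables, n_samples).
    """
    p = B.shape[0]
    e = rng.laplace(size=(p, n_samples))
    # Solve (I - B) x = e, which is equivalent to x = B x + e.
    x = np.linalg.solve(np.eye(p) - B, e)
    return x, e


def exogenous_indices(B):
    """Return indices of variables with no parents (all-zero rows of B)."""
    return [i for i in range(B.shape[0]) if not np.any(B[i])]


# Hypothetical toy structure: x0 -> x1 -> x2 and x0 -> x2.
B = np.array([
    [0.0, 0.0, 0.0],   # x0 has no parents, so it is exogenous
    [0.8, 0.0, 0.0],   # x1 depends on x0
    [0.5, -0.6, 0.0],  # x2 depends on x0 and x1
])

rng = np.random.default_rng(0)
x, e = simulate_linear_non_gaussian(B, n_samples=10, rng=rng)
exo = exogenous_indices(B)
```

In this toy model `exo` contains only index 0, and the samples of `x[0]` coincide with its noise `e[0]`, which is what "exogenous" means here. The paper addresses the harder problem of identifying such variables from data alone when B is unknown and the number of variables far exceeds the number of observations.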