Identification of biochemical networks by S-tree based genetic programming

  • Authors:
  • Dong-Yeon Cho;Kwang-Hyun Cho;Byoung-Tak Zhang

  • Affiliations:
  • School of Computer Science and Engineering, Seoul National University Seoul 151-742, Korea;College of Medicine, Seoul National University Seoul 110-799, Korea;School of Computer Science and Engineering, Seoul National University Seoul 151-742, Korea

  • Venue:
  • Bioinformatics
  • Year:
  • 2006

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Most previous approaches to model biochemical networks have focused either on the characterization of a network structure with a number of components or on the estimation of kinetic parameters of a network with a relatively small number of components. For system-level understanding, however, we should examine both the interactions among the components and the dynamic behaviors of the components. A key obstacle to this simultaneous identification of the structure and parameters is the lack of data compared with the relatively large number of parameters to be estimated. Hence, there are many plausible networks for the given data, but most of them are not likely to exist in the real system. Results: We propose a new representation named S-trees for both the structural and dynamical modeling of a biochemical network within a unified scheme. We further present S-tree based genetic programming to identify the structure of a biochemical network and to estimate the corresponding parameter values at the same time. While other evolutionary algorithms require additional techniques for sparse structure identification, our approach can automatically assemble the sparse primitives of a biochemical network in an efficient way. We evaluate our algorithm on the dynamic profiles of an artificial genetic network. In 20 trials for four settings, we obtain the true structure and their relative squared errors are -2. To demonstrate the usefulness of the proposed algorithm for real experimental biological data, we provide an additional example on the transcriptional network of SOS response to DNA damage in Escherichia coli. We confirm that the proposed algorithm can successfully identify the true structure except only one relation. Availability: The executable program and data are available from the authors upon request. Contact:ckh-sb@snu.ac.kr or btzhang@snu.ac.kr