Conditional random fields for predicting and analyzing histone occupancy, acetylation and methylation areas in DNA sequences

  • Authors:
  • Dang Hung Tran;Tho Hoan Pham;Kenji Satou;Tu Bao Ho

  • Affiliations:
  • School of Knowledge Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan;Faculty of Information Technology, Hanoi University of Pedagogy, Hanoi, Vietnam;School of Knowledge Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan;School of Knowledge Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan

  • Venue:
  • EuroGP'06 Proceedings of the 2006 international conference on Applications of Evolutionary Computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Eukaryotic genomes are packaged by the wrapping of DNA around histone octamers to form nucleosomes. Nucleosome occupancies together with their acetylation and methylation are important modification factors on all nuclear processes involving DNA. There have been recently many studies of mapping these modifications in DNA sequences and of relationship between them and various genetic activities, such as transcription, DNA repair, and DNA remodeling. However, most of these studies are experimental approaches. In this paper, we introduce a computational approach to both predicting and analyzing nucleosome occupancy, acetylation, and methylation areas in DNA sequences. Our method employs conditional random fields (CRFs) to discriminate between DNA areas with high and low relative occupancy, acetylation, or methylation; and rank features of DNA sequences based on their weight in the CRFs model trained from the datasets of these DNA modifications. The results from our method on the yeast genome reveal genetic area preferences of nucleosome occupancy, acetylation, and methylation are consistent with previous studies.