GAPK: genetic algorithms with prior knowledge for motif discovery in DNA sequences

Authors:
Dianhui Wang;Xi Li
Affiliations:
Department of Computer Science and Computer Engineering, La Trobe University, Melbourne, Victoria, Australia;Department of Computer Science and Computer Engineering, La Trobe Univ., Melbourne, Victoria, Australia and Department of Primary Industries, Bioscience Research Division, Victorian AgriBioscience ...
Venue:
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Year:
2009

Citing 11
Cited 2

Performance standards and evaluations in IR test collections: cluster-based retrieval models

Information Processing and Management: an International Journal
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
FMGA: Finding Motifs by Genetic Algorithm

BIBE '04 Proceedings of the 4th IEEE Symposium on Bioinformatics and Bioengineering
MDGA: motif discovery using a genetic algorithm

GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Transcription factor binding site identification using the self-organizing map

Bioinformatics
GAME: detecting cis-regulatory elements using a genetic algorithm

Bioinformatics
An Evaluation of Information Content as a Metric for the Inference of Putative Conserved Noncoding Regions in DNA Sequences Using a Genetic Algorithms Approach

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
GAPWM

Bioinformatics
Quality estimation of multiple sequence alignments by Bayesian hypothesis testing

Bioinformatics
TFBS identification based on genetic algorithm with combined representations and adaptive post-processing

Bioinformatics
Motif discoveries in unaligned molecular sequences using self-organizing neural networks

IEEE Transactions on Neural Networks

iGAPK: improved GAPK algorithm for regulatory DNA motif discovery

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: models and applications - Volume Part II
Gravitational search algorithm-based design of fuzzy control systems with a reduced parametric sensitivity

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Discovery of transcription factor binding sites (TFBSs) or DNA motifs in promoter regions of genes plays a key role in understanding the regulations of gene expression. In the past decade computational approaches, including evolutionary computation techniques, for searching for motifs have demonstrated good potential, and some results reported in literature are quite promising. Recently, some favorable progresses on evolutionary mining of motifs have been made and documented in GAME and GALF-P, where GAME employs a Bayesian-based scoring function and GALF-P aims to improve the algorithm performance with local filtering and adaptive post-processing. To improve discovering performance in terms of the recall, precision rates and algorithm reliability, this paper presents an alternative genetic algorithm termed as GAPK for resolving the problem of motifs discovery. In our proposed GAPK framework, a prior knowledge on motifs in a given dataset is used to initialize a population. Our technical contributions include a matrix representation for k-mers, a mismatch-based filtering method for search space reduction, a model mismatch score (MMS) as fitness function, new genetic operations and a model refinement processing. Some benchmarked datasets associated with eight transcription factors are used in our experiments. Comparative studies were carried out with well-known tools including GAME, GALF-P, MEME, MDScan and AlignACE. Results show that our method outperforms other techniques in terms of F-measure.