The evolutionary computation approach to motif discovery in biological sequences
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
In prior work, we presented GAMID, an extension of GAMI (Genetic Algorithms for Motif Inference), which allows the system to ignore some of the sequences when looking for candidate conserved motifs in noncoding DNA. This ability is useful both when looking for candidate motifs in co-expressed genes (where it is not expected that all genes respond to the same transcription factors) and when looking for candidate motifs in divergent species (where functional elements might appear only in related species). In these cases, we would like to allow the inferred motif to be present in only a subset of the input data. By excluding some sequences from the match process, GAMID succeeded at finding known functional elements. Here we use the results of experiments using artificial data with GAMID to show that GAMID's success in inferring motifs in subsets of the input data results in it finding fewer motifs when they are present in all the sequences. Therefore, GAMID is useful as an adjunct tool to GAMI, but is not a replacement for its functionality.