Foundations of genetic programming
Foundations of genetic programming
Genetic Programming and Data Structures: Genetic Programming + Data Structures = Automatic Programming!
A Field Guide to Genetic Programming
A Field Guide to Genetic Programming
A Survey of Spatial Defects in Homo Sapiens Affymetrix GeneChips
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Genetic programming needs better benchmarks
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Automatic generation of regular expressions from examples with genetic programming
Proceedings of the 14th annual conference companion on Genetic and evolutionary computation
Hi-index | 0.00 |
RNAnet [3] http://bioinformatics.essex.ac.uk/users/wlangdon/rnanet/ allows the user to calculate correlations of gene expression, both between genes and between components within genes. We investigate all of Ensembl http://www.ensembl.org and find all the Homo Sapiens exons for which there are sufficient robust Affymetrix HG-U133 Plus 2 GeneChip probes. Calculating correlation between mRNA probe measurements for the same exon shows many exons whose components are consistently up regulated and down regulated. However we identify other Ensembl exons where sub-regions within them are self consistent but these transcript blocks are not well correlated with other blocks in the same exon. We suggest many current Ensembl exon definitions are incomplete. Secondly, having identified exon with substructure we use machine learning to try and identify patterns in the DNA sequence lying between blocks of high correlation which might yield biological or technological explanations. A Backus-Naur form (BNF) context-free grammar constrains strongly typed genetic programming (STGP) to evolve biological motifs in the form of regular expressions (RE) (e.g. TCTTT) which classify gene exons with potential alternative mRNA expression from those without. We show biological patterns can be data mined by a GP written in gawk and using egrep from NCBI's GEO http://www.ncbi.nlm.nih.gov/geo/ database. The automatically produced DNA motifs suggest that alternative polyadenylation is not responsible. (Full version in TR-09-02 [7].) Blocky exons can be found in http://bioinformatics.essex.ac.uk/users/wlangdon/tr-09-02.tar.gz