On the power of universal bases in sequencing by hybridization
RECOMB '99 Proceedings of the third annual international conference on Computational molecular biology
MFCS '94 Proceedings of the 19th International Symposium on Mathematical Foundations of Computer Science 1994
Optimal sequencing by hybridization in rounds
RECOMB '01 Proceedings of the fifth annual international conference on Computational biology
Restricting SBH Ambiguity via Restriction Enzymes
WABI '02 Proceedings of the Second International Workshop on Algorithms in Bioinformatics
Better Filtering with Gapped q-Grams
CPM '01 Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching
Better filtering with gapped q-grams
Fundamenta Informaticae - Special issue on computing patterns in strings
Sensitivity analysis and efficient method for identifying optimal spaced seeds
Journal of Computer and System Sciences
On spaced seeds for similarity search
Discrete Applied Mathematics
Accelerating DNA sequencing-by-hybridization with noise
Proceedings of the 5th international workshop on Bioinformatics
Restricting SBH ambiguity via restriction enzymes
Discrete Applied Mathematics
Better Filtering with Gapped q-Grams
Fundamenta Informaticae - Computing Patterns in Strings
Hi-index | 0.00 |
In a recent paper [PFU99) we have introduced a novel probing scheme for DNA sequencing by hybridization (SBH). The new gapped-probe scheme combines natural and universal bases in a well defined periodic pattern. It was shown in [PFU99] that the performance of the gapped-probe scheme (in terms of the length of a sequence that can be uniquely reconstructed using a given library size of probes) is significantly better than the standard scheme based on oligomer probes.In this paper we present and analyze a new, more powerful, sequencing algorithm for the gapped-probe scheme. We prove that the new algorithm exploits the full potential of the SBH technology with high-confidence performance, that comes within a small constant factor (about 2) of the information-theory bound. Moreover, this performance is achieved while maintaining running time linear in the target sequence length.