Computer Networks: The International Journal of Computer and Telecommunications Networking
Hi-index | 0.00 |
Pattern matching with wildcards is very important in many fields such as information retrieval and bioinformatics. Suffix trees are used in pattern matching with variable length wildcards. But the construction of a suffix tree needs significant time and space overload. This paper presents a new pattern matching algorithm, PST, based on multiple suffix trees. The PST algorithm uses a cutting process to divide a string S into several parts firstly, and then establishes a suffix tree for each part of S respectively. If multiple patterns are to be retrieved, the suffix trees should be adjusted according to the cutting points: prefix sequence deletion and suffix sequence addition; prefix sequence addition and suffix sequence deletion. Theoretical analysis and experiments show that the PST algorithm can decrease the time and space overload than other peers.