ABC-SG: a new artificial bee colony algorithm-based distance of sequential data using sigma grams

Authors:
Muhammad Marwan Muhammad Fuad
Affiliations:
Norwegian University of Science and Technology (NTNU), Trondheim, Norway
Venue:
AusDM '12 Proceedings of the Tenth Australasian Data Mining Conference - Volume 134
Year:
2012

Citing 9
Cited 0

The String-to-String Correction Problem

Journal of the ACM (JACM)
Fast Time Sequence Indexing for Arbitrary Lp Norms

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
A symbolic representation of time series, with implications for streaming algorithms

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Similarity Search: The Metric Space Approach (Advances in Database Systems)

Similarity Search: The Metric Space Approach (Advances in Database Systems)
A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm

Journal of Global Optimization
Artificial Bee Colony (ABC) Optimization Algorithm for Solving Constrained Optimization Problems

IFSA '07 Proceedings of the 12th international Fuzzy Systems Association world congress on Foundations of Fuzzy Logic and Soft Computing
Extending the Edit Distance Using Frequencies of Common Characters

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Querying and mining of time series data: experimental comparison of representations and distance measures

Proceedings of the VLDB Endowment
The multi-resolution extended edit distance

Proceedings of the 3rd international conference on Scalable information systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The problem of similarity search is one of the main problems in computer science. This problem has many applications in text-retrieval, web search, computational biology, bioinformatics and others. Similarity between two data objects can be depicted using a similarity measure or a distance metric. There are numerous distance metrics in the literature, some are used for a particular data type, and others are more general. In this paper we present a new distance metric for sequential data which is based on the sum of n-grams. The novelty of our distance is that these n-grams are weighted using artificial bee colony; a recent optimization algorithm based on the collective intelligence of a swarm of bees on their search for nectar. This algorithm has been used in optimizing a large number of numerical problems. We validate the new distance experimentally.