An approach to dimensionality reduction in time series

Authors:
Maciej Krawczak; Grażyna Szkatuła
Affiliations:
-;-
Venue:
Information Sciences: an International Journal
Year:
2014

Abstract

Many methods for reducing the dimensionality of data series (time series) have been introduced over the past decades. Some of them rely on a symbolic representation of the original data; in that case, however, the dimensionality reduction obtained is not substantial. In this paper, we introduce a new approach, referred to as Symbolic Essential Attributes Approximation (SEAA), to reduce the dimensionality of multidimensional time series, thereby forming a new nominal representation of the original data series. The approach is based on the concept of data series envelopes and on essential attributes generated by a multilayer neural network. The real-valued attributes are discretized, which yields a symbolic representation of the data series. SEAA generates a vector of nominal values of the new attributes, which forms the compressed representation of the original data series. The nominal attributes are synthetic and, although not directly interpretable, still retain important features of the original data series. The usefulness of the proposed dimensionality reduction is validated on classification and clustering tasks. The experiments show that, even under a significant reduction of dimensionality, the new representation retains enough information about the data series for classification and clustering.
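
The discretization step mentioned in the abstract can be illustrated with a minimal sketch. It is not the SEAA procedure itself: the abstract does not say how the real-valued attributes are discretized, so equal-width binning per attribute is assumed here, and the alphabet size, the function names (`equal_width_cuts`, `discretize`), and the sample values are all hypothetical. The input stands in for a few real-valued "essential attribute" vectors of the kind a neural network might produce.

```python
from bisect import bisect_right
from typing import List, Sequence

ALPHABET = "abcd"  # assumed alphabet size; the abstract does not fix one


def equal_width_cuts(values: Sequence[float], n_bins: int) -> List[float]:
    """Return the n_bins - 1 interior cut points of an equal-width partition."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + width * k for k in range(1, n_bins)]


def discretize(rows: Sequence[Sequence[float]], alphabet: str = ALPHABET) -> List[str]:
    """Map each real-valued attribute vector to a string of nominal symbols.

    Cut points are computed separately for each attribute (column),
    using the values of that attribute across all rows.
    """
    columns = list(zip(*rows))
    cuts = [equal_width_cuts(col, len(alphabet)) for col in columns]
    return [
        "".join(alphabet[bisect_right(cuts[j], v)] for j, v in enumerate(row))
        for row in rows
    ]


# Three made-up attribute vectors (two attributes each), for illustration only.
features = [
    [0.05, 0.90],
    [0.45, 0.10],
    [0.95, 0.55],
]
symbols = discretize(features)
print(symbols)  # ['ad', 'ba', 'dc']
```

Each series is thereby reduced to a short vector of nominal values, one symbol per attribute. Other binning schemes, such as equal-frequency cut points, could be substituted without changing the overall shape of the sketch.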