Hypothesis-Driven Constructive Induction in AQ17-HCI: A Method and Experiments
Machine Learning - Special issue on evaluating and changing representation
Fast subsequence matching in time-series databases
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
A statistical perspective on knowledge discovery in databases
Advances in knowledge discovery and data mining
Locally adaptive dimensionality reduction for indexing large time series databases
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
An integer programming approach to inductive learning using genetic and greedy algorithms
New learning paradigms in soft computing
Approximate Queries and Representations for Large Data Sequences
ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
When Is ''Nearest Neighbor'' Meaningful?
ICDT '99 Proceedings of the 7th International Conference on Database Theory
Learning First Order Logic Time Series Classifiers: Rules and Boosting
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Pattern Extraction for Time Series Classification
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Fast Time Sequence Indexing for Arbitrary Lp Norms
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
On the need for time series data mining benchmarks: a survey and empirical demonstration
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
SSDBM '00 Proceedings of the 12th International Conference on Scientific and Statistical Database Management
Efficient Time Series Matching by Wavelets
ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Fast similarity search in the presence of longitudinal scaling in time series databases
ICTAI '97 Proceedings of the 9th International Conference on Tools with Artificial Intelligence
A symbolic representation of time series, with implications for streaming algorithms
DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Benchmarking Attribute Selection Techniques for Discrete Class Data Mining
IEEE Transactions on Knowledge and Data Engineering
Interval and dynamic time warping-based decision trees
Proceedings of the 2004 ACM symposium on Applied computing
Distance-function design and fusion for sequence data
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Parallel Coordinates: Visual Multidimensional Geometry and Its Applications
Parallel Coordinates: Visual Multidimensional Geometry and Its Applications
Feature Subset Selection and Feature Ranking for Multivariate Time Series
IEEE Transactions on Knowledge and Data Engineering
On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Using multi-scale histograms to answer pattern existence and shape match queries
SSDBM'2005 Proceedings of the 17th international conference on Scientific and statistical database management
A Bit Level Representation for Time Series Data Mining with Shape Based Similarity
Data Mining and Knowledge Discovery
Fast time series classification using numerosity reduction
ICML '06 Proceedings of the 23rd international conference on Machine learning
Neural Networks
Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing)
Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing)
Experiencing SAX: a novel symbolic representation of time series
Data Mining and Knowledge Discovery
Constructive induction on decision trees
IJCAI'89 Proceedings of the 11th international joint conference on Artificial intelligence - Volume 1
Information science: On the choice of sampling rates in parametric identification of time series
Information Sciences: an International Journal
A review on time series data mining
Engineering Applications of Artificial Intelligence
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Design of multiple classifier systems for time series data
MCS'05 Proceedings of the 6th international conference on Multiple Classifier Systems
A softened formulation of inductive learning and its use for coronary disease data
ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
A novel bit level time series representation with implication of similarity search and clustering
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
A clustering algorithm based on distinguishability for nominal attributes
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
Hi-index | 0.07 |
Many methods of dimensionality reduction of data series (time series) have been introduced over the past decades. Some of them rely on a symbolic representation of the original data, however in this case the obtained dimensionality reduction is not substantial. In this paper, we introduce a new approach referred to as Symbolic Essential Attributes Approximation (SEAA) to reduce the dimensionality of multidimensional time series. In such a way we form a new nominal representation of the original data series. The approach is based on the concept of data series envelopes and essential attributes generated by a multilayer neural network. The real-valued attributes are discretized, and in this way symbolic data series representation is formed. The SEAA generates a vector of nominal values of new attributes which form the compressed representation of original data series. The nominal attributes are synthetic, and while not being directly interpretable, they still retain important features of the original data series. A validation of usefulness of the proposed dimensionality reduction is carried out for classification and clustering tasks. The experiments have shown that even for a significant reduction of dimensionality, the new representation retains information about the data series sufficient for classification and clustering of the time series.