Clock synchronization for internet measurements: a clustering algorithm
Computer Networks: The International Journal of Computer and Telecommunications Networking
Learning States and Rules for Detecting Anomalies in Time Series
Applied Intelligence
Signal segmentation and modelling based on equipartition principle
DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
Modified Gath--Geva clustering for fuzzy segmentation of multivariate time-series
Fuzzy Sets and Systems
Running damage extraction technique for identifying fatigue damaging events
WSEAS Transactions on Mathematics
Evaluation of BIC and Cross Validation for model selection on sequence segmentations
International Journal of Data Mining and Bioinformatics
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Correlation based dynamic time warping of multivariate time series
Expert Systems with Applications: An International Journal
ACM Computing Surveys (CSUR)
Estimating the predominant number of clusters in a dataset
Intelligent Data Analysis
Hi-index | 0.00 |
Segmentation is a popular technique for discoveringstructure in time series data. We address the largely openproblem of estimating the number of segments that can bereliably discovered. We introduce a novel method for theproblem, called Pete. Pete is based on permutation testing.The problem is an instance of model (dimension) selection.The proposed method analyzes the possible overfitof a model to the available data rather than uses a termfor penalizing model complexity. In this respect the approachis more similar to cross-validation than regulariza-tionbased techniques (e.g., AIC, BIC, MDL, MML). Further,the method produces a p value for each increase in thenumber of segments. This gives the user an overview of thestatistical significance of the segmentations. We evaluatethe performance of the proposed method using both syntheticand real time series data. The experiments show thatpermutation testing gives realistic results about the numberof reliably identifiable segments and that it compares favorablywith the Monte Carlo cross-validation (MCCV) andcommonly used BIC criteria.