XML-aided phrase indexing for hypertext documents
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Cluster-based genetic segmentation of time series with DWT
Pattern Recognition Letters
UNDERTOW: multi-level segmentation of real-valued time series
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Lexical and grammatical inference
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
A new unsupervised approach to word segmentation
Computational Linguistics
Hi-index | 0.00 |
This paper describes an unsupervised algorithm forsegmenting categorical time series into episodes. TheVOTING-EXPERTS algorithm first collects statistics aboutthe frequency and boundary entropy of ngrams, then passesa window over the series and has two "expert methods" decidewhere in the window boundaries should be drawn. Thealgorithm successfully segments text into words in four languages.The algorithm also segments time series of robotsensor data into subsequences that represent episodes inthe life of the robot. We claim that VOTING-EXPERTSfinds meaningful episodes in categorical time series becauseit exploits two statistical characteristics of meaningfulepisodes.