Discovering forward sequences from temporal data

  • Authors:
  • Yu-Chin Liu

  • Affiliations:
  • Department of Information Management, Shih Hsin University, 1, Lane 17, Sec. 1, Mu-Cha Rd., Wunshan Dist., Taipei 116, Taiwan, ROC

  • Venue:
  • Knowledge-Based Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Traditionally, sequence pattern mining has been used to mine items occurs in time sequences and items were deemed to be irrelevant to each other. However, in real applications, sequence items shown in a record may have some relation. For example, in mining students' learning portfolios, the learning progressions must contain learning objects with forward-directed relations. Namely, the learning objects (items) themselves are evidence of a pre-existing relationship. In addition, most sequence mining algorithms assume the sequence records in databases are all of the same age. Each data record is observed at the same starting and ending point. But, according to the occurrences of events in a given period, the lengths of some sequences are longer than others. Hence the sequences with longer time spans might contain longer patterns than those with shorter time spans. As a result, the frequency of possible patterns shown in longer sequences might be underestimated due to fewer of records. This research proposed two methods, FSP and FSP-LC, to analyze forward sequence data. Latter, a real-life database which records all employee progression histories in a large company was used to verify and explain the proposed methods. The experimental results show that the proposed methods can mine specific and longer sequences to further improve and re-design their personnel systems.