On mining multi-time-interval sequential patterns

Ya Han Hu, Tony Cheng Kui Huang, Hui Ru Yang, Yen Liang Chen

Research output: Contribution to journal › Article › peer-review

31 Scopus citations

Abstract

Sequential pattern mining is essential in many applications, including computational biology, consumer behavior analysis, and web log analysis. Although sequential patterns can tell us which items are frequently purchased together and in what order, they cannot provide information about the time span between items for decision support. Previous studies dealing with this problem either set time constraints to restrict the patterns discovered or define time-intervals between two successive items to provide time information. However, the first approach falls short in providing clear time-interval information, while the second cannot discover time-interval information between two non-successive items in a sequential pattern. To provide more time-related knowledge, we define a new variant of time-interval sequential patterns, called multi-time-interval sequential patterns, which can reveal the time-intervals between all pairs of items in a pattern. We develop two efficient algorithms, MI-Apriori and MI-PrefixSpan, to solve this problem. The experimental results show that the MI-PrefixSpan algorithm is faster than the MI-Apriori algorithm, but the MI-Apriori algorithm has better scalability in long sequence data.
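To make the idea of time-intervals between all pairs of items concrete, the following minimal Python sketch labels every pair of events in a timestamped sequence with an interval index and checks, by brute force, whether a sequence supports a given pattern. It is an illustration only, not the MI-Apriori or MI-PrefixSpan algorithm; the interval boundaries, the function names, and the dictionary-based pattern encoding are all assumptions made for this example.

```python
from bisect import bisect_left
from itertools import combinations

# Hypothetical interval boundaries (in days), chosen only for illustration:
# I0: gap == 0, I1: 0 < gap <= 3, I2: 3 < gap <= 7, I3: gap > 7
BOUNDARIES = [0, 3, 7]


def interval_index(gap, boundaries=BOUNDARIES):
    """Return the index of the interval that contains a non-negative gap."""
    return bisect_left(boundaries, gap)


def pairwise_intervals(events, boundaries=BOUNDARIES):
    """Map every pair of positions (i, j), i < j, to an interval index.

    `events` is a list of (item, timestamp) tuples sorted by timestamp.
    """
    return {
        (i, j): interval_index(events[j][1] - events[i][1], boundaries)
        for i, j in combinations(range(len(events)), 2)
    }


def contains(sequence, items, intervals, boundaries=BOUNDARIES):
    """Brute-force check that `sequence` supports a pattern.

    `items` lists the pattern's items in order; `intervals` maps each pair
    of pattern positions (i, j), i < j, to the required interval index.
    """
    n = len(items)
    for positions in combinations(range(len(sequence)), n):
        # The chosen events must carry the pattern's items in order.
        if any(sequence[p][0] != items[k] for k, p in enumerate(positions)):
            continue
        # Every pair of chosen events must fall in the required interval.
        if all(
            interval_index(
                sequence[positions[j]][1] - sequence[positions[i]][1],
                boundaries,
            ) == intervals[(i, j)]
            for i, j in combinations(range(n), 2)
        ):
            return True
    return False


if __name__ == "__main__":
    sequence = [("a", 1), ("b", 3), ("c", 9)]
    print(pairwise_intervals(sequence))
    print(contains(sequence, ("a", "b", "c"), {(0, 1): 1, (0, 2): 3, (1, 2): 2}))
```

In this sketch the pair (a, c) receives its own interval label even though the two items are not adjacent, which is the information a pattern restricted to successive items would leave out. The exhaustive search over position combinations is only suitable for tiny inputs.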

Original language: English
Pages (from-to): 1112-1127
Number of pages: 16
Journal: Data and Knowledge Engineering
Volume: 68
Issue number: 10
State: Published - Oct 2009

Keywords

  • Data mining
  • Knowledge discovery
  • Multi-time-interval
  • Sequential pattern
  • Time-interval
