Constraint-based sequential pattern mining: The consideration of recency and compactness

Research output: Contribution to journalArticlepeer-review

48 Scopus citations

Abstract

Sequential pattern mining is an important data-mining method for determining time-related behavior in sequence databases. The information obtained from sequential pattern mining can be used in marketing, medical records, sales analysis, and so on. Existing methods only focus on the concept of frequency because of the assumption that sequences' behaviors do not change over time. The environment from which the data is generated is often dynamic, however, so the sequences' behaviors may change over time. To adapt the discovered patterns to these changes, two new concepts, recency and compactness, are incorporated into traditional sequential pattern mining. The concept of recency causes patterns to quickly adapt to the latest behaviors in sequence databases, while the concept of compactness ensures reasonable time spans for the discovered patterns. We named the new patterns CFR-patterns because three concepts (compactness, frequency, and recency) are simultaneously considered. An efficient method is presented to find CFR-patterns. Empirical evaluation shows that the proposed methods are computationally efficient and that they are more advantageous than traditional methods when sequences' behaviors change over time.

Original languageEnglish
Pages (from-to)1203-1215
Number of pages13
JournalDecision Support Systems
Volume42
Issue number2
DOIs
StatePublished - Nov 2006

Keywords

  • Constraint-based mining
  • Sequential pattern
  • Temporal database

Fingerprint

Dive into the research topics of 'Constraint-based sequential pattern mining: The consideration of recency and compactness'. Together they form a unique fingerprint.

Cite this