TY - JOUR
T1 - A new approach for discovering fuzzy quantitative sequential patterns in sequence databases
AU - Chen, Yen Liang
AU - Huang, Tony Cheng Kui
PY - 2006/6/16
Y1 - 2006/6/16
N2 - Given a sequence database and minimum support threshold, the goal of mining quantitative sequential patterns is to discover the complete set of sequential patterns with purchased quantities in databases. Although this type of pattern can provide more information than the traditional sequential pattern, it also causes a sharp boundary problem. This means that when an item's quantity is close to the boundary of two adjacent quantity intervals, it is either ignored or overemphasized. In view of this weakness, a recent paper from Hong, Kuo, and Chi proposed a new kind of extended patterns, called fuzzy quantitative sequential patterns (FQSP), where an item's quantity in the pattern is represented by a fuzzy term rather than a quantity interval. In their work an Apriori-like algorithm was developed to mine all FQSP. In this paper, we propose a new and novel algorithm to mine FQSP based on the divide-and-conquer strategy. Since the proposed algorithm greatly reduces the candidate subsequence generation efforts, the performance is improved significantly. Experiments show that the proposed algorithm is much more efficient and scalable than the previous algorithm.
AB - Given a sequence database and minimum support threshold, the goal of mining quantitative sequential patterns is to discover the complete set of sequential patterns with purchased quantities in databases. Although this type of pattern can provide more information than the traditional sequential pattern, it also causes a sharp boundary problem. This means that when an item's quantity is close to the boundary of two adjacent quantity intervals, it is either ignored or overemphasized. In view of this weakness, a recent paper from Hong, Kuo, and Chi proposed a new kind of extended patterns, called fuzzy quantitative sequential patterns (FQSP), where an item's quantity in the pattern is represented by a fuzzy term rather than a quantity interval. In their work an Apriori-like algorithm was developed to mine all FQSP. In this paper, we propose a new and novel algorithm to mine FQSP based on the divide-and-conquer strategy. Since the proposed algorithm greatly reduces the candidate subsequence generation efforts, the performance is improved significantly. Experiments show that the proposed algorithm is much more efficient and scalable than the previous algorithm.
KW - Data mining
KW - Fuzzy sets
KW - Quantitative data
KW - Sequence data
KW - Sequential patterns
UR - http://www.scopus.com/inward/record.url?scp=33646238952&partnerID=8YFLogxK
U2 - 10.1016/j.fss.2006.02.008
DO - 10.1016/j.fss.2006.02.008
M3 - 期刊論文
AN - SCOPUS:33646238952
SN - 0165-0114
VL - 157
SP - 1641
EP - 1661
JO - Fuzzy Sets and Systems
JF - Fuzzy Sets and Systems
IS - 12
ER -