Discovering fuzzy time-interval sequential patterns in sequence databases

Yen Liang Chen, Tony Cheng Kui Huang

研究成果: 雜誌貢獻期刊論文同行評審

79 引文 斯高帕斯(Scopus)

摘要

Given a sequence database and minimum support threshold, the task of sequential pattern mining is to discover the complete set of sequential patterns in databases. From the discovered sequential patterns, we can know what items are frequently brought together and in what order they appear. However, they cannot tell us the time gaps between successive items in patterns. Accordingly, Chen et al. have proposed a generalization of sequential patterns, called time-interval sequential patterns, which reveals not only the order of items, but also the time intervals between successive items [9]. An example of time-interval sequential pattern has a form like (A, I2, B, I1, C), meaning that we buy A first, then after an interval of I2 we buy B, and finally after an interval of I1 we buy C, where I2 and I1 are predetermined time ranges. Although this new type of pattern can alleviate the above concern, it causes the sharp boundary problem. That is, when a time interval is near the boundary of two predetermined time ranges, we either ignore or overemphasize it. Therefore, this paper uses the concept of fuzzy sets to extend the original research so that fuzzy time-interval sequential patterns are discovered from databases. Two efficient algorithms, the fuzzy time interval (FTI)-Apriori algorithm and the FTI-PrefixSpan algorithm, are developed for mining fuzzy time-interval sequential patterns. In our simulation results, we find that the second algorithm outperforms the first one, not only in computing time but also in scalability with respect to various parameters.

原文???core.languages.en_GB???
頁(從 - 到)959-972
頁數14
期刊IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
35
發行號5
DOIs
出版狀態已出版 - 10月 2005

指紋

深入研究「Discovering fuzzy time-interval sequential patterns in sequence databases」主題。共同形成了獨特的指紋。

引用此