Perfect hashing schemes for mining traversal patterns

Chin Chen Chang, Chih Yang Lin, Henry Chou

研究成果: 雜誌貢獻回顧評介論文同行評審

6 引文 斯高帕斯(Scopus)


Hashing schemes are a common technique to improve the performance in mining not only association rules but also sequential patterns or traversal patters. However, the collision problem in hash schemes may result in severe performance degradation. In this paper, we propose perfect hashing schemes for mining traversal patterns to avoid collisions in the hash table. The main idea is to transform each large itemsets into one large 2-itemset by employing a delicate encoding scheme. Then perfect hash schemes designed only for itemsets of length two, rather than varied lengths, are applied. The experimental results show that our method is more than twice as faster than FS algorithm. The results also show our method is scalable to database sizes. One variant of our perfect hash scheme, called partial hash, is proposed to cope with the enormous memory space required by typical perfect hash functions. We also give a comparison of the performances of different perfect hash variants and investigate their properties.

頁(從 - 到)185-202
期刊Fundamenta Informaticae
出版狀態已出版 - 2006


深入研究「Perfect hashing schemes for mining traversal patterns」主題。共同形成了獨特的指紋。