Clustering transactions with an unbalanced hierarchical product structure

Min Tzu Wang, Ping Yu Hsu, K. C. Lin, Shiuann Shuoh Chen

研究成果: 書貢獻/報告類型會議論文篇章同行評審

4 引文 斯高帕斯(Scopus)

摘要

The datasets extracted from large retail stores often contain sparse information composed of a huge number of items and transactions, with each transaction only containing a few items. These data render basket analysis with extremely low item support, customer clustering with large intra cluster distance and transaction classifications having huge classification trees. Although a similarity measure represented by counting the depth of the least common ancestor normalized by the depth of the concept tree lifts the limitation of binary equality, it produces counter intuitive results when the concept hierarchy is unbalanced since two items in deeper subtrees are very likely to have a higher similarity than two items in shallower subtrees. The research proposes to calculate the distance between two items by counting the edge traversal needed to link them in order to solve the issues. The method is straight forward yet achieves better performance with retail store data when concept hierarchy is unbalanced.

原文???core.languages.en_GB???
主出版物標題Data Warehousing and Knowledge Discovery - 9th International Conference, DaWaK 2007, Proceedings
發行者Springer Verlag
頁面251-261
頁數11
ISBN(列印)9783540745525
DOIs
出版狀態已出版 - 2007
事件9th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2007 - Regensburg, Germany
持續時間: 3 9月 20077 9月 2007

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
4654 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???9th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2007
國家/地區Germany
城市Regensburg
期間3/09/077/09/07

指紋

深入研究「Clustering transactions with an unbalanced hierarchical product structure」主題。共同形成了獨特的指紋。

引用此