TY - JOUR
T1 - Techniques for finding similarity knowledge in OLAP reports
AU - Hsu, Kevin Chihcheng
AU - Li, Ming Zhong
PY - 2011/4
Y1 - 2011/4
N2 - On-line analytical processing (OLAP) is a common solution that modern enterprises use to generate, monitor, share, and administrate their analysis reports. When daily, weekly, and/or monthly reports are generated or published by the OLAP operators, all analyses on the contents of reports are left for the report readers. To discover hidden rules, similar reports, or trend inside the potentially huge amount of reports, the report readers can only rely on their smart eyes to find out any rules of such kinds. Data mining is a well-developed field for finding hidden rules inside the data itself. However, there are few techniques focus on finding hidden rules, similarity, or trend using OLAP reports as the unit of analysis. In this paper, we explore how to use data mining techniques on OLAP reports in order to automatically and effectively find the similarity knowledge of OLAP reports. We also address the appropriate presentation of this similarity knowledge to OLAP users. We compare the difference between traditional data mining and finding similarity knowledge from OLAP reports. We then proposed three methods (called OLAP-MDS, OLAP-CLU, and OLAP-M+C in this paper) to explore the effectiveness of discovering similarity knowledge from OLAP reports. Finally, we compare the pros and cons of the proposed three methods with experiments and conclude that the OLAP-M+C method should be the best in most cases.
AB - On-line analytical processing (OLAP) is a common solution that modern enterprises use to generate, monitor, share, and administrate their analysis reports. When daily, weekly, and/or monthly reports are generated or published by the OLAP operators, all analyses on the contents of reports are left for the report readers. To discover hidden rules, similar reports, or trend inside the potentially huge amount of reports, the report readers can only rely on their smart eyes to find out any rules of such kinds. Data mining is a well-developed field for finding hidden rules inside the data itself. However, there are few techniques focus on finding hidden rules, similarity, or trend using OLAP reports as the unit of analysis. In this paper, we explore how to use data mining techniques on OLAP reports in order to automatically and effectively find the similarity knowledge of OLAP reports. We also address the appropriate presentation of this similarity knowledge to OLAP users. We compare the difference between traditional data mining and finding similarity knowledge from OLAP reports. We then proposed three methods (called OLAP-MDS, OLAP-CLU, and OLAP-M+C in this paper) to explore the effectiveness of discovering similarity knowledge from OLAP reports. Finally, we compare the pros and cons of the proposed three methods with experiments and conclude that the OLAP-M+C method should be the best in most cases.
KW - Clustering
KW - Data mining
KW - MDS
KW - OLAP
KW - Similarity knowledge
UR - http://www.scopus.com/inward/record.url?scp=78650707296&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2010.09.033
DO - 10.1016/j.eswa.2010.09.033
M3 - 期刊論文
AN - SCOPUS:78650707296
SN - 0957-4174
VL - 38
SP - 3743
EP - 3756
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 4
ER -