TY - JOUR
T1 - Mining typical patterns from databases
AU - Hu, Hui Ling
AU - Chen, Yen Liang
PY - 2008/10/1
Y1 - 2008/10/1
N2 - There have been many approaches used to discover useful information patterns from databases, such as concept description, associations, sequential patterns, classification, clustering, and deviation detection. This paper proposes a new type of information pattern, called a typical pattern, which is a small subset of objects selected from a large dataset that provides a compact and suitable representation of the original dataset. The Typical Patterns Mining (TPM) algorithm is developed to mine typical patterns from databases. Extensive experiments are carried out using a real dataset to demonstrate the usefulness of typical patterns in practical situations. The experimental results indicate that TPM is a computationally efficient method and that typical patterns can provide a compact and suitable representation of the original dataset.
AB - There have been many approaches used to discover useful information patterns from databases, such as concept description, associations, sequential patterns, classification, clustering, and deviation detection. This paper proposes a new type of information pattern, called a typical pattern, which is a small subset of objects selected from a large dataset that provides a compact and suitable representation of the original dataset. The Typical Patterns Mining (TPM) algorithm is developed to mine typical patterns from databases. Extensive experiments are carried out using a real dataset to demonstrate the usefulness of typical patterns in practical situations. The experimental results indicate that TPM is a computationally efficient method and that typical patterns can provide a compact and suitable representation of the original dataset.
KW - Clustering
KW - Data mining
KW - Typical patterns mining
UR - http://www.scopus.com/inward/record.url?scp=50949119270&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2008.05.036
DO - 10.1016/j.ins.2008.05.036
M3 - 期刊論文
AN - SCOPUS:50949119270
SN - 0020-0255
VL - 178
SP - 3683
EP - 3696
JO - Information Sciences
JF - Information Sciences
IS - 19
ER -