Char: An automatic way to describe characteristics of data

Yu Chin Liu, Ping Yu Hsu

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

Abstract

As e-business software prevails worldwide, large amounts of data accumulate automatically in the databases of most sizable companies. Managers in organizations now face the problem of making sense of these data. This paper proposes an algorithm that automatically produces characteristic rules describing the major characteristics of the data in a table. In contrast to traditional Attribute Oriented Induction methods, the algorithm, named Char, does not need a concept tree and requires only a desired coverage threshold to generate a minimal set of characteristic rules describing the given dataset. Our simulation results show that the characteristic rules found by Char remain fairly consistent even as the numbers of records and attributes increase.
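To make the notion of a coverage threshold concrete, the following is a minimal sketch and not the Char algorithm itself, whose details the abstract does not give. It assumes a rule is simply a full combination of attribute values and greedily keeps the most frequent combinations until the requested fraction of records is covered. The function name `greedy_rules` and the sample attributes `region` and `tier` are hypothetical.

```python
from collections import Counter


def greedy_rules(records, coverage_threshold):
    """Illustrative only: keep the most frequent attribute-value
    combinations until they cover the requested fraction of records.
    This is a stand-in for the idea of a coverage threshold, not the
    method proposed in the paper."""
    if not records:
        return [], 0.0
    # Count identical records; sorting the items makes each key order-independent.
    counts = Counter(tuple(sorted(r.items())) for r in records)
    total = len(records)
    rules, covered = [], 0
    for combo, n in counts.most_common():
        if covered / total >= coverage_threshold:
            break  # threshold reached; stop adding rules
        rules.append(dict(combo))
        covered += n
    return rules, covered / total


# Hypothetical sample table with two attributes.
records = (
    [{"region": "north", "tier": "gold"}] * 3
    + [{"region": "south", "tier": "gold"}]
    + [{"region": "north", "tier": "silver"}]
)
rules, coverage = greedy_rules(records, 0.5)
print(rules, coverage)
```

Raising the threshold makes the sketch admit further, less frequent combinations, which mirrors how a single coverage parameter can control the size of the rule set.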

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors: Zheng Rong Yang, Richard Everson, Hujun Yin
Publisher: Springer Verlag
Pages: 366-376
Number of pages: 11
ISBN (Print): 3540228810, 9783540228813
State: Published - 2004

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3177
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349
