A grain preservation translation algorithm: From ER diagram to multidimensional model

Yen Ting Chen, Ping Yu Hsu

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Many IT practitioners and researchers advocate that data models of data warehouses should incorporate the sources of their data in order to achieve maximum efficiency. As the source data are probably derived from system designed with ER diagrams, a great deal of research has been devoted to the design of methodologies for building multidimensional models based on source ER diagrams. However, to the best of our knowledge, no algorithm has been proposed that can systematically translate an entire ER diagram into a multidimensional model with hierarchical snowflake structures. In this paper, we propose an algorithm that achieves the above goal because it incorporates two features, namely, grain preservation and the minimal distance from each dimension table to the fact table. The grain preservation feature guarantees that the translated multidimensional model will maintain cohesive granularity among the entities. Meanwhile, the minimal distance feature guarantees that if an entity can be connected to the fact table in the multidimensional model by more than one path, the path with the smallest number of hops will always be chosen. The first feature is derived by translating ambiguous relationships between entities into weighting factors stored in bridge tables and enhancing fact tables with unique primary keys. The second feature results from including a revised shortest path algorithm in the translating algorithm, with the distance being calculated as the number of relationships required between entities. A prototype system based on the methodology is also developed, and snapshots of the screens used for the system's execution are presented.

Original languageEnglish
Pages (from-to)3679-3695
Number of pages17
JournalInformation Sciences
Volume177
Issue number18
DOIs
StatePublished - 15 Sep 2007

Keywords

  • Data warehouse
  • Entity relationship diagram
  • Grain preservation
  • Multidimensional models
  • Star schema

Fingerprint

Dive into the research topics of 'A grain preservation translation algorithm: From ER diagram to multidimensional model'. Together they form a unique fingerprint.

Cite this