LaDy: Enabling Locality-aware Deduplication Technology on Shingled Magnetic Recording Drives

Jung Hsiu Chang, Tzu Yu Chang, Yi Chao Shih, Tseng Yi Chen

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The continuous increase in data volume has led to the adoption of shingled-magnetic recording (SMR) as the primary technology for modern storage drives. This technology offers high storage density and low unit cost but introduces significant performance overheads due to the read-update-write operation and garbage collection (GC) process. To reduce these overheads, data deduplication has been identified as an effective solution as it reduces the amount of written data to an SMR-based storage device. However, deduplication can result in poor data locality, leading to decreased read performance. To tackle this problem, this study proposes a data locality-aware deduplication technology, LaDy, that considers both the overheads of writing duplicate data and the impact on data locality to determine whether the duplicate data should be written. LaDy integrates with DiskSim, an open-source project, and modifies it to simulate an SMR-based drive. The experimental results demonstrate that LaDy can significantly reduce the response time in the best-case scenario by 87.3% compared with CAFTL on the SMR drive. LaDy achieves this by selectively writing duplicate data, which preserves data locality, resulting in improved read performance. The proposed solution provides an effective and efficient method for mitigating the performance overheads associated with data deduplication in SMR-based storage devices.

Original languageEnglish
Article number127
JournalACM Transactions on Embedded Computing Systems
Volume22
Issue number5 s
DOIs
StatePublished - 9 Sep 2023

Keywords

  • SMR
  • Shingled magnetic recording
  • data deduplication
  • disk technology
  • locality

Fingerprint

Dive into the research topics of 'LaDy: Enabling Locality-aware Deduplication Technology on Shingled Magnetic Recording Drives'. Together they form a unique fingerprint.

Cite this