BOSS: An Efficient Data Distribution Strategy for Object Storage Systems with Hybrid Devices

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Hybrid object storage systems provide opportunities to achieve high performance and energy efficiency with low cost for enterprise data centers. Existing object storage systems, however, distribute data objects in the system without considering the heterogeneity of the underlying devices and the asymmetric data access patterns. Therefore, the system performance and energy efficiency may degrade as data are placed on improper storage devices. For example, energy-efficient high-density archive hard disk drives (archive HDDs) are significantly slower than normal HDDs and solid state disks (SSDs), which mean that the archive HDDs are not appropriate for storing frequently accessed objects. Besides, flash-based SSDs have limited write endurance, which makes SSDs vulnerable for storing write-intensive objects. In this paper, we analyze various real enterprise workloads and find that read and write requests are not uniformly distributed to data objects. Based on the observations, we propose a novel strategy, biased object storage strategy (BOSS), to reduce writes to SSDs and improve system performance for hybrid object storage systems. Different from conventional uniform and fixed data distribution strategies, the BOSS can distribute and migrate data objects to various types of devices dynamically, according to the data access patterns collected online. The experimental results show that the BOSS can reduce 64% of writes on SSDs and improve system performance by 29.51% on average, while maintaining a high level of load balance.

Original languageEnglish
Article number8025645
Pages (from-to)23979-23993
Number of pages15
JournalIEEE Access
Volume5
DOIs
StatePublished - 3 Sep 2017

Keywords

  • Ceph
  • Enterprise storage
  • data distribution
  • hybrid storage systems
  • object storage

Fingerprint

Dive into the research topics of 'BOSS: An Efficient Data Distribution Strategy for Object Storage Systems with Hybrid Devices'. Together they form a unique fingerprint.

Cite this