Artykuł w czasopiśmie
Brak miniatury
Licencja

ClosedAccessDostęp zamknięty

InftyDedup: Scalable and Cost-Effective Cloud Tiering with Deduplication

Autor
Iwanicki, Konrad
Dubnicki, Cezary
Wełnicki, Michał
Lichota, Krzysztof
Jackowski, Andrzej
Kotlarska, Iwona
Data publikacji
2023
Abstrakt (EN)

Cloud tiering is the process of moving selected data from on-premise storage to the cloud, which has recently become important for backup solutions. As subsequent backups usually contain repeating data, deduplication in cloud tiering can significantly reduce cloud storage utilization, and hence costs. In this paper, we introduce InftyDedup, a novel system for cloud tiering with deduplication. Unlike existing solutions, it maximizes scalability by utilizing cloud services not only for storage but also for computation. Following a distributed batch approach with dynamically assigned cloud computation resources, InftyDedup can deduplicate multi-petabyte backups from multiple sources at costs on the order of a couple of dollars. Moreover, by selecting between hot and cold cloud storage based on the characteristics of each data chunk, our solution further reduces the overall costs by up to 26%-44%. InftyDedup is implemented in a state-of-the-art commercial backup system and evaluated in the cloud of a hyperscaler.

Dyscyplina PBN
informatyka
Strony od-do
33-48
Licencja otwartego dostępu
Dostęp zamknięty