Backup and Replication > Archive Old Backups to Amazon S3 Glacier

When a user chooses to archive old backups to S3 in the backup job settings, it means that old backups that do not comply with retention policy settings will be copied to a special S3 bucket created by CDSB. This feature is available for both instances and standalone volumes.

1.An S3 bucket is created by CDSB automatically as soon as there is a need to place backups into it.

2.The name of the S3 bucket is “cdsb-<bucket ID>”. The Bucket ID is a unique key formed by the current AWS account and region.

3.The S3 bucket has the following structure:

ovol-<volume ID>

✍metadata

✍backups

•backup ID

ometadata

oblockdata

CDSB-U~1_img35

4.Backups of every volume are stored in a separate folder named as vol-<volume ID>.

5.The first backup of every volume is a full backup and the subsequent backups are incremental and store just changed blocks data.

6.For every volume in a backup job CDBS starts a temporary worker instance, which is responsible for copying data from this volume to S3. After the data is copied, a worker instance is terminated by CDSB.

7.A special lifecycle rule is automatically created for the CDSB bucket that regulates the transition of S3 bucket data to Amazon S3 Glacier.

CDSB-U~1_img36

By default, block data is copied to Amazon S3 Glacier immediately, metadata is not copied to Glacier under any circumstances.

CDSB-U~1_img37

You can change these settings if needed.

Backups that are stored in the Amazon S3 Glacier bucket are marked as archived in the Backups listing, as shown below.

CDSB-U~1_img38

The restore from archived backups can take a lot longer than restore from a backup based on an EC2 snapshot, especially if it was moved to Amazon S3 Glacier. Therefore, it is recommended to set a balance between storage price and recovery time.