Vacuums
All Databricks Delta Lake tables that are created and managed by Etleap are vacuumed once per day.
Vacuums remove any files from the Delta Lake table’s directory that Delta does not manage or that contain data that is no longer in the destination table. Etleap uses Databrick’s default retention period of 7 days, only data deleted more than 7 days ago will be removed from the underlying storage. Running daily vacuums prevents your S3 buckets from growing in size due to unused data.
Running vacuums on Delta Lake tables prevents users from time traveling to a version before the retention period of the last vacuum. This means that the option to time travel on tables managed by Etleap is restricted to the last 7 days.