[pbs-devel] [PATCH v2 proxmox-backup 1/2] docs: add security implications of prune and change detection mode
Christian Ebner
c.ebner at proxmox.com
Thu Nov 14 10:43:41 CET 2024
On 11/14/24 10:25, Gabriel Goller wrote:
> On 13.11.2024 16:55, Christian Ebner wrote:
>> diff --git a/docs/maintenance.rst b/docs/maintenance.rst
>> index 4bb135e4e..e8a26d69c 100644
>> --- a/docs/maintenance.rst
>> +++ b/docs/maintenance.rst
>> @@ -6,8 +6,34 @@ Maintenance Tasks
>> Pruning
>> -------
>>
>> -Prune lets you specify which backup snapshots you want to keep.
>> -The following retention options are available:
>> +Prune lets you specify which backup snapshots you want to keep, removing others.
>> +When pruning a snapshot, only the snapshot metadata (manifest, indices, blobs,
>> +log and notes) is removed. The chunks containing the actual backup data and
>> +previously referenced by the pruned snapshot, have to be removed by a garbage
>> +collection run.
>> +
>> +.. Caution:: Take into consideration that sensitive information stored in a
>> + given data chunk will outlive pruned snapshots and remain present in the
>> + datastore as long as referenced by at least one backup snapshot. Further,
>> + *even* if no snapshot references a given chunk, it will remain present until
>> + removed by the garbage collection.
>> +
>> + Further, file-level backups created using the change detection mode
>
> Second sentence that begins with 'Further' – maybe substitute this one
> with 'Moreover' or 'Additionally' so it reads better.
True, `Moreover` sounds better to me...
>
>> + `metadata` can reference backup chunks containing files which have vanished
>
> use double backticks here to highlight correctly, so: ``metadata``.
Acked, thx!
>
>> + since the previous backup, but might still be accessible when reading the
>> + chunks raw data is possible (client or server side).
>
> This sentence is a bit messy and long, maybe we could rewrite it as:
>
> Moreover, file-level backups created using the change detection mode
> ``metadata`` can reference backup chunks containing files which have
> vanished since the previous backup. These might still be accessible
Yes, I do agree that splitting this into two sentences makes it easier
to read. In that case I would even suggest explicitly mentioning that
this refers to the files, not just the chunks, e.g. by:
...
vanished since the previous backup. These files might still be accessible.
...
> by reading the raw data (client or server side).
>
>> + To remove chunks containing sensitive data, prune any snapshot made while the
>> + data was part of the backup input and run a garbage collection. Further, if
>> + using file-based backups with change detection mode `metadata`, additionally
>
> s/`metadata`/``metadata``/
Acked, thx!
>
>> + prune all snapshots since the sensitive data was no longer part of the backup
>> + input and run a garbage collection.
>> +
>> + The no longer referenced chunks will then be marked for deletion on the next
>> + garbage collection run and removed by a subsequent run after the grace
>> + period.
>> +
>> +The following retention options are available for pruning:
>>
>> ``keep-last <N>``
>> Keep the last ``<N>`` backup snapshots.
>
> Everything else is fine!
OK, will send a new version, thx!