[pbs-devel] [PATCH proxmox-backup v9 01/46] datastore: add helpers for path/digest to s3 object key conversion

Hannes Laimer h.laimer at proxmox.com
Mon Jul 21 16:20:25 CEST 2025


On Mon Jul 21, 2025 at 4:15 PM CEST, Christian Ebner wrote:
> On 7/21/25 3:58 PM, Hannes Laimer wrote:
>> On Sat Jul 19, 2025 at 2:49 PM CEST, Christian Ebner wrote:
>>> Adds helper methods to generate the s3 object keys given a relative
>>> path and filename for datastore contents or digest in case of chunk
>>> files.
>>>
>>> Regular datastore contents are stored by grouping them with a content
>>> prefix in the object key. In order to keep the object key length
>>> small, given the max limit of 1024 bytes {0], `.cnt` is used as
>>> content prefix. Chunks on the other hand are prefixed by `.chunks`,
>>> same as on regular datastores.
>>>
>>> The prefix allows for selective listing of either contents or chunks
>>> by providing the prefix to the respective api calls.
>>>
>>> [0] https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-keys.html
>>>
>>> Signed-off-by: Christian Ebner <c.ebner at proxmox.com>
>>> ---
>>> changes since version 8:
>>> - added unit tests for helper functions
>>>
>>>   Cargo.toml               |   1 +
>>>   pbs-datastore/Cargo.toml |   1 +
>>>   pbs-datastore/src/lib.rs |   1 +
>>>   pbs-datastore/src/s3.rs  | 114 +++++++++++++++++++++++++++++++++++++++
>>>   4 files changed, 117 insertions(+)
>>>   create mode 100644 pbs-datastore/src/s3.rs
>>>
>>> diff --git a/Cargo.toml b/Cargo.toml
>>> index adfa427d1..97783ddd5 100644
>>> --- a/Cargo.toml
>>> +++ b/Cargo.toml
>>> @@ -77,6 +77,7 @@ proxmox-rest-server = { version = "1", features = [ "templates" ] }
>>>   proxmox-router = { version = "3.2.2", default-features = false }
>>>   proxmox-rrd = "1"
>>>   proxmox-rrd-api-types = "1.0.2"
>>> +proxmox-s3-client = "1.0.0"
>>>   # everything but pbs-config and pbs-client use "api-macro"
>>>   proxmox-schema = "4"
>>>   proxmox-section-config = "3"
>>> diff --git a/pbs-datastore/Cargo.toml b/pbs-datastore/Cargo.toml
>>> index 56f6e9094..c42eff165 100644
>>> --- a/pbs-datastore/Cargo.toml
>>> +++ b/pbs-datastore/Cargo.toml
>>> @@ -34,6 +34,7 @@ proxmox-borrow.workspace = true
>>>   proxmox-human-byte.workspace = true
>>>   proxmox-io.workspace = true
>>>   proxmox-lang.workspace=true
>>> +proxmox-s3-client = { workspace = true, features = [ "impl" ] }
>>>   proxmox-schema = { workspace = true, features = [ "api-macro" ] }
>>>   proxmox-serde = { workspace = true, features = [ "serde_json" ] }
>>>   proxmox-sys.workspace = true
>>> diff --git a/pbs-datastore/src/lib.rs b/pbs-datastore/src/lib.rs
>>> index 5014b6c09..ffd0d91b2 100644
>>> --- a/pbs-datastore/src/lib.rs
>>> +++ b/pbs-datastore/src/lib.rs
>>> @@ -182,6 +182,7 @@ pub mod manifest;
>>>   pub mod paperkey;
>>>   pub mod prune;
>>>   pub mod read_chunk;
>>> +pub mod s3;
>>>   pub mod store_progress;
>>>   pub mod task_tracking;
>>>   
>>> diff --git a/pbs-datastore/src/s3.rs b/pbs-datastore/src/s3.rs
>>> new file mode 100644
>>> index 000000000..79e7548fb
>>> --- /dev/null
>>> +++ b/pbs-datastore/src/s3.rs
>>> @@ -0,0 +1,114 @@
>>> +use std::path::{Path, PathBuf};
>>> +
>>> +use anyhow::{bail, format_err, Error};
>>> +
>>> +use proxmox_s3_client::S3ObjectKey;
>>> +
>>> +/// Object key prefix to group regular datastore contents (not chunks)
>>> +pub const S3_CONTENT_PREFIX: &str = ".cnt";
>>> +
>>> +/// Generate a relative object key with content prefix from given path and filename
>>> +pub fn object_key_from_path(path: &Path, filename: &str) -> Result<S3ObjectKey, Error> {
>>> +    // Force the use of relative paths, otherwise this would loose the content prefix
>>> +    if path.is_absolute() {
>>> +        bail!("cannot generate object key from absolute path");
>>> +    }
>>> +    if filename.contains('/') {
>>> +        bail!("invalid filename containing slashes");
>>> +    }
>>> +    let mut object_path = PathBuf::from(S3_CONTENT_PREFIX);
>>> +    object_path.push(path);
>>> +    object_path.push(filename);
>> 
>> similarly to what I wrote on [18/46] with NS names maxed out this could
>> end up being longer than the max len for a key... not sure how to best
>> deal with that though. Limiting NS name len won't really work cause we
>> just read them from the FS...
>
> yes, but we cannot support object keys longer than this, and there is 
> not really another viable option. So the user will be forced to limit 
> namespace length anyways (also on sync sources for example).

yup
well, we could hash the path when making the key :S but I don't think we want that




More information about the pbs-devel mailing list