[pbs-devel] [PATCH proxmox-backup 2/3] partial fix #6049: datastore: use config fast-path in Drop

Fabian Grünbichler f.gruenbichler at proxmox.com
Wed Nov 12 12:24:52 CET 2025


On November 11, 2025 1:29 pm, Samuel Rufinatscha wrote:
> The Drop impl of DataStore re-read datastore.cfg to decide whether
> the entry should be evicted from the in-process cache (based on
> maintenance mode’s clear_from_cache). During the investigation of
> issue #6049 [1], a flamegraph [2] showed that the config reload in Drop
> accounted for a measurable share of CPU time under load.
> 
> This patch makes Drop O(1) on the fast path by reusing the maintenance-

I am not sure what the O(1) is referring to? This patch implements a
faster cache lookup in front of the (slow) config parsing variant, but
that doesn't really align well with what the "Big O" notation tries to
express ;)

The parsing below still scales with the number of datastores in the
config, after all. It can just be skipped sometimes :)
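
To illustrate the point, here is a minimal sketch (not the actual PBS
code). CachedTag, slow_path and should_evict are hypothetical names, and
the config is modelled as a plain slice instead of a parsed
datastore.cfg. The slow path still walks all entries whenever it runs,
the fast path merely avoids running it:

```rust
/// Hypothetical stand-in for the tag cached alongside a datastore entry.
struct CachedTag {
    generation: usize,
    clear_from_cache: bool,
}

/// Hypothetical slow path: walks every configured datastore, so its cost
/// grows with the number of entries (a stand-in for parsing datastore.cfg).
fn slow_path(config: &[(&str, bool)], name: &str) -> bool {
    config
        .iter()
        .find(|(n, _)| *n == name)
        .map(|(_, clear)| *clear)
        .unwrap_or(false)
}

/// Returns (evict, used_slow_path).
fn should_evict(
    tag: Option<&CachedTag>,
    current_generation: Option<usize>,
    config: &[(&str, bool)],
    name: &str,
) -> (bool, bool) {
    match (tag, current_generation) {
        // Fast path: the cached decision is still valid, no config walk.
        (Some(t), Some(g)) if t.generation == g => (t.clear_from_cache, false),
        // Anything else falls back to the walk over all entries.
        _ => (slow_path(config, name), true),
    }
}
```

So the worst case is unchanged, only the frequency of hitting it goes
down.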

> mode decision captured at lookup time and stored with the cached
> datastore entry. When the last reference goes away we:
> - decrement active-operation counters, and
> - evict only if the cached decision mandates eviction.
> 
> If the cache tag is absent or not fresh, a subsequent slow-path lookup
> will be performed.
> 
> Testing
> 
> Compared flamegraphs before and after: prior to this change
> (on top of patch 1), stacks originating from Drop included
> pbs_config::datastore::config(). After the change, those vanish from
> the drop path.
> 
> An end-to-end benchmark using `/status?verbose=0` with 1000 datastores,
> 5 requests per store, and 16-way parallelism shows a further
> improvement:
> 
> | Metric                  | After commit 1 | After commit 2 | Δ (abs) | Δ (%)   |
> |-------------------------|:--------------:|:--------------:|:-------:|:-------:|
> | Total time              | 11s            | 10s            | −1s     | −9.09%  |
> | Throughput (all rounds) | 454.55         | 500.00         | +45.45  | +10.00% |
> | Cold RPS (round #1)     | 90.91          | 100.00         | +9.09   | +10.00% |
> | Warm RPS (rounds 2..N)  | 363.64         | 400.00         | +36.36  | +10.00% |
> 
> Optimizing Drop improves overall throughput by ~10%. The gain appears
> in both cold and warm rounds, and the flamegraph confirms the config
> reload no longer sits on the hot path.
> 
> Links
> 
> [1] Bugzilla: https://bugzilla.proxmox.com/show_bug.cgi?id=6049
> [2] cargo-flamegraph: https://github.com/flamegraph-rs/flamegraph
> 
> Fixes: #6049
> Signed-off-by: Samuel Rufinatscha <s.rufinatscha at proxmox.com>
> ---
>  pbs-datastore/src/datastore.rs | 31 +++++++++++++++++++++++++++----
>  1 file changed, 27 insertions(+), 4 deletions(-)
> 
> diff --git a/pbs-datastore/src/datastore.rs b/pbs-datastore/src/datastore.rs
> index 18eebb58..da80416a 100644
> --- a/pbs-datastore/src/datastore.rs
> +++ b/pbs-datastore/src/datastore.rs
> @@ -200,15 +200,38 @@ impl Drop for DataStore {
>              // remove datastore from cache iff
>              //  - last task finished, and
>              //  - datastore is in a maintenance mode that mandates it
> -            let remove_from_cache = last_task
> -                && pbs_config::datastore::config()
> +
> +            // first check: check if last task finished
> +            if !last_task {
> +                return;
> +            }
> +
> +            let cached_tag = self.inner.cached_config_tag.as_ref();
> +            let last_gen_num = cached_tag.and_then(|c| c.last_generation);
> +            let gen_num = ConfigVersionCache::new()
> +                .ok()
> +                .map(|c| c.datastore_generation());
> +
> +            let cache_is_fresh = match (last_gen_num, gen_num) {
> +                (Some(a), Some(b)) => a == b,
> +                _ => false,
> +            };

This is just last_gen_num == gen_num plus a check that either is Some.
If we make the tag always contain a generation instead of an Option, we
can simplify this code ;)
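
A rough sketch of what I mean, with hypothetical helper names and
assuming the generation is a usize (the exact type does not matter for
the argument):

```rust
/// The form used in the patch.
fn fresh_match(last: Option<usize>, current: Option<usize>) -> bool {
    match (last, current) {
        (Some(a), Some(b)) => a == b,
        _ => false,
    }
}

/// Equivalent: Option equality plus a Some check. The check is needed
/// because None == None would otherwise count as fresh.
fn fresh_eq(last: Option<usize>, current: Option<usize>) -> bool {
    last.is_some() && last == current
}

/// If the tag always carries a generation, a single comparison is enough.
fn fresh_plain(last: usize, current: Option<usize>) -> bool {
    current == Some(last)
}
```

All three agree whenever the tag has a generation, the last one just
does not need to handle the missing case at all.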

> +
> +            let mm_mandate = if cache_is_fresh {
> +                cached_tag
> +                    .and_then(|c| c.last_maintenance_mode.as_ref())
> +                    .is_some_and(|m| m.clear_from_cache())
> +            } else {
> +                pbs_config::datastore::config()
>                      .and_then(|(s, _)| s.lookup::<DataStoreConfig>("datastore", self.name()))
>                      .is_ok_and(|c| {
>                          c.get_maintenance_mode()
>                              .is_some_and(|m| m.clear_from_cache())
> -                    });
> +                    })
> +            };
>  
> -            if remove_from_cache {
> +            // second check: check maintenance mode mandate
> +            if mm_mandate {
>                  DATASTORE_MAP.lock().unwrap().remove(self.name());
>              }
>          }
> -- 
> 2.47.3
> 
> 
> 
> _______________________________________________
> pbs-devel mailing list
> pbs-devel at lists.proxmox.com
> https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
> 



