[pbs-devel] [PATCH proxmox-backup] chunk_store: insert_chunk: write chunk again if sizes don't match
Dominik Csapak
d.csapak at proxmox.com
Mon May 9 13:51:07 CEST 2022
On 5/9/22 13:34, Thomas Lamprecht wrote:
> On 09/05/2022 12:40, Dominik Csapak wrote:
>> if the on-disk size of a chunk is not correct, write it again when
>> inserting and log a warning.
>>
>> This is currently possible if PBS crashes, but the rename of the chunk
>> was flushed to disk, when the actual data was not.
>
> could be also interesting to note here that we basically got all data
> required to do that already anyway.
what exactly do you mean here? just adding
'since we already have the complete chunk data, we are able to overwrite it'
(or similar?)
>
> And I'd think that a verify would catch this too and rename it to .bad, albeit
> that can naturally be to late if one is unlucky, so still good to do.
yes a verify will trigger that, but as you said, that can be too late ;)
also any input on @fabians suggestion to bail out when the old_size != 0 but
!= new_size?
>
> small nit inline (can be probably just fixed up on apply)
>
>>
>> Suggested-by: Fabian Grünbichler <f.gruenbichler at proxmox.com>
>> Signed-off-by: Dominik Csapak <d.csapak at proxmox.com>
>> ---
>> pbs-datastore/src/chunk_store.rs | 23 ++++++++++++++++-------
>> 1 file changed, 16 insertions(+), 7 deletions(-)
>>
>> diff --git a/pbs-datastore/src/chunk_store.rs b/pbs-datastore/src/chunk_store.rs
>> index 8d7df513..93f56e8b 100644
>> --- a/pbs-datastore/src/chunk_store.rs
>> +++ b/pbs-datastore/src/chunk_store.rs
>> @@ -458,17 +458,29 @@ impl ChunkStore {
>>
>> let lock = self.mutex.lock();
>>
>> + let raw_data = chunk.raw_data();
>> + let encoded_size = raw_data.len() as u64
>> +
>> if let Ok(metadata) = std::fs::metadata(&chunk_path) {
>> - if metadata.is_file() {
>> - self.touch_chunk(digest)?;
>> - return Ok((true, metadata.len()));
>> - } else {
>> + if !metadata.is_file() {
>> bail!(
>> "Got unexpected file type on store '{}' for chunk {}",
>> self.name,
>> digest_str
>> );
>> }
>> + let new_len = metadata.len();
>> + if encoded_size == new_len {
>> + self.touch_chunk(digest)?;
>> + return Ok((true, new_len));
>> + } else {
>> + log::warn!(
>> + "chunk size mismatch on insert for {}: old {} - new {}",
>
> fyi: you can now use variable names directly in format strings:
>
> "chunk size mismatch on insert for {digest_str}: old {encoded_size} - new {new_len}",
>
> nit: why is one named a "size" and one a "len", if they're the same thing it'd be
> nice to have it consistent.
>
argh... 'new_len' is actually the old one. so it'll be
'encoded_size' and 'old_size'..
thanks for making me look again ^^
>> + digest_str,
>> + encoded_size,
>> + new_len
>> + );
>> + }
>> }
>>
>> let mut tmp_path = chunk_path.clone();
>> @@ -483,9 +495,6 @@ impl ChunkStore {
>> )
>> })?;
>>
>> - let raw_data = chunk.raw_data();
>> - let encoded_size = raw_data.len() as u64;
>> -
>> file.write_all(raw_data).map_err(|err| {
>> format_err!(
>> "writing temporary chunk on store '{}' failed for {} - {}",
>
More information about the pbs-devel
mailing list