[pbs-devel] [PATCH proxmox-backup] tape: wait for calibration of LTO-9 tapes
Dominik Csapak
d.csapak at proxmox.com
Wed Apr 16 08:22:33 CEST 2025
On 4/15/25 17:51, Thomas Lamprecht wrote:
> (re-send with list in CC, sorry)
>
> On 15/04/2025 13:40, Dominik Csapak wrote:
>> Since LTO-9, initial loading of tapes into a drive can block up to 2
>> hours according to the spec. In case we run into a ready check timeout,
>> query the drive, and increase the timeout to 2 hours and 5 minutes if
>> it's calibrating (5 minutes headroom).
>
> Is there any (spec) reference we can link here?
To my knowledge the LTO spec is not published in a standard format like
e.g. other RFC or such, but one can find the IBM and HP LTO
SCSI references rather easily [0][1]
as for the timeout, IBM says it only in their recommendations:
> Although most optimizations will complete within 60 minutes some optimizations may take up to 2
hours.
and HP:
> Media initialization adds a variable amount of time to the initialization process that typically
takes between 20 minutes and 2 hours.
So it seems there not a hard limit and depends...
In my tests it always took around the 1 hour mark (on available hardware here)
>
> And the 2h5m would be on top of the previous max_wait, AFAICT.
>
no actually. i overwrite the old 'max_wait' variable and we calculate
from the same start point (with start.elapsed()) we only check if that
is greater than the timeout, so I increase *to* 2h5m
> And, didn't we already fix something like this? Or at least had some
> other handling added for long initial initialisation of LTO9 tapes, or
> was that something different or just a discussion without a patch.
> Not really important though, just wondering.
Sorry I should have added more context to the commit message. We did in fact
increase the timeout, but only for long formatting, and put a section
in the docs explaining the initialization and that it's recommended
to do that beforehand with the vendor tools (IMHO it still is
even with this patch)
Still, forum users encountered timeout issues e.g. when using
barcode labeling and not initializing beforehand, so I thought
adding a general handling for that in the 'wait_until_ready'
(where most of the timeouts from this will occur) makes sense
to decrease the friction of the UX (even if it takes longer to finish).
>
> Changes look alright besides of those meta things, so I could fleece
> any relevant info into the commit message too on applying, if nothing
> else comes up.
>
thanks!
0: IBM LTO-9
https://www.ibm.com/support/pages/system/files/inline-files/LTO%20SCSI%20Reference_GA32-0928-05%20(EXTERNAL)_0.pdf
1: HP LTO-9
https://support.hpe.com/hpesc/public/docDisplay?docId=sd00001239en_us&page=GUID-D7147C7F-2016-0901-0921-000000000450.html
More information about the pbs-devel
mailing list