[pbs-devel] Scheduler causing connectivity issues?

dea dea at corep.it
Fri Jul 8 11:40:40 CEST 2022


Yes, same behavior for me too.

release 2.2-3 is problematic (NFS storage connected to PBS)

Previous releases work fine


Thanks

Luca


Il 08/07/22 11:36, Jorge Boncompte ha scritto:
> El 8/7/22 a las 9:09, Thomas Lamprecht escribió:
>> Hi,
>>
>> On 07/07/2022 17:49, Mark Schouten wrote:
>>> We’re getting complaints that one of our PBS’es is periodically unreachable. After investigation if the network might be at fault (even though it’s handling about 5.5Gbit at night), we found that PBS is piling up waiting connections every minute, on the minute, as you can see below. You see the output of `date`, combined with `ss -np | grep -c 8007`, the number of active connections.
>>>
>>> At first I thought that pvestatd was ddossing PBS, but pvestatd seems to run more often than once in a minute.
>>>
>>> So stracing the API process, I found that that process is also just waiting for something; must be the proxy-process.
>>>
>>> grepping for ‘minute’ in the code, I stumbled upon the function `next_minute` in ./src/bin/proxmox-backup-proxy.rs. I’m not quite sure if I understand it correctly, but it seems that every minute, the scheduler is going to try and find out if it should be doing something.
>>>
>>> Drilling down on that in my strace-foo, I think I see quite some read/write/rename actions on jobstate-files. Which leads me to conclude that the proxy process is waiting for the scheduler..
>>>
>>> This is just guess-work, but you guys can surely find out better what’s going on than me.
>>>
>>> This PBS is running with 45 users and 67 datastores.
>>>
>>> Hope you guys can find something.. If I need to debug anything, let me know!
>> Thanks for the info, this already helps quite a bit. We'll look into it and re-check
>> with you if we need more info.
> 	Hi, We've been having a problem that resembles this one with several
> proxmox-backup-server 2.2.x versions. Our PBS stopped accepting backups
> jobs sometimes, but if we retried manully they started fine. The only
> message I could find was:
>
>    backup failed: could not activate storage 'XXXXXXX': XXXXXX: error
> fetching datastores - 500 Can't connect to XXXXXXXX:8007
>
> 	Restarting the proxy seemed to help to get it working one or two days
> more. We have reverted proxmox-backup-server to 2.1.8-1 and every is
> fine again.
>
> 	Regards.
>
>> cheers,
>> Thomas
>>
>>
>> _______________________________________________
>> pbs-devel mailing list
>> pbs-devel at lists.proxmox.com
>> https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
>
> _______________________________________________
> pbs-devel mailing list
> pbs-devel at lists.proxmox.com
> https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel




More information about the pbs-devel mailing list