[PVE-User] MTU size changed on a running cluster
Marco M. Gabriel
marcomgabriel at gmail.com
Wed Nov 23 16:16:27 CET 2016
Hi there,
on a productive 5 node Proxmox VE Ceph cluster, we experienced some strange
behaviour:
Based on
http://pve.proxmox.com/wiki/Open_vSwitch#Example_2:_Bond_.2B_Bridge_.2B_Internal_Ports
we
have an internal network for cluster/corosync communication and another
internal network for Ceph Storage traffic. The Ceph OVS bridge was set to
MTU 9000 in /etc/network/interfaces and ran without a problem since a week.
Today we've seen Ceph errors like "x requests are blocked > 32 sec".
After a troubleshooting, we's seen that packets got dropped because they
were > 1500 bytes on the Ceph interface. That was strange as we had set
them to MTU 9000 and it was running since a week.
We checked the Interfaces and on two nodes, we saw a MTU of 1500 while the
other three nodes still had MTU 9000.
Has anybody experiences something like that? I read that an OVS bridge
automatically sets it's own MTU according to the lowest MTU of the member
interfaces, but I am not sure if this could be a problem here.
Any hints appreciated,
Marco
More information about the pve-user
mailing list