[PVE-User] Ceph: sudden slow ops, freezes, and slow-downs

Branislav Viest info at branoviest.com
Thu Jun 23 10:24:10 CEST 2022


OSD1,3,5,2,8 

All drives are Samsung NVMe 

Model Number: SAMSUNG MZQLB1T9HAJR-00007 

According to SMART value "Percentage Used", all are up to 10%. All SMART overall-health self-assessment test result are PASSED. 

------------ 
Best Regards 

Branislav Brian Viest 

------------ 
Legal Disclaimer: This e-mail and any attached files are confidential and may be legally privileged. If you are not the addressee, any disclosure, reproduction, copying, distribution, or other dissemination or use of this communication is strictly prohibited. If you have received this transmission in error please notify the sender immediately and then delete this e-mail. The sender does not accept liability for the correct and complete transmission of the information, nor for any delay or interruption of the transmission, nor for damages arising from the use of or reliance on the information. All e-mail messages addressed to, received or sent by sender are deemed to be professional in nature. Accordingly, the sender or recipient of these messages agrees that they may be read by other sender employees than the official recipient or sender in order to ensure the continuity of work-related activities and allow supervision thereof. 


Od: "Eneko Lacunza" <elacunza at binovo.es> 
Komu: "Branislav Viest" <info at branoviest.com>, "Proxmox VE user list" <pve-user at lists.proxmox.com> 
Odoslané: štvrtok, 23. jún 2022 9:22:08 
Predmet: Re: [PVE-User] Ceph: sudden slow ops, freezes, and slow-downs 

Hi, 

What numbers are those 5 OSDs? 

Hace you checked SSD drive manufacturer and models? 

El 23/6/22 a las 8:54, Branislav Viest escribió: 



Hello,

ID  CLASS  WEIGHT    TYPE NAME       STATUS  REWEIGHT  PRI-AFF
-1         15.52213  root default                             
-3          5.18097      host node1                           
 0    ssd   1.72699          osd.0       up   1.00000  1.00000
 1    ssd   1.72699          osd.1       up   1.00000  1.00000
 2    ssd   1.72699          osd.2       up   1.00000  1.00000
-5          3.45398      host node2                           
 3    ssd   1.72699          osd.3       up   1.00000  1.00000
 5    ssd   1.72699          osd.5       up   1.00000  1.00000
-7          1.70740      host node3                           
 6    ssd   0.85370          osd.6       up   1.00000  1.00000
 7    ssd   0.85370          osd.7       up   1.00000  1.00000
-9          5.17978      host node4                           
 8    ssd   1.72659          osd.8       up   1.00000  1.00000
 9    ssd   1.72659          osd.9       up   1.00000  1.00000
10    ssd   1.72659          osd.10      up   1.00000  1.00000

Since slow ops are reported the most of the time within multiple OSDs, I did not try to perform tests with some OSDs out. 

Now I check the logs from the last 2-3 days and slow ops are reported mostly on the 5 OSDs from total 10. 

------------
Best Regards

Branislav Brian Viest
------------
Legal Disclaimer: This e-mail and any attached files are confidential and may be legally privileged. If you are not the addressee, any disclosure, reproduction, copying, distribution, or other dissemination or use of this communication is strictly prohibited. If you have received this transmission in error please notify the sender immediately and then delete this e-mail. The sender does not accept liability for the correct and complete transmission of the information, nor for any delay or interruption of the transmission, nor for damages arising from the use of or reliance on the information. All e-mail messages addressed to, received or sent by sender are deemed to be professional in nature. Accordingly, the sender or recipient of these messages agrees that they may be read by other sender employees than the official recipient or sender in order to ensure the continuity of work-related activities and allow supervision thereof.

----- Pôvodná správa -----
Od: "Eneko Lacunza via pve-user" [ mailto:pve-user at lists.proxmox.com | <pve-user at lists.proxmox.com> ] Komu: "pve-user" [ mailto:pve-user at lists.proxmox.com | <pve-user at lists.proxmox.com> ] Kópia: "Eneko Lacunza" [ mailto:elacunza at binovo.es | <elacunza at binovo.es> ] Odoslané: štvrtok, 23. jún 2022 8:29:40
Predmet: Re: [PVE-User] Ceph: sudden slow ops, freezes, and slow-downs

_______________________________________________
pve-user mailing list [ mailto:pve-user at lists.proxmox.com | pve-user at lists.proxmox.com ] [ https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user | https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user ] 



Eneko Lacunza
Zuzendari teknikoa | Director técnico
Binovo IT Human Project

Tel. +34 943 569 206 | [ https://www.binovo.es/ | https://www.binovo.es ] Astigarragako Bidea, 2 - 2º izda. Oficina 10-11, 20180 Oiartzun [ https://www.youtube.com/user/CANALBINOVO | https://www.youtube.com/user/CANALBINOVO ] [ https://www.linkedin.com/company/37269706/ | https://www.linkedin.com/company/37269706/ ] 




More information about the pve-user mailing list