[PVE-User] Message from PVE-HA-CRM

Gilberto Nunes gilberto.nunes32 at gmail.com
Fri Feb 5 16:26:04 CET 2016


> Also if you have no more VM under HA you can remove the HA resource config
>
> # rm /etc/pve/ha/resources.cfg

But if I do this, will it still work in the future when I need to
re-add the node I previously removed?
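
One cautious way to keep that option open is to copy the file somewhere outside /etc/pve before deleting it. The sketch below is only an illustration: the function name `backup_then_remove` and the backup directory are assumptions, not part of Proxmox, and keeping a copy does not by itself guarantee that HA can be restored later.

```shell
#!/bin/sh
# Hypothetical helper (not a Proxmox tool): copy a config file to a backup
# directory, then remove the original. Nothing is removed if the copy fails.
backup_then_remove() {
    src=$1
    backup_dir=$2
    # Refuse to continue if the source file does not exist.
    [ -f "$src" ] || { echo "no such file: $src" >&2; return 1; }
    mkdir -p "$backup_dir" || return 1
    # Keep a copy named <file>.bak in the backup directory.
    cp -p "$src" "$backup_dir/$(basename "$src").bak" || return 1
    rm "$src"
}

# Example call (the backup directory is an assumed location):
#   backup_then_remove /etc/pve/ha/resources.cfg /root/ha-backup
```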


> But the errors from the CRM should _not_ be related to a stopping VM, I'm
> pretty sure there's another cause for that :)

That is what I am trying to figure out.


> If you have the logs from around the time the VM stopped look if
> something strange happened.

Well... This happens late at night and, it seems to me, while the
backup is running...
I have noticed that on another PVE host some random VMs sometimes just
restart at night...
In any case, logs from Proxmox are very difficult to get, mainly because
they share the system syslog and I get a lot of garbage...
We have /var/log/pve and /var/log/pveproxy/ but nothing useful at
all....
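
To cut through the syslog noise, the HA lines can be separated from everything else. The following is a minimal sketch, assuming syslog-style lines like the ones quoted further down in this thread; the function names are made up for illustration. It keeps only pve-ha-crm / pve-ha-lrm entries and counts them per hour, which may help to compare them with the backup window.

```shell
#!/bin/sh
# Keep only HA manager lines (pve-ha-crm / pve-ha-lrm) from syslog-style
# input on stdin. Function names are hypothetical helpers.
filter_ha_lines() {
    grep -E 'pve-ha-(crm|lrm)\[[0-9]+\]:'
}

# Count HA lines per hour: output is "<count> <month> <day> <hour>".
count_ha_lines_per_hour() {
    filter_ha_lines \
        | awk '{ split($3, t, ":"); print $1, $2, t[1] }' \
        | sort | uniq -c
}

# Example calls:
#   filter_ha_lines < /var/log/syslog
#   count_ha_lines_per_hour < /var/log/syslog
# With the journal, a time window can be given directly:
#   journalctl -u pve-ha-lrm -u pve-ha-crm \
#       --since "2016-02-04 19:00" --until "2016-02-04 23:00"
```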

I will try another approach to solve these issues....

Thanks anyway....

2016-02-05 12:08 GMT-02:00 Thomas Lamprecht <t.lamprecht at proxmox.com>:

>
>
> On 02/05/2016 01:39 PM, Gilberto Nunes wrote:
>
> Sorry... I didn't mention it, but now I have just one node and have disabled
> the HA for that VM... There's no other VM.... Just one, with HA disabled...
>
>
> Ah okay, now I understand.
>
> Can you try:
>
> # systemctl restart pve-ha-lrm
>
> Also if you have no more VM under HA you can remove the HA resource config
>
> # rm /etc/pve/ha/resources.cfg
>
> So the CRM won't start as it sees that no resource is configured. :)
>
> But the errors from the CRM should _not_ be related to a stopping VM, I'm
> pretty sure there's another cause for that :)
>
> If you have the logs from around the time the VM stopped look if something
> strange happened. It could also be that there was an error in the VM OS
> itself?
>
>
>
> BTW, here the outputs...
>
> # systemctl status pve-ha-lrm
>
> systemctl status pve-ha-lrm
> ● pve-ha-lrm.service - PVE Local HA Ressource Manager Daemon
>    Loaded: loaded (/lib/systemd/system/pve-ha-lrm.service; enabled)
>    Active: active (running) since Wed 2016-01-20 08:12:07 BRST; 2 weeks 2
> days ago
>  Main PID: 1900 (pve-ha-lrm)
>    CGroup: /system.slice/pve-ha-lrm.service
>            └─1900 pve-ha-lrm
>
> Jan 20 08:12:07 proxmox01 pve-ha-lrm[1900]: starting server
> Jan 20 08:12:07 proxmox01 pve-ha-lrm[1900]: status change startup =>
> wait_for_agent_lock
>
>
> # ha-manager status
>
> ha-manager status
> quorum OK
> master proxmox01 (active, Fri Feb  5 10:37:56 2016)
> lrm proxmox01 (active, Fri Feb  5 10:38:02 2016)
>
> And the log attached
>
> Thanks a lot
>
>
>
>
>
> 2016-02-05 10:15 GMT-02:00 Thomas Lamprecht <t.lamprecht at proxmox.com>:
>
>> Hi,
>>
>> the obvious questions: what did you do before the error came to light?
>>
>> With this error the HA CRM cannot do any action (start, stop, migrate,
>> ...)
>>
>> What's the output of
>>
>> # systemctl status pve-ha-lrm
>>
>> # ha-manager status
>>
>> on this node (or both nodes), be sure that pve-ha-lrm is started!
>>
>> Also if no obvious error is visible please append the logs with:
>>
>> # journalctl -u pve-ha-lrm -u pve-ha-crm
>>
>> You can redirect the output to a file and if yo
>>
>> journalctl -u pve-ha-lrm -u pve-ha-crm > out.log
>>
>>
>>
>> On 02/05/2016 01:04 PM, Gilberto Nunes wrote:
>>
>> Hello list
>>
>> In the past, I had a cluster with two nodes plus storage with HA
>> enabled...
>> About 3 months ago, I noticed that the VM just stops, for no reason at all....
>> The PVE host remains alive and the storage too...
>>
>> I get this message in syslog:
>>
>> Feb  4 19:15:42 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 19:16:12 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 19:16:32 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 19:16:52 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 19:17:12 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 22:27:52 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>> Feb  4 22:28:02 proxmox01 pve-ha-crm[1894]: got unexpected error - can't
>> open '/etc/pve/nodes/proxmox01/lrm_status' - No such file or directory
>>
>> Would that be the reason the VM stopped???
>>
>>
>>
>>
>>
>> _______________________________________________
>> pve-user mailing list
>> pve-user at pve.proxmox.com
>> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user
>>
>>
>
>
> --
>
> Gilberto Ferreira
> +55 (47) 9676-7530
> Skype: gilberto.nunes36
>
>
>
>
>


-- 

Gilberto Ferreira
+55 (47) 9676-7530
Skype: gilberto.nunes36