Error hosted-engine: Engine status: {"reason": "bad vm status", "health": "bad", "vm": "down_unexpected", "detail": "Down"}

Hello. I have a 4 host infrastructure in ovirt and a few days ago the hosted-engine was turned off and I cannot turn it on from any host. Any ideas? Thanks. --== Host host1.myhost.com (id: 7) status ==-- conf_on_shared_storage : True Status up-to-date : True Hostname : host1.myhost.com Host ID : 7 Engine status : {"reason": "bad vm status", "health": "bad", "vm": "down_unexpected", "detail": "Down"} Score : 0 stopped : False Local maintenance : False crc32 : d5633613 local_conf_timestamp : 31176590 Host timestamp : 31176590

Hello, Did you try starting using hosted-engine --vm-start Please check/share all relevant logs. from engine and hosts, and at least: engine.log & /var/log/vdsm/* On Mon, Mar 8, 2021 at 3:54 PM <jesado74@gmail.com> wrote:
Hello. I have a 4 host infrastructure in ovirt and a few days ago the hosted-engine was turned off and I cannot turn it on from any host. Any ideas? Thanks.
--== Host host1.myhost.com (id: 7) status ==--
conf_on_shared_storage : True Status up-to-date : True Hostname : host1.myhost.com Host ID : 7 Engine status : {"reason": "bad vm status", "health": "bad", "vm": "down_unexpected", "detail": "Down"} Score : 0 stopped : False Local maintenance : False crc32 : d5633613 local_conf_timestamp : 31176590 Host timestamp : 31176590 _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/URWDMU5RRK2OXO...

For me, what happens every few months is the Hosted Engine fills up its /var/log, specifically the httpd folder. Once this partition is full, the HE only runs for a few seconds before shutting down and trying a different host. Obviously that makes no difference, so it just starts & stops over and over until the log dir is emptied. so do a hosted_engine --vm-stop, and then --vm-start. SSH into it and take a look at the partitions disk space. *Vincent Royer* *778-825-1057* <http://www.epicenergy.ca/> *SUSTAINABLE MOBILE ENERGY SOLUTIONS* On Mon, Mar 8, 2021 at 3:07 AM Ritesh Chikatwar <rchikatw@redhat.com> wrote:
Hello,
Did you try starting using hosted-engine --vm-start
Please check/share all relevant logs. from engine and hosts, and at least:
engine.log & /var/log/vdsm/*
On Mon, Mar 8, 2021 at 3:54 PM <jesado74@gmail.com> wrote:
Hello. I have a 4 host infrastructure in ovirt and a few days ago the hosted-engine was turned off and I cannot turn it on from any host. Any ideas? Thanks.
--== Host host1.myhost.com (id: 7) status ==--
conf_on_shared_storage : True Status up-to-date : True Hostname : host1.myhost.com Host ID : 7 Engine status : {"reason": "bad vm status", "health": "bad", "vm": "down_unexpected", "detail": "Down"} Score : 0 stopped : False Local maintenance : False crc32 : d5633613 local_conf_timestamp : 31176590 Host timestamp : 31176590 _______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/URWDMU5RRK2OXO...
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-leave@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/EXMIFDXHUZ7HGO...

Good Morning; Thanks for the answers. I tried to put the cluster in maintenance, shutdown the hosted-engine and then a --vm-start, the hosted-engine started but after a few days the hosted-engine crashed again and there is no way to get it up. I have shut down machines in the hots to make sure it is not a lack of memory problem. But there is no way to turn it on. I have tried to turn it on from each of the hots (especially the one with the best score) but it turns on and off. Any more ideas; Do you know any tutorial or manual where this type of problem may appear. I have read red-hat virtualization manuals and ovirt manuals, but can't find anything. In the log / var / log / vdsm this error appears, it looks like a cpu problem, but I can't interpret it correctly. -------------------------------------------------- -------------------------------------------------- ------- 2021-04-14 10: 46: 00,856 + 0200 INFO (jsonrpc / 3) [api.virt] FINISH getStats return = {'status': {' message ':' Done ',' code ': 0},' statsList ': [{' status': 'Down', 'exitMessage': "internal error: qemu unexpectedly closed the monitor: 2021-04-14T08: 37: 18.376261Z qemu-kvm: warning: All CPU (s) up to maxcpus should be described in NUMA config, ability to start up with partial NUMA mappings is obsoleted and will be removed in future \ n2021-04-14T08: 37: 18.384137Z qemu-kvm: cannot set up guest memory 'pc.ram': Cannot allocate memory ", 'statusTime': '41000949110', 'vmId': 'c94c7a4b-adbd-4ee0-879d-8cb1f399bf90', 'exitReason': 1, 'exitCode': 1}]} from = :: 1,48086, vmId = c94c7a4b-adbd-4ee0-879d-8cb1f399bf90 (api: 54) 2021-04-14 10: 46: 00,857 + 0200 INFO (jsonrpc / 3) [jsonrpc.JsonRpcServer] RPC call VM.getStats succeeded in 0.00 seconds (__init __: 312) -------------------------------------------------- -------------------------------------------------- ------------ I am new at this. I appreciate your help and answers. All the best.
účastníci (3)
-
jesado74@gmail.com
-
Ritesh Chikatwar
-
Vincent Royer