On the healthy node try restarting: ovirt-ha-agent, ovirt-ha-broker and
vdsmd services.
Verify that the three services are running properly.
Wait a couple of minutes the execute hosted-engine --vm-status to see if
the node returns VM down message, if you get there try to start the VM
engine again.
On Sat, Jun 25, 2022, 11:49 <adam_xu(a)adagene.com.cn> wrote:
I’m using a fc-san storage. The storage is ok.
*发件人:* wodel youchi <wodel.youchi(a)gmail.com>
*发送时间:* 2022年6月25日 18:40
*收件人:* adam_xu(a)adagene.com.cn
*抄送:* users <users(a)ovirt.org>
*主题:* [ovirt-users]Re: 回复: Re: can not access engine when hosted engine
is up
Verify the ovirt-ha-agent service on the healthy node.
What type of storage are you using for the VM engine? Is it still
reachable?
I don't think that the update broke your VM. The should at least boot.
On Sat, Jun 25, 2022, 09:44 <adam_xu(a)adagene.com.cn> wrote:
Vdsm log:
2022-06-25 01:24:48,206-0700 INFO (jsonrpc/6) [vdsm.api] START
multipath_health() from=::1,53638,
task_id=33f083e2-1a06-449c-afce-66cf0c831be0 (api:48)
2022-06-25 01:24:48,206-0700 INFO (jsonrpc/6) [vdsm.api] FINISH
multipath_health return={} from=::1,53638,
task_id=33f083e2-1a06-449c-afce-66cf0c831be0 (api:54)
2022-06-25 01:24:48,206-0700 WARN (jsonrpc/6) [throttled] MOM not
available. Error: [Errno 111] Connection refused (throttledlog:104)
2022-06-25 01:24:48,207-0700 WARN (jsonrpc/6) [throttled] MOM not
available, KSM stats will be missing. Error: (throttledlog:104)
2022-06-25 01:24:48,207-0700 WARN (jsonrpc/6) [root] Failed to retrieve
Hosted Engine HA info, is Hosted Engine setup finished? (api:168)
2022-06-25 01:24:48,208-0700 INFO (jsonrpc/6) [api.host] FINISH getStats
return={'status': {'code': 0, 'message': 'Done'},
'info': (suppressed)}
from=::1,53638 (api:54)
2022-06-25 01:24:49,323-0700 INFO (jsonrpc/4) [vdsm.api] START
repoStats(domains=['74ff7aac-f9e9-4e89-86db-2f40e48ddd85']) from=::1,53638,
task_id=448f81dd-f0bd-4c92-92ca-9d6400be4a7e (api:48)
2022-06-25 01:24:49,323-0700 INFO (jsonrpc/4) [vdsm.api] FINISH repoStats
return={'74ff7aac-f9e9-4e89-86db-2f40e48ddd85': {'code': 0,
'lastCheck':
'2.0', 'delay': '0.000278052', 'valid': True,
'version': 5, 'acquired':
True, 'actual': True}} from=::1,53638,
task_id=448f81dd-f0bd-4c92-92ca-9d6400be4a7e (api:54)
2022-06-25 01:24:51,778-0700 INFO (jsonrpc/1) [api.host] START getStats()
from=::1,53638 (api:48)
2022-06-25 01:24:51,791-0700 INFO (jsonrpc/1) [vdsm.api] START
repoStats(domains=()) from=::1,53638,
task_id=a9ab92c1-87e0-42df-a059-36e2588d5a78 (api:48)
2022-06-25 01:24:51,791-0700 INFO (jsonrpc/1) [vdsm.api] FINISH repoStats
return={'74ff7aac-f9e9-4e89-86db-2f40e48ddd85': {'code': 0,
'lastCheck':
'1.1', 'delay': '0.000278052', 'valid': True,
'version': 5, 'acquired':
True, 'actual': True}, 'fa5059e6-38ad-4f71-ad7d-0fc30aedf254':
{'code': 0,
'lastCheck': '4.4', 'delay': '0.000361428',
'valid': True, 'version': 0,
'acquired': True, 'actual': True}} from=::1,53638,
task_id=a9ab92c1-87e0-42df-a059-36e2588d5a78 (api:54)
2022-06-25 01:24:51,792-0700 INFO (jsonrpc/1) [vdsm.api] START
multipath_health() from=::1,53638,
task_id=9845c9ca-e20e-425a-9fd2-f473b256380e (api:48)
2022-06-25 01:24:51,792-0700 INFO (jsonrpc/1) [vdsm.api] FINISH
multipath_health return={} from=::1,53638,
task_id=9845c9ca-e20e-425a-9fd2-f473b256380e (api:54)
2022-06-25 01:24:51,793-0700 WARN (jsonrpc/1) [root] Failed to retrieve
Hosted Engine HA info, is Hosted Engine setup finished? (api:168)
2022-06-25 01:24:51,793-0700 INFO (jsonrpc/1) [api.host] FINISH getStats
return={'status': {'code': 0, 'message': 'Done'},
'info': (suppressed)}
from=::1,53638 (api:54)
2022-06-25 01:24:52,482-0700 INFO (jsonrpc/7) [api.virt] START getStats()
from=::1,53638, vmId=39877e08-26da-4694-a6a0-53bda1f1f87d (api:48)
2022-06-25 01:24:52,482-0700 INFO (jsonrpc/7) [api.virt] FINISH getStats
return={'status': {'code': 0, 'message': 'Done'},
'statsList':
[{'statusTime': '11315340662', 'status': 'Down',
'vmId':
'39877e08-26da-4694-a6a0-53bda1f1f87d', 'exitCode': 0,
'exitMessage': 'User
shut down from within the guest', 'exitReason': 7}]} from=::1,53638,
vmId=39877e08-26da-4694-a6a0-53bda1f1f87d (api:54)
*发件人:* adam_xu(a)adagene.com.cn <adam_xu(a)adagene.com.cn>
*发送时间:* 2022年6月25日 16:39
*收件人:* 'wodel youchi' <wodel.youchi(a)gmail.com>
*抄送:* 'users' <users(a)ovirt.org>
*主题:* [ovirt-users]回复: Re: can not access engine when hosted engine is up
I restart vdsmd process failed. And when I try to restart the only healthy
node. Now no vm is running now.
Too bad.
# hosted-engine --vm-status
The hosted engine configuration has not been retrieved from shared storage
yet,
please ensure that ovirt-ha-agent service is running.
I think the engine vm is down because of “dnf update” on it.
Is there any recovery mode I can do now?
*发件人:* wodel youchi <wodel.youchi(a)gmail.com>
*发送时间:* 2022年6月25日 16:23
*收件人:* adam_xu(a)adagene.com.cn
*抄送:* users <users(a)ovirt.org>
*主题:* [ovirt-users] Re: can not access engine when hosted engine is up
Try restarting the vdmd process in that node and you can try to start the
VM engine in another healthy hosted node.
On Sat, Jun 25, 2022, 09:07 <adam_xu(a)adagene.com.cn> wrote:
I can not ping the engine. And no kvm process of engine in any node.
I try to run “hosted-engine --vm-start”, it said:
# hosted-engine --vm-start
Traceback (most recent call last):
File "/usr/lib64/python3.6/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/usr/lib64/python3.6/runpy.py", line 85, in _run_code
exec(code, run_globals)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 214, in <module>
args.command(args)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 42, in func
f(*args, **kwargs)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 91, in checkVmStatus
cli = ohautil.connect_vdsm_json_rpc()
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line
474, in connect_vdsm_json_rpc
__vdsm_json_rpc_connect(logger, timeout)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line
415, in __vdsm_json_rpc_connect
timeout=VDSM_MAX_RETRY * VDSM_DELAY
RuntimeError: Couldn't connect to VDSM within 60 seconds
Traceback (most recent call last):
File "/usr/lib64/python3.6/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/usr/lib64/python3.6/runpy.py", line 85, in _run_code
exec(code, run_globals)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 214, in <module>
args.command(args)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 42, in func
f(*args, **kwargs)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py",
line 57, in create
cli = ohautil.connect_vdsm_json_rpc()
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line
474, in connect_vdsm_json_rpc
__vdsm_json_rpc_connect(logger, timeout)
File
"/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line
415, in __vdsm_json_rpc_connect
timeout=VDSM_MAX_RETRY * VDSM_DELAY
RuntimeError: Couldn't connect to VDSM within 60 seconds
VM failed to launch
*发件人:* wodel youchi <wodel.youchi(a)gmail.com>
*发送时间:* 2022年6月25日 15:56
*收件人:* adam_xu(a)adagene.com.cn
*抄送:* users <users(a)ovirt.org>
*主题:* [ovirt-users] Re: can not access engine when hosted engine is up
Hi,
Can you ping the VM engine? If yes then it's up and running, you may ssh
into it and verify the ovirt-engine service if it is running properly.
If the VM engine doesn't ping, search for its kvm process in your hosted
nodes (all of them), for example: ps -ef | grep qemu-kvm | grep -i hosted
If the process exists then VM exists but may be it is paused, if it does
not, then try to start the VM : hosted-engine --vm-start
When the global maintenance is active if the VM engine is rebooted or
shutdown it is not restarted.
Regards.
On Sat, Jun 25, 2022, 08:42 <adam_xu(a)adagene.com.cn> wrote:
Hi ovirt list,
I need help.
I ran “dnf update” on my engine and reboot it. Then I lost connection of
the engine.
When I access one of my host web management
https://ovirthost1:9090, I
saw engine status is “Hosted Engine is up!”, but another host of the
cluster is in down status.
How can I bring up my engine since it is “UP” now.
_______________________________________________
Users mailing list -- users(a)ovirt.org
To unsubscribe send an email to users-leave(a)ovirt.org
Privacy Statement:
https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives:
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DYVOECFXW57...