I ran some more tests. I have two hosts (hosted_engine1 and
hosted_engine2) configured as hosted-engine hosts, and after I took
the network interface down on hosted_engine1 (ifdown bond0; ovirtmgmt
runs on top of bond0), hosted_engine2 tries to start the engine VM:

==> /var/log/ovirt-hosted-engine-ha/agent.log <==
MainThread::INFO::2016-11-07 18:17:18,598::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan) Found OVF_STORE: imgUUID:8f567072-82d7-4a80-b643-76a1e1d273ef, volUUID:7b12f649-b8cd-426a-95d8-37bb223ac86f
MainThread::INFO::2016-11-07 18:17:18,875::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF) Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2016-11-07 18:17:18,896::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF) OVF_STORE volume path: /rhev/data-center/mnt/blockSD/4c0104e0-dc8d-44b6-ab70-9d46599eb7f6/images/8f567072-82d7-4a80-b643-76a1e1d273ef/7b12f649-b8cd-426a-95d8-37bb223ac86f
MainThread::INFO::2016-11-07 18:17:18,991::config::226::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(refresh_local_conf_file) Found an OVF for HE VM, trying to convert
MainThread::INFO::2016-11-07 18:17:18,994::config::231::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(refresh_local_conf_file) Got vm.conf from OVF_STORE
MainThread::INFO::2016-11-07 18:17:18,994::hosted_engine::1148::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_clean_vdsm_state) Ensuring VDSM state is clear for engine VM
MainThread::INFO::2016-11-07 18:17:19,011::hosted_engine::1160::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_clean_vdsm_state) Vdsm state for VM clean
MainThread::INFO::2016-11-07 18:17:19,011::hosted_engine::1109::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_start_engine_vm) Starting vm using `/usr/sbin/hosted-engine --vm-start`
MainThread::INFO::2016-11-07 18:17:19,410::hosted_engine::1115::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_start_engine_vm) stdout:
fe6a0618-62aa-434d-b9b8-459603a690ce
        Status = WaitForLaunch
        nicModel = rtl8139,pv
        statusTime = 4905357140
        emulatedMachine = pc
        pid = 0
        vmName = HostedEngine
        devices = [{'index': '0', 'iface': 'virtio', 'format': 'raw', 'bootOrder': '1', 'address': 'None', 'volumeID': '5e24a8ba-ff1f-409b-918c-b0c31565ae89', 'imageID': '23ec5c8d-d154-488b-8dd2-de29a0fd941d', 'readonly': 'false', 'domainID': '4c0104e0-dc8d-44b6-ab70-9d46599eb7f6', 'deviceId': '23ec5c8d-d154-488b-8dd2-de29a0fd941d', 'poolID': '00000000-0000-0000-0000-000000000000', 'device': 'disk', 'shared': 'exclusive', 'propagateErrors': 'off', 'type': 'disk'}, {'nicModel': 'pv', 'macAddr': '00:16:3e:1c:f3:7c', 'linkActive': 'true', 'network': 'ovirtmgmt', 'deviceId': '113617eb-d506-4df1-b542-26248ffd8027', 'address': 'None', 'device': 'bridge', 'type': 'interface'}, {'index': '2', 'iface': 'ide', 'readonly': 'true', 'deviceId': '8c3179ac-b322-4f5c-9449-c52e3665e0ae', 'address': {'bus': '1', 'controller': '0', 'type': 'drive', 'target': '0', 'unit': '0'}, 'device': 'cdrom', 'shared': 'false', 'path': '', 'type': 'disk'}]
        guestDiskMapping = {}
        vmType = kvm
        displaySecurePort = -1
        memSize = 4096
        displayPort = -1
        clientIp =
        spiceSecureChannels = smain,sdisplay,sinputs,scursor,splayback,srecord,ssmartcard,susbredir
        smp = 2
        displayIp = 0
        display = vnc

MainThread::INFO::2016-11-07 18:17:19,411::hosted_engine::1116::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_start_engine_vm) stderr: /usr/share/vdsm/vdsClient.py:33: DeprecationWarning: vdscli uses xmlrpc. since ovirt 3.6 xmlrpc is deprecated, please use vdsm.jsonrpcvdscli
  from vdsm import utils, vdscli, constants
/usr/share/vdsm/vdsClient.py:33: DeprecationWarning: vdscli uses xmlrpc. since ovirt 3.6 xmlrpc is deprecated, please use vdsm.jsonrpcvdscli
  from vdsm import utils, vdscli, constants

MainThread::INFO::2016-11-07 18:17:19,411::hosted_engine::1128::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_start_engine_vm) Engine VM started on localhost
MainThread::INFO::2016-11-07 18:17:19,416::brokerlink::111::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify) Trying: notify time=1478542639.42 type=state_transition detail=EngineStart-EngineStarting hostname='ied-blade11.install.eurotux.local'
MainThread::INFO::2016-11-07 18:17:19,681::brokerlink::120::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify) Success, was notification of state_transition (EngineStart-EngineStarting) sent? sent
MainThread::INFO::2016-11-07 18:17:19,681::hosted_engine::612::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm) Initializing VDSM

But I get the following error, and I don't see hosted_engine1 being
fenced:

==> /var/log/ovirt-hosted-engine-ha/broker.log <==
Thread-1007::INFO::2016-11-07 18:17:21,407::ping::52::ping.Ping::(action) Successfully pinged 10.10.4.254
Thread-297080::INFO::2016-11-07 18:17:21,622::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup) Connection established
Thread-297080::INFO::2016-11-07 18:17:21,676::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle) Connection closed
Thread-297081::INFO::2016-11-07 18:17:21,676::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup) Connection established
Thread-297081::INFO::2016-11-07 18:17:21,678::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle) Connection closed
Thread-297082::INFO::2016-11-07 18:17:21,678::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup) Connection established
Thread-297082::INFO::2016-11-07 18:17:21,680::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle) Connection closed
Thread-1010::ERROR::2016-11-07 18:17:23,596::cpu_load_no_engine::156::cpu_load_no_engine.EngineHealth::(update_stat_file) Failed to getVmStats: 'pid'
Thread-1010::INFO::2016-11-07 18:17:23,597::cpu_load_no_engine::121::cpu_load_no_engine.EngineHealth::(calculate_load) System load total=0.0732, engine=0.0000, non-engine=0.0732
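For reference, this is roughly how I drive and watch the test; the
interface name, hostnames and log paths are from my setup:

# On hosted_engine1: take down the management network
# (ovirtmgmt sits on top of bond0 here)
ifdown bond0

# On hosted_engine2: hosted-engine --vm-status reads the shared
# metadata, so it keeps working while the engine VM itself is down
watch -n 10 hosted-engine --vm-status

# Follow both HA logs at the same time
tail -f /var/log/ovirt-hosted-engine-ha/agent.log /var/log/ovirt-hosted-engine-ha/broker.log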
Can you help me understand what is going wrong?

Regards,
Carlos Rodrigues

On Tue, 2016-08-23 at 14:29 +0100, Carlos Rodrigues wrote:
<blockquote type="cite">
On Tue, 2016-08-23 at 14:16 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">
On Mon, Aug 22, 2016 at 1:10 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">

On Fri, 2016-08-19 at 11:50 +0100, Carlos Rodrigues wrote:
<blockquote type="cite">

On Fri, 2016-08-19 at 12:24 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">


On Fri, Aug 19, 2016 at 12:07 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">



On Fri, 2016-08-19 at 10:47 +0100, Carlos Rodrigues wrote:
<blockquote type="cite">



On Fri, 2016-08-19 at 11:36 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">






On Fri, Aug 19, 2016 at 11:29 AM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">




Overnight, the OVF_STORE was created:

</blockquote>

It's quite strange that it took so long, but now it looks fine.

If the ISO_DOMAIN that I see in your screenshot is served by the
engine VM itself, I suggest removing it and exporting it from an
external server.
Serving the ISO storage domain from the engine VM itself is not a
good idea, since when the engine VM is down you can experience long
delays before getting the engine VM restarted, due to the
unavailable storage domain.
</blockquote>

OK, thank you for the advice.

Now everything apparently is OK. I'll do more tests with HA and
report any issues to you.

Thank you for your support.

Regards,
Carlos Rodrigues

</blockquote>

I shut down the network on the host running the engine VM and
expected the other host to fence it and start the engine VM, but I
don't see any fence action; the "free" host keeps trying to start
the VM and gets a sanlock error:

Aug 19 11:03:03 ied-blade11.install.eurotux.local kernel: qemu-kvm: sending ioctl 5326 to a partition!
Aug 19 11:03:03 ied-blade11.install.eurotux.local kernel: qemu-kvm: sending ioctl 80200204 to a partition!
Aug 19 11:03:03 ied-blade11.install.eurotux.local kvm[7867]: 1 guest now active
Aug 19 11:03:03 ied-blade11.install.eurotux.local sanlock[884]: 2016-08-19 11:03:03+0100 1023 [903]: r3 paxos_acquire owner 1 delta 1 9 245502 alive
Aug 19 11:03:03 ied-blade11.install.eurotux.local sanlock[884]: 2016-08-19 11:03:03+0100 1023 [903]: r3 acquire_token held error -243
Aug 19 11:03:03 ied-blade11.install.eurotux.local sanlock[884]: 2016-08-19 11:03:03+0100 1023 [903]: r3 cmd_acquire 2,9,7862 acquire_token -243 lease owned by other host
Aug 19 11:03:03 ied-blade11.install.eurotux.local libvirtd[1369]: resource busy: Failed to acquire lock: error -243
Aug 19 11:03:03 ied-blade11.install.eurotux.local kernel: ovirtmgmt: port 2(vnet0) entered disabled state
Aug 19 11:03:03 ied-blade11.install.eurotux.local kernel: device vnet0 left promiscuous mode
Aug 19 11:03:03 ied-blade11.install.eurotux.local kernel: ovirtmgmt: port 2(vnet0) entered disabled state
Aug 19 11:03:03 ied-blade11.install.eurotux.local kvm[7885]: 0 guests now active
Aug 19 11:03:03 ied-blade11.install.eurotux.local systemd-machined[7863]: Machine qemu-4-HostedEngine terminated.
</blockquote>

Maybe you hit this one:
https://bugzilla.redhat.com/show_bug.cgi?id=1322849

Can you please check it as described in comment 28 and, if needed,
apply the workaround in comment 18?

</blockquote>

Apparently the host-id is OK, so I don't need to apply the
workaround.
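For completeness, this is roughly what I checked; the host_id key
name comes from my /etc/ovirt-hosted-engine/hosted-engine.conf and
may differ on other versions:

# Run on both hosts: each hosted-engine host must have a distinct
# host_id; per the bug, two hosts sharing host_id 1 breaks sanlock.
grep host_id /etc/ovirt-hosted-engine/hosted-engine.conf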

</blockquote>

Any other suggestions?
I checked that the second host doesn't fence the failed host, and
maybe this is what causes the lock on the hosted storage domain.
</blockquote>

The picture still isn't that clear to me: ovirt-ha-agent does not
try to fence other hosted-engine hosts.
The communication between hosts simply happens through the metadata
area on the shared storage.

Do you also lose the connection to the storage domain when you play
with the management network on your host?

</blockquote>

No, the storage domain is connected over Fibre Channel. We saw that
the agent reads the metadata and tries to start the engine VM, but
after a couple of seconds it shuts the VM down because it is not
able to acquire the sanlock lease.
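This is roughly how we looked at the lease while that was happening
(a sketch; the lockspace and resource names in the output are
specific to our storage domain):

# On the host that is trying to start the VM: list sanlock
# lockspaces and resources. "lease owned by other host" (error -243)
# means the failed host still holds the lease until it expires.
sanlock client status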



<blockquote type="cite">
<blockquote type="cite">

After the network recovers, the engine VM checks the hosts and sends
fence commands to them. In some cases, it sends fence commands to
both hosts.

Regards,
Carlos Rodrigues


<blockquote type="cite">


<blockquote type="cite">




<blockquote type="cite">



<blockquote type="cite">



<blockquote type="cite">



<blockquote type="cite">



Regards,
Carlos Rodrigues

On Fri, 2016-08-19 at 08:29 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">






On Thu, Aug 18, 2016 at 6:38 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">




On Thu, 2016-08-18 at 17:45 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">




On Thu, Aug 18, 2016 at 5:43 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">





I increased the hosted_engine disk space to 160G. How do I force it
to create the OVF_STORE?
</blockquote>

I think that restarting the engine on the engine VM will trigger it,
although I'm not sure that it was a size issue.
</blockquote>

I found two OVF_STORE disks on another storage domain, with "Domain
Type" "Data (Master)".


</blockquote>

Each storage domain has its own OVF_STORE volumes; you should get
them also on the hosted-engine storage domain.
Not really sure about how to trigger it again; adding Roy here.
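One thing worth checking on the engine VM, assuming your engine
version exposes this configuration key, is how often the engine
rewrites the OVF_STORE:

# The engine refreshes the OVF_STORE disks periodically;
# the default should be 60 minutes.
engine-config -g OvfUpdateIntervalInMinutes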




<blockquote type="cite">







<blockquote type="cite">




<blockquote type="cite">





Regards,
Carlos Rodrigues

On Thu, 2016-08-18 at 12:14 +0100, Carlos Rodrigues wrote:
<blockquote type="cite">





On Thu, 2016-08-18 at 12:34 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">






On Thu, Aug 18, 2016 at 12:11 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">







On Thu, 2016-08-18 at 11:53 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">









On Thu, Aug 18, 2016 at 11:50 AM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">







On Thu, 2016-08-18 at 11:42 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">







On Thu, Aug 18, 2016 at 11:25 AM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">








On Thu, 2016-08-18 at 11:04 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">








On Thu, Aug 18, 2016 at 10:36 AM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">









On Thu, 2016-08-18 at 10:27 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">









On Thu, Aug 18, 2016 at 10:22 AM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">










On Thu, 2016-08-18 at 08:54 +0200, Simone Tiraboschi wrote:
<blockquote type="cite">










On Tue, Aug 16, 2016 at 12:53 PM, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">











On Sun, 2016-08-14 at 14:22 +0300, Roy Golan wrote:
<blockquote type="cite">













On 12 August 2016 at 20:23, Carlos Rodrigues <cmar@eurotux.com> wrote:
<blockquote type="cite">











Hello,

I have one cluster with two hosts, with power management correctly
configured, and one virtual machine with the HostedEngine over
shared storage with Fibre Channel.

When I shut down the network of the host with the HostedEngine VM,
should it be possible for the HostedEngine VM to migrate
automatically to another host?

</blockquote>
Migrate on which network?

<blockquote type="cite">











What is the expected behaviour in this HA scenario?
</blockquote>

After a few minutes your VM will be shut down by the High
Availability agent, as it can't see the network, and started on
another host.
</blockquote>


I'm testing this
scenario
and
after shutdown
network, it
should
be
expected that agent
shutdown ha
and started
on
another
host,
but
after
couple minutes
nothing
happens
and on host
with
network we
getting
the
following messages:

Aug 16 11:44:08 ied-
blade11.install.eurot
ux.l
oc
al
ovirt-ha-
agent[2779]:
ovirt-ha-agent
ovirt_hosted_engine_h
a.ag
en
t.ho
st
ed_engine.Ho
st
edEn
gine.con
fig
ERROR
Unable to get vm.conf
from
OVF_STORE, falling
back
to
initial
vm.conf

I think the HA agent
its
trying
to get vm
configuration but
some
how it
can't get vm.conf to
start
VM.
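A rough way to check for the OVF_STORE images directly from a host
(a sketch; <sdUUID> is a placeholder for the UUID of your
hosted-engine storage domain):

# List the image UUIDs on the storage domain; the two OVF_STORE
# disks should show up here once the engine has created them.
vdsClient -s 0 getImagesList <sdUUID>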
</blockquote>

No, this is a different issue.
In 3.6 we added a feature to let the engine manage also the engine
VM itself; ovirt-ha-agent will pick up the latest engine VM
configuration from the OVF_STORE, which is managed by the engine.
If something goes wrong, ovirt-ha-agent can fall back to the initial
(bootstrap time) vm.conf. This will normally happen until you add
your first regular storage domain and the engine imports the engine
VM.
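A rough way to see which vm.conf the agent actually used (paths from
a 3.6/4.0 host; they may differ elsewhere):

# The agent logs whether vm.conf came from the OVF_STORE or from the
# initial fallback copy.
grep -E 'vm.conf|OVF_STORE' /var/log/ovirt-hosted-engine-ha/agent.log | tail -n 20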
</blockquote>

But I already have my first storage domain and the engine storage
domain, and the engine VM is already imported.

I'm using version 4.0.
</blockquote>

This seems like an issue; can you please share your
/var/log/ovirt-hosted-engine-ha/agent.log?

</blockquote>

I sent it as an attachment.
</blockquote>

Nothing strange here; do you see a couple of disks with alias
OVF_STORE on the hosted-engine storage domain if you check it from
the engine?

</blockquote>

Do you mean a disk label? I don't have any:

[root@ied-blade11 ~]# ls /dev/disk/by-label/
ls: cannot access /dev/disk/by-label/: No such file or directory
</blockquote>

No, I mean: go to the engine web UI, select the hosted-engine
storage domain, and check the disks there.
</blockquote>

No, the alias is virtio-disk0.

</blockquote>

And this is the engine VM disk, so the issue is why the engine still
has to create the OVF_STORE.
Can you please share your engine.log from the engine VM?

</blockquote>

It goes as an attachment.
</blockquote>

The creation of the OVF_STORE disk failed, but it's not that clear
why:

2016-08-17 08:43:33,538 ERROR [org.ovirt.engine.core.bll.storage.ovfstore.CreateOvfVolumeForStorageDomainCommand] (DefaultQuartzScheduler6) [6f1f1fd4] Ending command 'org.ovirt.engine.core.bll.storage.ovfstore.CreateOvfVolumeForStorageDomainCommand' with failure.
2016-08-17 08:43:33,540 ERROR [org.ovirt.engine.core.bll.storage.disk.AddDiskCommand] (DefaultQuartzScheduler6) [6f1f1fd4] Ending command 'org.ovirt.engine.core.bll.storage.disk.AddDiskCommand' with failure.
2016-08-17 08:43:33,541 WARN [org.ovirt.engine.core.bll.storage.disk.AddDiskCommand] (DefaultQuartzScheduler6) [6f1f1fd4] VmCommand::EndVmCommand: Vm is null - not performing endAction on Vm
2016-08-17 08:43:33,553 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [6f1f1fd4] Correlation ID: 6f1f1fd4, Call Stack: null, Custom Event ID: -1, Message: Add-Disk operation failed to complete.
2016-08-17 08:43:33,557 WARN [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [] Correlation ID: 19ac5bda, Call Stack: null, Custom Event ID: -1, Message: Failed to create OVF store disk for Storage Domain hosted_storage.
OVF data won't be updated meanwhile for that domain.
2016-08-17 08:43:33,585 INFO [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] (DefaultQuartzScheduler6) [5f5a8daf] Command 'ProcessOvfUpdateForStorageDomain' (id: '71aaaafe-7b9e-45e8-a40c-6d33bdf646a0') waiting on child command id: 'eb2e6f1a-c756-4ccd-85a1-60d97d6880de' type:'CreateOvfVolumeForStorageDomain' to complete
2016-08-17 08:43:33,595 ERROR [org.ovirt.engine.core.bll.storage.ovfstore.CreateOvfVolumeForStorageDomainCommand] (DefaultQuartzScheduler6) [5d314e49] Ending command 'org.ovirt.engine.core.bll.storage.ovfstore.CreateOvfVolumeForStorageDomainCommand' with failure.
2016-08-17 08:43:33,596 ERROR [org.ovirt.engine.core.bll.storage.disk.AddDiskCommand] (DefaultQuartzScheduler6) [5d314e49] Ending command 'org.ovirt.engine.core.bll.storage.disk.AddDiskCommand' with failure.
2016-08-17 08:43:33,596 WARN [org.ovirt.engine.core.bll.storage.disk.AddDiskCommand] (DefaultQuartzScheduler6) [5d314e49] VmCommand::EndVmCommand: Vm is null - not performing endAction on Vm
2016-08-17 08:43:33,602 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [5d314e49] Correlation ID: 5d314e49, Call Stack: null, Custom Event ID: -1, Message: Add-Disk operation failed to complete.
2016-08-17 08:43:33,605 WARN [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler6) [] Correlation ID: 5f5a8daf, Call Stack: null, Custom Event ID: -1, Message: Failed to create OVF store disk for Storage Domain hosted_storage.
OVF data won't be updated meanwhile for that domain.
2016-08-17 08:43:36,460 INFO [org.ovirt.engine.core.bll.scheduling.HaReservationHandling] (DefaultQuartzScheduler7) [5d314e49] HA reservation status for cluster 'Default' is 'OK'
2016-08-17 08:43:36,662 INFO [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] (DefaultQuartzScheduler4) [5f5a8daf] Command 'ProcessOvfUpdateForStorageDomain' id: '71aaaafe-7b9e-45e8-a40c-6d33bdf646a0' child commands '[84959a4b-6a10-4d22-b37e-6c154e17a0da, eb2e6f1a-c756-4ccd-85a1-60d97d6880de]' executions were completed, status 'FAILED'
2016-08-17 08:43:37,691 ERROR [org.ovirt.engine.core.bll.storage.ovfstore.ProcessOvfUpdateForStorageDomainCommand] (DefaultQuartzScheduler6) [5f5a8daf] Ending command 'org.ovirt.engine.core.bll.storage.ovfstore.ProcessOvfUpdateForStorageDomainCommand' with failure.

Can you please check the vdsm logs for that time frame on the SPM
host?
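For example, something along these lines on the SPM host (the
timestamp pattern just matches the 08:42-08:43 failure window from
the log above):

# Print only the vdsm.log lines from the failure window
awk '/2016-08-17 08:4[23]/' /var/log/vdsm/vdsm.log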

</blockquote>

I sent the vdsm logs from both hosts as attachments, but I think the
SPM host in this time frame was ied-blade13.

<blockquote type="cite">







It seems that you also have an issue in the SPM election procedure:

2016-08-17 18:04:31,053 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler1) [] SPM Init: could not find reported vds or not up - pool: 'Default' vds_spm_id: '2'
2016-08-17 18:04:31,076 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler1) [] SPM selection - vds seems as spm 'hosted_engine_2'
2016-08-17 18:04:31,076 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler1) [] spm vds is non responsive, stopping spm selection.
2016-08-17 18:04:31,539 INFO [org.ovirt.engine.core.vdsbroker.monitoring.VmsStatisticsFetcher] (DefaultQuartzScheduler7) [] Fetched 1 VMs from VDS '06372186-572c-41ad-916f-7cbb0aba5302'

probably due to:

2016-08-17 18:02:33,569 ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler6) [] Failure to refresh Vds runtime info: VDSGenericException: VDSNetworkException: Message timeout which can be caused by communication issues
2016-08-17 18:02:33,569 ERROR [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] (DefaultQuartzScheduler6) [] Exception: org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: VDSGenericException: VDSNetworkException: Message timeout which can be caused by communication issues
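To see which host vdsm itself currently reports as SPM, something
like this should work (a sketch; <spUUID> is a placeholder for your
storage pool UUID):

# Ask vdsm for the current SPM status of the pool
vdsClient -s 0 getSpmStatus <spUUID>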

</blockquote>

These messages may be caused by the connection issues from yesterday
at 6 pm.

<blockquote type="cite">






Can you please check if the engine VM can correctly resolve and
reach each host?
</blockquote>

Now I can reach the engine VM from both hosts:

[root@ied-blade11 ~]# ping ied-hosted-engine
PING ied-hosted-engine.install.eurotux.local (10.10.4.115) 56(84) bytes of data.
64 bytes from ied-hosted-engine.install.eurotux.local (10.10.4.115): icmp_seq=1 ttl=64 time=0.179 ms
64 bytes from ied-hosted-engine.install.eurotux.local (10.10.4.115): icmp_seq=2 ttl=64 time=0.141 ms
^C
--- ied-hosted-engine.install.eurotux.local ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 999ms
rtt min/avg/max/mdev = 0.141/0.160/0.179/0.019 ms

[root@ied-blade13 ~]# ping ied-hosted-engine
PING ied-hosted-engine.install.eurotux.local (10.10.4.115) 56(84) bytes of data.
64 bytes from ied-hosted-engine.install.eurotux.local (10.10.4.115): icmp_seq=1 ttl=64 time=0.172 ms
64 bytes from ied-hosted-engine.install.eurotux.local (10.10.4.115): icmp_seq=2 ttl=64 time=0.169 ms
^C
--- ied-hosted-engine.install.eurotux.local ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1000ms
rtt min/avg/max/mdev = 0.169/0.170/0.172/0.013 ms


I have a critical message about low disk space on the hosted_storage
domain. The domain has 50G and I created a VM with a 40G disk. Do I
need more space for the OVF_STORE?
What are the minimum disk space requirements for deploying the
engine VM?

Regards,
Carlos

<blockquote type="cite">








<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







<blockquote type="cite">







Regards,
Carlos Rodrigues


<blockquote type="cite">











<blockquote type="cite">













</blockquote>

</blockquote>
</blockquote>
</blockquote>

</blockquote>
</blockquote>
</blockquote>
</blockquote>

</blockquote>
</blockquote>


</blockquote>

</blockquote>
</blockquote>
</blockquote>
</blockquote>

</blockquote>
</blockquote>


</blockquote>

</blockquote>


</blockquote>

</blockquote>
</blockquote>


</blockquote>

</blockquote>
</blockquote>

</blockquote>
</blockquote>
</blockquote>
--
Carlos Rodrigues

Senior Software Engineer

Eurotux Informática, S.A. | www.eurotux.com
(t) +351 253 680 300 (m) +351 911 926 110