--Apple-Mail=_36EF5248-AA3D-484E-8A71-55F0C8DF8023
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
charset=us-ascii
Hey All,
Since updating to 4.0.x of oVirt, I have had an issue with my hosted =
engine. After a some poking around, I think I have figured out my issue =
and thought I would share to see what others think.
The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in =
4.0.4.
Description:
When my hosted engine starts it reports that it is in a degraded state =
with 7 or 8 services still not started when I run systemctl status. It =
takes about 6 or 7 minutes to eventually start all the services and come =
online. If I don't set my cluster to Global-Maintenance mode it =
eventually thinks that my hosted-engine needs to be rebooted and =
restarts it before it can start everything.
Solution:
I realized that Apache was the culprit and found that the proxy to the =
ovirt-engine in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super =
long timeout with many retries. I changed the settings and now =
everything works for me.
-> Before change:
<LocationMatch =
^/(ovirt-engine($|/)|api($|/)|RHEVManagerWeb/|OvirtEngineWeb/|ca.crt$|engi=
ne.ssh.key.txt$|rhevm.ssh.key.txt$)>
ProxyPassMatch ajp://127.0.0.1:8702 timeout=3D3600 retry=3D5
<IfModule deflate_module>
AddOutputFilterByType DEFLATE text/javascript text/css =
text/html text/xml text/json application/xml application/json =
application/x-yaml
</IfModule>
</LocationMatch>
-> After change:
<LocationMatch ^/ovirt-engine($|/)>
ProxyPassMatch ajp://127.0.0.1:8702 timeout=3D5 retry=3D2
<IfModule deflate_module>
AddOutputFilterByType DEFLATE text/javascript text/css =
text/html text/xml text/json application/xml application/json =
application/x-yaml
</IfModule>
</LocationMatch>
If I read the timeout settings correctly, it will wait 60 minutes with 5 =
retries. 5 hours is way too long for my little server to hold onto all =
those apache processes. The change I made allows for there to be an =
error, and also releases apache's hold on the process. Once everything =
is ready, apache is ready to serve requests and everything/everyone is =
happy. Before making the change, I just get a whitescreen in my browser =
and then nothing works until I restart Apache (or I end up in an endless =
loop of ovirt-ha services restarting my hosted-engine.
I noticed that this setting reverts to the original setting, so oVirt =
must be writing this file. Perhaps these number can be changed in oVirt? =
If not, I will just setup and ansible play to revert the settings with =
working values and restart apache on my engine.
:-)
Cheers,
Gervais
--Apple-Mail=_36EF5248-AA3D-484E-8A71-55F0C8DF8023
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
charset=us-ascii
<html><head><meta http-equiv=3D"Content-Type"
content=3D"text/html =
charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" =
class=3D"">Hey All,<div class=3D""><br
class=3D""></div><div =
class=3D"">Since updating to 4.0.x of oVirt, I have had an issue with my =
hosted engine. After a some poking around, I think I have figured out my =
issue and thought I would share to see what others think.</div><div =
class=3D"">The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and =
still exists in 4.0.4.</div><div class=3D""><br
class=3D""></div><div =
class=3D"">Description:</div><div class=3D"">When my
hosted engine =
starts it reports that it is in a degraded state with 7 or 8 services =
still not started when I run systemctl status. It takes about 6 or 7 =
minutes to eventually start all the services and come online. If I don't =
set my cluster to Global-Maintenance mode it eventually thinks that my =
hosted-engine needs to be rebooted and restarts it before it can start =
everything.</div><div class=3D""><br
class=3D""></div><div =
class=3D"">Solution:</div><div class=3D"">I realized
that Apache was the =
culprit and found that the proxy to the ovirt-engine =
in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super long =
timeout with many retries. I changed the settings and now everything =
works for me.</div><div class=3D""><br
class=3D""></div><div =
class=3D"">-> Before change:</div><blockquote
style=3D"margin: 0 0 0 =
40px; border: none; padding: 0px;" class=3D""><div
class=3D""><div =
class=3D""> <LocationMatch =
^/(ovirt-engine($|/)|api($|/)|RHEVManagerWeb/|OvirtEngineWeb/|ca.crt$|engi=
ne.ssh.key.txt$|rhevm.ssh.key.txt$)></div><div
class=3D""> =
ProxyPassMatch <a
href=3D"ajp://127.0.0.1:8702" =
class=3D"">ajp://127.0.0.1:8702</a> timeout=3D3600
retry=3D5</div><div =
class=3D""><br class=3D""></div><div
class=3D""> =
<IfModule deflate_module></div><div
class=3D""> =
AddOutputFilterByType DEFLATE =
text/javascript text/css text/html text/xml text/json application/xml =
application/json application/x-yaml</div><div class=3D"">
=
</IfModule></div><div
class=3D""> =
</LocationMatch></div></div></blockquote><div
class=3D""><br =
class=3D""></div>-> After change:<blockquote
style=3D"margin: 0 0 0 =
40px; border: none; padding: 0px;" class=3D""><div
class=3D""><div =
class=3D""> <LocationMatch =
^/ovirt-engine($|/)></div><div class=3D"">
=
ProxyPassMatch <a href=3D"ajp://127.0.0.1:8702" =
class=3D"">ajp://127.0.0.1:8702</a> timeout=3D5
retry=3D2</div><div =
class=3D""><br class=3D""></div><div
class=3D""> =
<IfModule deflate_module></div><div
class=3D""> =
AddOutputFilterByType DEFLATE =
text/javascript text/css text/html text/xml text/json application/xml =
application/json application/x-yaml</div><div class=3D"">
=
</IfModule></div><div
class=3D""> =
</LocationMatch></div></div></blockquote><div
class=3D""><br =
class=3D""></div>If I read the timeout settings correctly, it will wait
=
60 minutes with 5 retries. 5 hours is way too long for my little server =
to hold onto all those apache processes. The change I made allows for =
there to be an error, and also releases apache's hold on the process. =
Once everything is ready, apache is ready to serve requests and =
everything/everyone is happy. Before making the change, I just get a =
whitescreen in my browser and then nothing works until I restart Apache =
(or I end up in an endless loop of ovirt-ha services restarting my =
hosted-engine.<br class=3D""><div class=3D""><div
class=3D""><div =
class=3D""><br
class=3D"webkit-block-placeholder"></div><div
class=3D"">I =
noticed that this setting reverts to the original setting, so oVirt must =
be writing this file. Perhaps these number can be changed in oVirt? If =
not, I will just setup and ansible play to revert the settings with =
working values and restart apache on my engine.</div><div =
class=3D"">:-)</div><div class=3D"">
<div id=3D"signature" class=3D""><br
class=3D"">Cheers,<br =
class=3D"">Gervais<br class=3D""><br
class=3D""><br class=3D""></div>
</div>
<br class=3D""></div></div></body></html>=
--Apple-Mail=_36EF5248-AA3D-484E-8A71-55F0C8DF8023--