<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Sep 29, 2016 at 2:51 PM, Gervais de Montbrun <span dir="ltr"><<a href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word">Hi Martin,<div><br></div><div>The entropy was super low. Somewhere around 140. I installed and configured haveged.service to start at bootup, reverted my apache changes... After a reboot, my systemctl status still says that there are 7 services queued (note that I erroneously said degraded in my previous email - the services are, in fact, queued), but the oVirt GUI comes up almost immediately and everything seems to be great.</div><div><br></div></div></blockquote><div class="gmail_quote"><br></div>Take care that using havaged on a VM should not be considered a good source of entropy and the oVirt PKi is managed by the engine.<br><a href="http://security.stackexchange.com/questions/34523/is-it-appropriate-to-use-haveged-as-a-source-of-entropy-on-virtual-machines" target="_blank">http://security.stackexchange.<wbr>com/questions/34523/is-it-<wbr>appropriate-to-use-haveged-as-<wbr>a-source-of-entropy-on-<wbr>virtual-machines</a><br><br>A better approach is the virtio-rng paravirtualised rng driver as for patch <a href="https://gerrit.ovirt.org/#/c/62334/">https://gerrit.ovirt.org/#/c/62334/</a></div><div class="gmail_quote"><br><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word"><div></div><div>Thank you for the tip. You solved my issue.<br><div>
<div><br>Cheers,<br>Gervais<br><br><br></div>
</div><div><div>
<br><div><blockquote type="cite"><div>On Sep 29, 2016, at 7:47 AM, Martin Perina <<a href="mailto:mperina@redhat.com" target="_blank">mperina@redhat.com</a>> wrote:</div><br><div><div dir="ltr"><div>Hi,<br><br></div><div>please take a look at my inline comments:<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 27, 2016 at 7:23 PM, Gervais de Montbrun <span dir="ltr"><<a href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a>></span> wrote:<br><blockquote style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" class="gmail_quote"><div style="word-wrap:break-word">Hey All,<div><br></div><div>Since updating to 4.0.x of oVirt, I have had an issue with my hosted engine. After a some poking around, I think I have figured out my issue and thought I would share to see what others think.</div><div>The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in 4.0.4.</div><div><br></div><div>Description:</div><div>When my hosted engine starts it reports that it is in a degraded state with 7 or 8 services still not started when I run systemctl status. It takes about 6 or 7 minutes to eventually start all the services and come online. If I don't set my cluster to Global-Maintenance mode it eventually thinks that my hosted-engine needs to be rebooted and restarts it before it can start everything.</div></div></blockquote><div><br><div>Could you please share with us logs gathered by ovirt-log-collector?<br><br>It's just a guess but could you please take a look if you HE VM has enough entropy?<br><br> cat /proc/sys/kernel/random/entrop<wbr>y_avail<br><br></div><div>If the value is low (below or around 200), you really need to install and configure some entropy generator such as haveged<br><br></div></div><blockquote style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" class="gmail_quote"><div style="word-wrap:break-word"><div><br></div><div>Solution:</div><div>I realized that Apache was the culprit and found that the proxy to the ovirt-engine in /etc/httpd/conf.d/z-ovirt-e<wbr>ngine-proxy.conf has a super long timeout with many retries. I changed the settings and now everything works for me.</div><div><br></div><div>-> Before change:</div><blockquote style="margin:0px 0px 0px 40px;border-width:medium;border-style:none;padding:0px"><div><div> <LocationMatch ^/(ovirt-engine($|/)|api($|/)|<wbr>RHEVManagerWeb/|OvirtEngineWeb<wbr>/|ca.crt$|engine.ssh.key.txt$|<wbr>rhevm.ssh.key.txt$)></div><div> ProxyPassMatch <a>ajp://127.0.0.1:8702</a> timeout=3600 retry=5</div><div><br></div><div> <IfModule deflate_module></div><div> AddOutputFilterByType DEFLATE text/javascript text/css text/html text/xml text/json application/xml application/json application/x-yaml</div><div> </IfModule></div><div> </LocationMatch></div></div></blockquote><div><br></div>-> After change:<blockquote style="margin:0px 0px 0px 40px;border-width:medium;border-style:none;padding:0px"><div><div> <LocationMatch ^/ovirt-engine($|/)></div><div> ProxyPassMatch <a>ajp://127.0.0.1:8702</a> timeout=5 retry=2</div><div><br></div><div> <IfModule deflate_module></div><div> AddOutputFilterByType DEFLATE text/javascript text/css text/html text/xml text/json application/xml application/json application/x-yaml</div><div> </IfModule></div><div> </LocationMatch></div></div></blockquote></div></blockquote><div><br><div style="font-family:arial,helvetica,sans-serif;display:inline">This one is correct for 4.0</div><div style="font-family:arial,helvetica,sans-serif;display:inline">, not sure why it was not updated during upgrade from 3.6. @Simone?<br></div> </div><blockquote style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" class="gmail_quote"><div style="word-wrap:break-word"><div><br></div>If I read the timeout settings correctly, it will wait 60 minutes with 5 retries. 5 hours is way too long for my little server to hold onto all those apache processes. </div></blockquote><blockquote style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" class="gmail_quote"><div style="word-wrap:break-word">The change I made allows for there to be an error, and also releases apache's hold on the process. Once everything is ready, apache is ready to serve requests and everything/everyone is happy. Before making the change, I just get a whitescreen in my browser and then nothing works until I restart Apache (or I end up in an endless loop of ovirt-ha services restarting my hosted-engine.<br></div></blockquote><div><br><div style="font-family:arial,helvetica,sans-serif;display:inline">Well, if you have an issue with too many apache processes waiting for engine to respond, then there's some issue in engine. As I wrote above please share the logs with us and check entropy.<br><br></div><div style="font-family:arial,helvetica,sans-serif;display:inline">Thanks<br><br></div><div style="font-family:arial,helvetica,sans-serif;display:inline">Martin Perina<br></div> </div><blockquote style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex" class="gmail_quote"><div style="word-wrap:break-word"><div><div><div><br></div><div>I noticed that this setting reverts to the original setting, so oVirt must be writing this file. Perhaps these number can be changed in oVirt? If not, I will just setup and ansible play to revert the settings with working values and restart apache on my engine.</div><div>:-)</div><div>
<div><br>Cheers,<br>Gervais<br><br><br></div>
</div>
<br></div></div></div><br>______________________________<wbr>_________________<br>
Users mailing list<br>
<a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
<a rel="noreferrer" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
<br></blockquote></div><br></div></div>
</div></blockquote></div><br></div></div></div></div></blockquote></div><br></div></div>