The virtual machine crashed and I can't shut down the VM successfully

This problem has bothered me for a long time. Sometimes my VM crashes and I can't shut it down; I have to reboot the ovirt-node to solve it.

The VM's error is below:

The ovirt-node's error is below:

The VM's threads on the ovirt-node: the I/O ratio is 100% and the VM's process changes to defunct. I cannot kill it.

Every time, I have to shut down the ovirt-node from the engine website. The VM's status stays in the process of shutting down, even if I wait for hours. It has either failed or is shutting down, and "power off" can't shut down the VM either.

----------------------------------------------------------------
Other information about my ovirt-node:
node-version:
node hardware:

zhouhao@vip.friendtimes.net
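A process that shows as defunct, or that ignores kill while I/O is stuck, is normally in the Linux process state Z (zombie) or D (uninterruptible sleep). The following is a minimal sketch, not something taken from this thread, for listing such processes on the ovirt-node by reading /proc/<pid>/stat. The function names are made up for illustration, and the script assumes a Linux host.

```python
import os


def parse_stat_line(stat_line):
    """Split one /proc/<pid>/stat line into (pid, command, state)."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ")", so split on the last ")" rather than on whitespace.
    open_paren = stat_line.index("(")
    close_paren = stat_line.rindex(")")
    pid = int(stat_line[:open_paren])
    command = stat_line[open_paren + 1:close_paren]
    state = stat_line[close_paren + 1:].split()[0]
    return pid, command, state


def stuck_processes(proc_root="/proc"):
    """Return (pid, command, state) for processes in state D or Z."""
    stuck = []
    if not os.path.isdir(proc_root):
        return stuck  # not a Linux host, nothing to scan
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, command, state = parse_stat_line(handle.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited while scanning, or entry unreadable
        if state in ("D", "Z"):
            stuck.append((pid, command, state))
    return stuck


if __name__ == "__main__":
    for pid, command, state in stuck_processes():
        print(pid, command, state)
```

A qemu process that stays in state D usually points at blocked storage I/O underneath it rather than at the VM itself, which would be consistent with the 100% I/O ratio described above.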

Probably related to XFS defragmentation. Have you tried googling the error message?

https://bugzilla.kernel.org/show_bug.cgi?id=73831
https://centos.org/forums/v...
https://blog.codecentric.de/en/2017/04/xfs-possible-memory-allocation-deadlo...

(In reply to the message from zhouhao@vip.friendtimes.net of Tuesday, 17 September 2019, 13:55:45 GMT+7.)

_______________________________________________
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-leave@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/2MDD6E7Y22ZOUD...
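The reports linked in the reply above concern the kernel warning "possible memory allocation deadlock" logged by XFS. Assuming the error on the ovirt-node is the same one (the thread does not show the actual text), a small sketch like this can confirm whether the kernel log contains it. The sample line in the test data is illustrative only and was not taken from this host.

```python
MARKER = "possible memory allocation deadlock"


def xfs_deadlock_lines(log_text):
    """Return the kernel log lines that carry the XFS allocation warning."""
    return [
        line
        for line in log_text.splitlines()
        if "XFS" in line and MARKER in line
    ]


# Illustrative input only; real text would come from the host's kernel log.
SAMPLE_LOG = (
    "kernel: XFS: qemu-kvm(4321) possible memory allocation deadlock "
    "size 65552 in kmem_alloc (mode:0x250)\n"
    "kernel: device-mapper: multipath: reinstated path\n"
)

if __name__ == "__main__":
    for line in xfs_deadlock_lines(SAMPLE_LOG):
        print(line)
```

On the node, the text to scan could be the output of `dmesg` or `journalctl -k` saved to a file and read into `log_text`.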

There is no useful information on Google. I can't solve this problem; I can only restart the ovirt-node.

zhouhao@vip.friendtimes.net

From: tedhima@yahoo.co.uk
Date: 2019-09-17 15:30
To: users; devel; zhouhao@vip.friendtimes.net
Subject: Re: [ovirt-users] the virtual machine crushed and I cant shutdown the vm successfully

Probably related to XFS defragmentation, have you tried to google the error message?
https://bugzilla.kernel.org/show_bug.cgi?id=73831
https://centos.org/forums/viewtopic.php?t=52412
https://blog.codecentric.de/en/2017/04/xfs-possible-memory-allocation-deadlo...

How can I solve it safely? There are 50 VMs running on this GFS.

The mail message is below:

Time: 2019-09-24 04:47:20.511
Message: Detected change in status of brick 192.168.3.16:/vmdata/gfs of volume bojoy-GFS of cluster bojoy-cluster from UP to DOWN via cli.
Severity: WARNING

Status below:

GFS service status

Can I click the 'Start' button?

zhouhao@vip.friendtimes.net

From: zhouhao@vip.friendtimes.net
Date: 2019-09-24 11:36
To: users
Subject: [ovirt-users] One of the brick in the GFS is down ,How can I slove it?

zhouhao@vip.friendtimes.net

From: zhouhao@vip.friendtimes.net
Date: 2019-09-24 11:41
To: zhouhao; users
Subject: Re: [ovirt-users] One of the brick in the GFS is down ,How can I slove it?

It is preferable to have all GFS nodes online and running first. You can follow the troubleshooting guide here:
https://www.ovirt.org/documentation/gluster-hyperconverged/chap-Troubleshoot...

On Tue, Sep 24, 2019 at 7:18 AM zhouhao@vip.friendtimes.net <zhouhao@vip.friendtimes.net> wrote:
------------------------------ zhouhao@vip.friendtimes.net
*From:* zhouhao@vip.friendtimes.net *Date:* 2019-09-24 11:41 *To:* zhouhao <zhouhao@vip.friendtimes.net>; users <users@ovirt.org> *Subject:* Re: [ovirt-users] One of the brick in the GFS is down ,How can I slove it?
Can I click the 'Start' button?

You can try Start on the volume. Since the volume is already started, this will do a force start to bring the offline bricks back. Also, can you check whether this is an issue with monitoring from the engine? What does "gluster volume status" return when run from the CLI of one of the gluster servers? If the brick is still offline even after the force start, you'll need to attach the glusterd logs and brick logs to troubleshoot.

On Tue, Sep 24, 2019 at 1:15 PM Amit Bawer <abawer@redhat.com> wrote:
it is preferable to have all GFS nodes online and running first, you can follow troubleshooting here:
https://www.ovirt.org/documentation/gluster-hyperconverged/chap-Troubleshoot...
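The check suggested in this reply (look at "gluster volume status", then force start the volume if a brick is offline) can be sketched as below. This is a rough illustration under stated assumptions, not an official procedure: it assumes each brick occupies one row of the plain-text status table, the helper names are hypothetical, and the sample output is invented for the example (only the brick 192.168.3.16:/vmdata/gfs and the volume name bojoy-GFS come from the thread; the 192.0.2.10 brick is a placeholder). The script only builds the force-start command and does not run it.

```python
def offline_bricks(status_text):
    """Return brick names whose Online column is not 'Y'."""
    offline = []
    for line in status_text.splitlines():
        fields = line.split()
        # Brick rows: Brick <host:/path> <tcp port> <rdma port> <online> <pid>
        if len(fields) >= 6 and fields[0] == "Brick":
            brick, online = fields[1], fields[-2]
            if online != "Y":
                offline.append(brick)
    return offline


def force_start_command(volume):
    """Build (but do not run) the CLI call that force starts a volume."""
    return ["gluster", "volume", "start", volume, "force"]


# Invented sample shaped like `gluster volume status bojoy-GFS` output.
SAMPLE_STATUS = """\
Status of volume: bojoy-GFS
Gluster process                             TCP Port  RDMA Port  Online  Pid
------------------------------------------------------------------------------
Brick 192.0.2.10:/vmdata/gfs                49152     0          Y       2101
Brick 192.168.3.16:/vmdata/gfs              N/A       N/A        N       N/A
"""

if __name__ == "__main__":
    down = offline_bricks(SAMPLE_STATUS)
    print("offline bricks:", down)
    if down:
        print("suggested command:", " ".join(force_start_command("bojoy-GFS")))
```

If the status output shows every brick online while the engine still reports one as down, that would point to the monitoring side rather than to gluster itself.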
participants (4)
- Amit Bawer
- Sahina Bose
- tedhima@yahoo.co.uk
- zhouhao@vip.friendtimes.net