
I am running into panics on my ovirt hosts and wanted to know if anyone has seen the same thing. I have 18 ovirt hosts and so far 8 of them have crashed. I have another 18 hosts running KVM on Fedora 16 that are doing fine. There are hardware differences besides oVirt, however they are relatively minor (or so we think). [2299856.345978] systemd[1]: segfault at 7fff36f4df40 ip 000000000040b38c sp 00007fff36f4df30 error 6 in systemd[400000+ce000] [2299856.464068] Kernel panic - not syncing: Attempted to kill init! [2299856.470172] Pid: 1, comm: systemd Tainted: G D 3.2.7-1.fc16.x86_64 #1 [2299856.477656] Call Trace: [2299856.480293] [<ffffffff815d7510>] panic+0x91/0x1a7 [2299856.485265] [<ffffffff81072541>] do_exit+0x861/0x8a0 [2299856.490498] [<ffffffff810728d2>] do_group_exit+0x42/0xa0 [2299856.496081] [<ffffffff810826f6>] get_signal_to_deliver+0x206/0x5b0 [2299856.502526] [<ffffffff81014165>] do_signal+0x65/0x770 [2299856.507847] [<ffffffff815d7677>] ? printk+0x51/0x53 [2299856.512994] [<ffffffff81014918>] do_notify_resume+0x88/0xb0 [2299856.518834] [<ffffffff815e22fc>] retint_signal+0x48/0x8c ---- [88255.648967] BUG: unable to handle kernel NULL pointer dereference at 0000000000000038 [88255.649918] IP: [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88255.649918] PGD 40090c067 PUD 3c6225067 PMD 0 [88255.649918] Oops: 0000 [#1] SMP [88255.649918] CPU 1 [88255.649918] Modules linked in: ip6table_filter ip6_tables ebtable_nat ebtables be2iscsi iscsi_boot_sysfs bnx2i cnic uio cxgb4i cxgb4 cxgb3i libcxgbi cxgb3 mdio ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi 8021q garp serio_raw i2c_i801 microcode i5000_edac iTCO_wdt edac_core iTCO_vendor_support ioatdma i5k_amb dca shpchp vhost_net macvtap macvlan tun virtio_net kvm_intel kvm bridge stp llc bonding e1000e radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core dm_multipath nfs lockd fscache auth_rpcgss nfs_acl sunrpc [last unloaded: scsi_wait_scan] [88255.649918] [88255.649918] Pid: 790, comm: updatedb Not tainted 3.2.7-1.fc16.x86_64 #1 Rackable Systems Inc. S5000PSL/S5000PSL [88255.649918] RIP: 0010:[<ffffffffa008b191>] [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88255.649918] RSP: 0018:ffff8803c6117b48 EFLAGS: 00010286 [88255.649918] RAX: ffffffffa00c5600 RBX: ffff8803ed090e40 RCX: 000000000000001c [88255.649918] RDX: ffff8803c6117c20 RSI: 0000000000000000 RDI: ffff8803ed090e40 [88255.649918] RBP: ffff8803c6117b88 R08: 000000000000001c R09: ffff8803ed090e78 [88255.649918] R10: ffff8803ed090e40 R11: 0000000000000002 R12: ffff880402565300 [88255.649918] R13: ffff8803c6117be8 R14: 0000000000000000 R15: ffff8803ffe8b100 [88255.649918] FS: 00007f1f008a4700(0000) GS:ffff88041fc40000(0000) knlGS:0000000000000000 [88255.649918] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [88255.649918] CR2: 0000000000000038 CR3: 00000003c60b8000 CR4: 00000000000006e0 [88255.649918] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [88255.649918] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [88255.649918] Process updatedb (pid: 790, threadinfo ffff8803c6116000, task ffff8803c62bae40) [88255.649918] Stack: [88255.649918] 0000000000000000 ffff8803ffe8b100 ffff8803c6117b88 ffff8803ed090e40 [88255.649918] ffff880402565300 ffff8803c6117be8 0000000000000000 ffff8803ffe8b100 [88255.649918] ffff8803c6117bc8 ffffffff81184680 ffff8803c6117ba8 ffffffff8126369c [88255.649918] Call Trace: [88255.649918] [<ffffffff81184680>] __lookup_hash.part.8+0x90/0xe0 [88255.649918] [<ffffffff8126369c>] ? security_inode_permission+0x1c/0x30 [88255.649918] [<ffffffff81184b7e>] lookup_one_len+0xee/0x120 [88255.649918] [<ffffffff81184978>] ? inode_permission+0x48/0x100 [88255.649918] [<ffffffffa009bff3>] nfs_sillyrename+0x103/0x520 [nfs] [88255.649918] [<ffffffff81165bbd>] ? kmem_cache_alloc+0x11d/0x140 [88255.649918] [<ffffffff8118ff44>] ? __d_alloc+0x34/0x180 [88255.649918] [<ffffffff81190040>] ? __d_alloc+0x130/0x180 [88255.649918] [<ffffffffa008aecc>] nfs_rename+0x1cc/0x240 [nfs] [88255.649918] [<ffffffff81185d35>] vfs_rename+0x2b5/0x460 [88255.649918] [<ffffffff81184680>] ? __lookup_hash.part.8+0x90/0xe0 [88255.649918] [<ffffffff81189473>] sys_renameat+0x1f3/0x220 [88255.649918] [<ffffffff81193823>] ? notify_change+0x253/0x340 [88255.649918] [<ffffffff81196aa4>] ? mntput_no_expire+0x24/0x100 [88255.649918] [<ffffffff81196b9f>] ? mntput+0x1f/0x30 [88255.649918] [<ffffffff81183c72>] ? path_put+0x22/0x30 [88255.649918] [<ffffffff811894bb>] sys_rename+0x1b/0x20 [88255.649918] [<ffffffff815e9d82>] system_call_fastpath+0x16/0x1b [88255.649918] Code: 54 e1 e9 4c fe ff ff 0f 1f 00 55 48 89 e5 48 83 ec 40 48 89 5d d8 4c 89 65 e0 4c 89 6d e8 4c 89 75 f0 4c 89 7d f8 66 66 66 66 90 <f6> 46 38 40 b8 f6 ff ff ff 49 89 fd 49 89 f7 0f 85 d6 00 00 00 [88255.649918] RIP [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88255.649918] RSP <ffff8803c6117b48> [88255.649918] CR2: 0000000000000038 [88256.012265] ---[ end trace 4301002d5bb8ae27 ]--- ---- [2379039.703224] systemd[1]: segfault at 7fffa79e8f18 ip 00000000004661ee sp 00007fffa79e8f10 error 6 in systemd[400000+ce000] [2379039.781077] Kernel panic - not syncing: Attempted to kill init! [2379039.787304] Pid: 1, comm: systemd Tainted: G D 3.2.7-1.fc16.x86_64 #1 [2379039.794886] Call Trace: [2379039.797564] [<ffffffff815d7510>] panic+0x91/0x1a7 [2379039.802618] [<ffffffff81072541>] do_exit+0x861/0x8a0 [2379039.807937] [<ffffffff810728d2>] do_group_exit+0x42/0xa0 [2379039.813623] [<ffffffff810826f6>] get_signal_to_deliver+0x206/0x5b0 [2379039.820161] [<ffffffff81014165>] do_signal+0x65/0x770 [2379039.825563] [<ffffffff815d7677>] ? printk+0x51/0x53 [2379039.830787] [<ffffffff8106d043>] ? do_fork+0x173/0x330 [2379039.836279] [<ffffffff81014918>] do_notify_resume+0x88/0xb0 [2379039.842220] [<ffffffff815e22fc>] retint_signal+0x48/0x8c ---- [88539.434313] BUG: unable to handle kernel NULL pointer dereference at 0000000000000038 [88539.435257] IP: [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88539.435257] PGD 27dbd5067 PUD 2953af067 PMD 0 [88539.435257] Oops: 0000 [#1] SMP [88539.435257] CPU 1 [88539.435257] Modules linked in: ip6table_filter ip6_tables ebtable_nat ebtables be2iscsi iscsi_boot_sysfs bnx2i cnic uio cxgb4i cxgb4 cxgb3i libcxgbi cxgb3 mdio ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi 8021q garp iTCO_wdt iTCO_vendor_support i5000_edac edac_core i2c_i801 microcode serio_raw i5k_amb ioatdma dca shpchp vhost_net macvtap macvlan tun virtio_net kvm_intel kvm bridge stp llc bonding e1000e radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core dm_multipath nfs lockd fscache auth_rpcgss nfs_acl sunrpc [last unloaded: scsi_wait_scan] [88539.435257] [88539.435257] Pid: 5947, comm: updatedb Not tainted 3.2.7-1.fc16.x86_64 #1 Rackable Systems Inc. S5000PSL/S5000PSL [88539.435257] RIP: 0010:[<ffffffffa008b191>] [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88539.435257] RSP: 0018:ffff8802490a1b48 EFLAGS: 00010286 [88539.435257] RAX: ffffffffa00c5600 RBX: ffff8802f35f3e40 RCX: 000000000000001c [88539.435257] RDX: ffff8802490a1c20 RSI: 0000000000000000 RDI: ffff8802f35f3e40 [88539.435257] RBP: ffff8802490a1b88 R08: 000000000000001c R09: ffff8802f35f3e78 [88539.435257] R10: ffff8802f35f3e40 R11: 0000000000000001 R12: ffff8803066373c0 [88539.435257] R13: ffff8802490a1be8 R14: 0000000000000000 R15: ffff880307a9b8e0 [88539.435257] FS: 00007fdd293bc700(0000) GS:ffff88031fc40000(0000) knlGS:0000000000000000 [88539.435257] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [88539.435257] CR2: 0000000000000038 CR3: 00000002a5e46000 CR4: 00000000000026e0 [88539.435257] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [88539.435257] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [88539.435257] Process updatedb (pid: 5947, threadinfo ffff8802490a0000, task ffff880300314560) [88539.435257] Stack: [88539.435257] 0000000000000000 ffff880307a9b8e0 ffff8802490a1b88 ffff8802f35f3e40 [88539.435257] ffff8803066373c0 ffff8802490a1be8 0000000000000000 ffff880307a9b8e0 [88539.435257] ffff8802490a1bc8 ffffffff81184680 ffff8802490a1ba8 ffffffff8126369c [88539.435257] Call Trace: [88539.435257] [<ffffffff81184680>] __lookup_hash.part.8+0x90/0xe0 [88539.435257] [<ffffffff8126369c>] ? security_inode_permission+0x1c/0x30 [88539.435257] [<ffffffff81184b7e>] lookup_one_len+0xee/0x120 [88539.435257] [<ffffffff81184978>] ? inode_permission+0x48/0x100 [88539.435257] [<ffffffffa009bff3>] nfs_sillyrename+0x103/0x520 [nfs] [88539.435257] [<ffffffff81165bbd>] ? kmem_cache_alloc+0x11d/0x140 [88539.435257] [<ffffffff8118ff44>] ? __d_alloc+0x34/0x180 [88539.435257] [<ffffffff81190040>] ? __d_alloc+0x130/0x180 [88539.435257] [<ffffffffa008aecc>] nfs_rename+0x1cc/0x240 [nfs] [88539.435257] [<ffffffff81185d35>] vfs_rename+0x2b5/0x460 [88539.435257] [<ffffffff81184680>] ? __lookup_hash.part.8+0x90/0xe0 [88539.435257] [<ffffffff81189473>] sys_renameat+0x1f3/0x220 [88539.435257] [<ffffffff81193823>] ? notify_change+0x253/0x340 [88539.435257] [<ffffffff81196aa4>] ? mntput_no_expire+0x24/0x100 [88539.435257] [<ffffffff81196b9f>] ? mntput+0x1f/0x30 [88539.435257] [<ffffffff81183c72>] ? path_put+0x22/0x30 [88539.435257] [<ffffffff811894bb>] sys_rename+0x1b/0x20 [88539.435257] [<ffffffff815e9d82>] system_call_fastpath+0x16/0x1b [88539.435257] Code: 54 e1 e9 4c fe ff ff 0f 1f 00 55 48 89 e5 48 83 ec 40 48 89 5d d8 4c 89 65 e0 4c 89 6d e8 4c 89 75 f0 4c 89 7d f8 66 66 66 66 90 <f6> 46 38 40 b8 f6 ff ff ff 49 89 fd 49 89 f7 0f 85 d6 00 00 00 [88539.435257] RIP [<ffffffffa008b191>] nfs_lookup_revalidate+0x21/0x490 [nfs] [88539.435257] RSP <ffff8802490a1b48> [88539.435257] CR2: 0000000000000038 [88539.797861] ---[ end trace a1ed36439ad7c011 ]---
<> Nathan Stratton nathan at robotics.net http://www.robotics.net
participants (1)
-
Nathan Stratton