[PVE-User] Kernel oops

Gerald Brandt gbr at majentis.com
Sun Nov 13 15:15:36 CET 2016


Hi,

I'm getting a lot of crashes on my Proxmox box. I am runing Proxmox on a 
Debian base install, but I have anther boxes that does the same, and it 
is fine.


Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442402] ------------[ cut 
here ]------------
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442408] WARNING: CPU: 2 
PID: 0 at kernel/rcu/tree.c:2733 rcu_process_callbacks+0x5bb/0x5e0()
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442409] Modules linked in: 
nfsv3 rpcsec_gss_krb5 nfsv4 ip_set ip6table_filter ip6_tables 
iptable_filter ip_tables softdog x_tables nfsd auth_rpcgss nfs_acl nfs 
lockd grace fscache sunrpc ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad 
ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi 
nfnetlink_log nfnetlink xfs snd_hda_codec_hdmi nouveau eeepc_wmi 
asus_wmi kvm_amd kvm sparse_keymap irqbypass mxm_wmi crct10dif_pclmul 
snd_hda_codec_realtek crc32_pclmul video snd_hda_codec_generic ttm 
snd_hda_intel drm_kms_helper drm snd_hda_codec aesni_intel aes_x86_64 
lrw gf128mul glue_helper snd_hda_core ablk_helper cryptd snd_hwdep 
i2c_algo_bit snd_pcm fb_sys_fops syscopyarea snd_timer sysfillrect snd 
sysimgblt input_leds pcspkr serio_raw soundcore edac_mce_amd k10temp 
fam15h_power edac_core shpchp i2c_piix4 8250_fintek mac_hid wmi 
vhost_net vhost macvtap macvlan it87 hwmon_vid autofs4 btrfs raid456 
async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq 
libcrc32c raid1 ses enclosure uas usb_storage firewire_ohci r8169 mii 
firewire_core crc_itu_t sata_sil24 ahci libahci fjes
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442454] CPU: 2 PID: 0 Comm: 
swapper/2 Not tainted 4.4.21-1-pve #1
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442455] Hardware name: To 
be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX, BIOS 0901 
11/24/2011
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442457] 0000000000000086 
63ad933f85fa0f2b ffff88083fc83e70 ffffffff813f3f83
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442459] 0000000000000000 
ffffffff81ccfadb ffff88083fc83ea8 ffffffff81081806
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442460] ffffffff81e576c0 
ffff88083fc97f38 0000000000000246 0000000000000000
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442462] Call Trace:
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442463] <IRQ>  
[<ffffffff813f3f83>] dump_stack+0x63/0x90
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442469] 
[<ffffffff81081806>] warn_slowpath_common+0x86/0xc0
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442471] 
[<ffffffff8108194a>] warn_slowpath_null+0x1a/0x20
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442473] 
[<ffffffff810e792b>] rcu_process_callbacks+0x5bb/0x5e0
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442475] 
[<ffffffff8108630e>] __do_softirq+0x10e/0x2a0
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442476] 
[<ffffffff810865fe>] irq_exit+0x8e/0x90
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442480] 
[<ffffffff81857122>] smp_apic_timer_interrupt+0x42/0x50
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442481] 
[<ffffffff818553e2>] apic_timer_interrupt+0x82/0x90
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442482] <EOI>  
[<ffffffff816d23ea>] ? cpuidle_enter_state+0x10a/0x260
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442487] 
[<ffffffff816d23c6>] ? cpuidle_enter_state+0xe6/0x260
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442488] 
[<ffffffff816d2577>] cpuidle_enter+0x17/0x20
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442491] 
[<ffffffff810c453b>] call_cpuidle+0x3b/0x70
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442492] 
[<ffffffff816d2553>] ? cpuidle_select+0x13/0x20
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442494] 
[<ffffffff810c482f>] cpu_startup_entry+0x2bf/0x380
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442496] 
[<ffffffff81051a34>] start_secondary+0x154/0x190
Nov 13 06:15:54 gbr-proxmox-1 kernel: [61228.442497] ---[ end trace 
8a742910926b0ed4 ]---
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.617812] BUG: unable to 
handle kernel paging request at 000000000000bb00
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.618057] IP: 
[<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.618662] PGD 5cb1c5067 PUD 
5cb0f2067 PMD 0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.619431] Oops: 0000 [#1] SMP
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.620253] Modules linked in: 
nfsv3 rpcsec_gss_krb5 nfsv4 ip_set ip6table_filter ip6_tables 
iptable_filter ip_tables softdog x_tables nfsd auth_rpcgss nfs_acl nfs 
lockd grace fscache sunrpc ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad 
ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi 
nfnetlink_log nfnetlink xfs snd_hda_codec_hdmi nouveau eeepc_wmi 
asus_wmi kvm_amd kvm sparse_keymap irqbypass mxm_wmi crct10dif_pclmul 
snd_hda_codec_realtek crc32_pclmul video snd_hda_codec_generic ttm 
snd_hda_intel drm_kms_helper drm snd_hda_codec aesni_intel aes_x86_64 
lrw gf128mul glue_helper snd_hda_core ablk_helper cryptd snd_hwdep 
i2c_algo_bit snd_pcm fb_sys_fops syscopyarea snd_timer sysfillrect snd 
sysimgblt input_leds pcspkr serio_raw soundcore edac_mce_amd k10temp 
fam15h_power edac_core shpchp i2c_piix4 8250_fintek mac_hid wmi 
vhost_net vhost macvtap macvlan it87 hwmon_vid autofs4 btrfs raid456 
async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq 
libcrc32c raid1 ses enclosure uas usb_storage firewire_ohci r8169 mii 
firewire_core crc_itu_t sata_sil24 ahci libahci fjes
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.624994] CPU: 5 PID: 23044 
Comm: ps Tainted: G        W       4.4.21-1-pve #1
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.626005] Hardware name: To 
be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX, BIOS 0901 
11/24/2011
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.627039] task: 
ffff880818ed3700 ti: ffff8805cb27c000 task.ti: ffff8805cb27c000
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.628071] RIP: 
0010:[<ffffffff811ebe57>]  [<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.629113] RSP: 
0018:ffff8805cb27fc98  EFLAGS: 00010282
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.630145] RAX: 
0000000000000000 RBX: 00000000024080c0 RCX: 00000000000c428b
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.631198] RDX: 
00000000000c428a RSI: 00000000024080c0 RDI: ffff88081f003700
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.632239] RBP: 
ffff8805cb27fcc8 R08: 000000000001a480 R09: 000000000000bb00
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.633275] R10: 
0000000000000006 R11: 0000000000000000 R12: 00000000024080c0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.634310] R13: 
ffffffff8120f26c R14: ffff88081f003700 R15: ffff88081f003700
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.635346] FS: 
00007f54269ce700(0000) GS:ffff88083fd40000(0000) knlGS:0000000000000000
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.636350] CS: 0010 DS: 0000 
ES: 0000 CR0: 0000000080050033
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.637388] CR2: 
000000000000bb00 CR3: 000000052f4f5000 CR4: 00000000000406e0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.638425] Stack:
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.639455] ffff8805cb27fcd0 
0000000000000000 ffff880819ad3cc0 ffff8805cb27fef4
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.640500] 0000000000000000 
ffff8805cb27fdd0 ffff8805cb27fcf0 ffffffff8120f26c
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.641545] ffffffff81217f1d 
0000000000008000 ffff8805cb27fef4 ffff8805cb27fdc0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.642587] Call Trace:
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.643623] 
[<ffffffff8120f26c>] get_empty_filp+0x5c/0x1c0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.644660] 
[<ffffffff81217f1d>] ? terminate_walk+0xbd/0xd0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.645699] 
[<ffffffff8121bee3>] path_openat+0x43/0x1530
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.646731] 
[<ffffffff8121d544>] ? putname+0x54/0x60
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.647758] 
[<ffffffff8121d9e5>] ? filename_lookup+0xf5/0x180
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.648781] 
[<ffffffff8121e5d1>] do_filp_open+0x91/0x100
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.649802] 
[<ffffffff8138eaba>] ? common_perm_cond+0x3a/0x50
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.650814] 
[<ffffffff8111e472>] ? from_kgid_munged+0x12/0x20
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.651825] 
[<ffffffff81212b27>] ? cp_new_stat+0x157/0x190
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.652786] 
[<ffffffff8122bf86>] ? __alloc_fd+0x46/0x180
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.653804] 
[<ffffffff8120c8a9>] do_sys_open+0x139/0x2a0
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.654795] 
[<ffffffff8120ca2e>] SyS_open+0x1e/0x20
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.655780] 
[<ffffffff81854676>] entry_SYSCALL_64_fastpath+0x16/0x75
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.656766] Code: 08 65 4c 03 
05 53 e3 e1 7e 4d 8b 08 4d 85 c9 0f 84 42 01 00 00 49 83 78 10 00 0f 84 
37 01 00 00 49 63 47 20 48 8d 4a 01 4d 8b 07 <49> 8b 1c 01 4c 89 c8 65 
49 0f c7 08 0f 94 c0 84 c0 74 bb 49 63
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.657834] RIP 
[<ffffffff811ebe57>] kmem_cache_alloc+0x77/0x200
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.658878]  RSP <ffff8805cb27fc98>
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.659907] CR2: 000000000000bb00
Nov 13 06:17:06 gbr-proxmox-1 kernel: [61300.667666] ---[ end trace 
8a742910926b0ed5 ]---

I am non-subscriptions, and I just did an update yesterday to see if it 
would fix the error. I'll be running a memtest today to see if I can 
find anything.

I hadn't done an update in awhile before that, so I'm leaning towards a 
hardware issue. What do you think?

Gerald




More information about the pve-user mailing list