Bug#742722: Kernel Oops: BUG: unable to handle kernel paging request at ffff88025c9c6a08
Package: linux-image-3.2.0-4-amd64
Version: 3.2.54-2
We got from time to time the following kernel Oops.
The Oops occurred during high load caused by an oracle database export to a
mounted CIFS share. The machine runs on a ESX host along with a lot other
machines that run happily during the same time, so I do not expect some kind of
memory corruption. How we could proceed? We have captured some (three) kernel
dumps if that will help.
[950029.451929] BUG: unable to handle kernel paging request at ffff88025c9c6a08
[950029.451993] IP: [<ffffffff81043eb4>] sched_destroy_group+0x34/0x11c
[950029.452043] PGD 1606063 PUD 0
[950029.452068] Oops: 0000 [#1] SMP
[950029.452095] CPU 0
[950029.452110] Modules linked in: des_generic ecb md4 hmac nls_utf8 cifs
autofs4 vsock(O) nfsd nfs nfs_acl auth_rpcgss fscache lockd sunrpc ext2 mbcache
loop coretemp snd_pcm snd_page_alloc snd_timer snd soundcore vmw_balloon
crc32c_intel pcspkr psmouse evdev serio_raw parport_pc parport ac vmwgfx
i2c_piix4 processor power_supply thermal_sys container vmci(O) ttm drm i2c_core
shpchp button xfs dm_mod vmxnet(O) vmw_pvscsi vmxnet3 sr_mod cdrom ata_generic
sg sd_mod crc_t10dif floppy e1000 ata_piix mptspi scsi_transport_spi mptscsih
mptbase libata scsi_mod [last unloaded: scsi_wait_scan]
[950029.452546]
[950029.452559] Pid: 3, comm: ksoftirqd/0 Tainted: G O 3.2.0-4-amd64
#1 Debian 3.2.54-2 VMware, Inc. VMware Virtual Platform/440BX Desktop Reference
Platform
[950029.452663] RIP: 0010:[<ffffffff81043eb4>] [<ffffffff81043eb4>]
sched_destroy_group+0x34/0x11c
[950029.452740] RSP: 0018:ffff880236cb5d80 EFLAGS: 00010297
[950029.452790] RAX: ffff88025c9c6980 RBX: ffff880234a47740 RCX:
0000000000000000
[950029.452863] RDX: 0000000000000000 RSI: 00000000c094c093 RDI:
0000000000000200
[950029.452936] RBP: 0000000000000000 R08: 0000000000000000 R09:
ffffffff8168f0a0
[950029.453009] R10: ffff88012ec1d3b0 R11: ffff8800030d6600 R12:
0000000000013780
[950029.453082] R13: ffff88023fc00000 R14: ffffffff8168f0a0 R15:
0000000000000000
[950029.453156] FS: 0000000000000000(0000) GS:ffff88023fc00000(0000)
knlGS:0000000000000000
[950029.453235] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[950029.453287] CR2: ffff88025c9c6a08 CR3: 0000000001605000 CR4:
00000000000006f0
[950029.453393] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[950029.453479] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[950029.453553] Process ksoftirqd/0 (pid: 3, threadinfo ffff880236cb4000, task
ffff880236ca8730)
[950029.453636] Stack:
[950029.453663] 0000000000000000 ffff880203b7acc0 ffffffff81043fa5
ffff880233e80ea8
[950029.453743] 0000000000000006 ffff88023fc0edf0 ffff880046b38e88
ffffffff811ad1c2
[950029.453823] 00000000c094c093 ffff8800aaf71a40 ffff8800aaf71a40
ffffffff81044753
[950029.453902] Call Trace:
[950029.453934] [<ffffffff81043fa5>] ? cpu_cgroup_destroy+0x9/0x9
[950029.453989] [<ffffffff811ad1c2>] ? kref_put+0x3e/0x47
[950029.454038] [<ffffffff81044753>] ? free_signal_struct+0x24/0x34
[950029.454092] [<ffffffff81044959>] ? __put_task_struct+0x9d/0xb9
[950029.454147] [<ffffffff81095d52>] ? __rcu_process_callbacks+0x1c8/0x2ce
[950029.454205] [<ffffffff81095e82>] ? rcu_process_callbacks+0x2a/0x54
[950029.454262] [<ffffffff8104c346>] ? __do_softirq+0xb9/0x177
[950029.454313] [<ffffffff8104c47e>] ? run_ksoftirqd+0x7a/0x118
[950029.454365] [<ffffffff8104c404>] ? __do_softirq+0x177/0x177
[950029.454418] [<ffffffff8105f681>] ? kthread+0x76/0x7e
[950029.454467] [<ffffffff81356ef4>] ? kernel_thread_helper+0x4/0x10
[950029.454522] [<ffffffff8105f60b>] ? kthread_worker_fn+0x139/0x139
[950029.454577] [<ffffffff81356ef0>] ? gs_change+0x13/0x13
[950029.454625] Code: c7 c4 80 37 01 00 55 83 cd ff 53 48 89 fb 50 4c 8b 35 b9
1e 3c 00 eb 73 48 8b 43 28 4c 63 fd 4e 8b 2c fd c0 da 68 81 4a 8b 04 f8 <83> b8
88 00 00 00 00 74 57 4d 01 e5 4c 89 ef e8 b8 be 30 00 48
[950029.454868] RIP [<ffffffff81043eb4>] sched_destroy_group+0x34/0x11c
[950029.454926] RSP <ffff880236cb5d80>
[950029.454963] CR2: ffff88025c9c6a08
debian_version: 7.4
uname -a: Linux wiora03 3.2.0-4-amd64 #1 SMP Debian 3.2.54-2 x86_64 GNU/Linux
libc6: Version: 2.13-38+deb7u1
/proc/cpuinfo:
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 26
model name : Intel(R) Xeon(R) CPU E5-2690 0 @ 2.90GHz
stepping : 4
microcode : 0x710
cpu MHz : 2665.909
cache size : 20480 KB
fpu : yes
fpu_exception : yes
cpuid level : 11
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 clflush dts mmx fxsr sse sse2 ss syscall nx rdtscp lm constant_tsc up
arch_perfmon pebs bts nopl xtopology tsc_reliable nonstop_tsc aperfmperf pni
ssse3 cx16 sse4_1 sse4_2 x2apic popcnt hypervisor lahf_lm arat epb pln pts
dtherm
bogomips : 5331.81
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management:
/proc/meminfo
MemTotal: 7938476 kB
MemFree: 135292 kB
Buffers: 36 kB
Cached: 6357784 kB
SwapCached: 484 kB
Active: 4516172 kB
Inactive: 2819844 kB
Active(anon): 2080360 kB
Inactive(anon): 534656 kB
Active(file): 2435812 kB
Inactive(file): 2285188 kB
Unevictable: 0 kB
Mlocked: 0 kB
SwapTotal: 3903484 kB
SwapFree: 3757092 kB
Dirty: 28 kB
Writeback: 0 kB
AnonPages: 977720 kB
Mapped: 909924 kB
Shmem: 1636820 kB
Slab: 270148 kB
SReclaimable: 221772 kB
SUnreclaim: 48376 kB
KernelStack: 3440 kB
PageTables: 98552 kB
NFS_Unstable: 0 kB
Bounce: 0 kB
WritebackTmp: 0 kB
CommitLimit: 7872720 kB
Committed_AS: 3911400 kB
VmallocTotal: 34359738367 kB
VmallocUsed: 290012 kB
VmallocChunk: 34359434084 kB
HardwareCorrupted: 0 kB
AnonHugePages: 0 kB
HugePages_Total: 0
HugePages_Free: 0
HugePages_Rsvd: 0
HugePages_Surp: 0
Hugepagesize: 2048 kB
DirectMap4k: 63488 kB
DirectMap2M: 8325120 kB
Reply to: