内核崩溃obj_cgroup_uncharge

Kernel panic obj_cgroup_uncharge

提问人:polymetr 提问时间:7/21/2023 更新时间:7/21/2023 访问量:38

问:

有人可以帮我去切除这个内核恐慌回溯吗?

uname -a
Linux ubuntu-PC 5.19.0-46-generic #47~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Wed Jun 21 
15:35:31 UTC 2 x86_64 x86_64 x86_64 GNU/Linux

来自故障转储的信息

KERNEL: /usr/lib/debug/boot/vmlinux-5.19.0-46-generic
DUMPFILE: /var/crash/202307191343/dump.202307191343  [PARTIAL DUMP]
CPUS: 20
DATE: Wed Jul 19 13:42:41 EET 2023
UPTIME: 5 days, 21:13:29
LOAD AVERAGE: 1.14, 1.64, 1.66
TASKS: 1910
NODENAME: ubuntu-PC
RELEASE: 5.19.0-46-generic
VERSION: #47~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Wed Jun 21 15:35:31 UTC 2
MACHINE: x86_64  (3600 Mhz)
MEMORY: 31.3 GB
PANIC: ""
PID: 65
COMMAND: "ksoftirqd/8"
TASK: ffff98814133b380  [THREAD_INFO: ffff98814133b380]
CPU: 8
STATE: TASK_RUNNING (PANIC)

和 bt

crash> bt
PID: 65       TASK: ffff98814133b380  CPU: 8    COMMAND: "ksoftirqd/8"
#0 [ffffa50ac038b9b0] machine_kexec at ffffffff81495c2b
#1 [ffffa50ac038ba10] __crash_kexec at ffffffff815bb822
#2 [ffffa50ac038bae0] crash_kexec at ffffffff815bd102
#3 [ffffa50ac038baf0] oops_end at ffffffff81446830
#4 [ffffa50ac038bb18] die_addr at ffffffff81446b41
#5 [ffffa50ac038bb48] exc_general_protection at ffffffff8232217a
#6 [ffffa50ac038bbf0] asm_exc_general_protection at ffffffff82400ab7
[exception RIP: refill_obj_stock+83]
RIP: ffffffff817d1b23  RSP: ffffa50ac038bca8  RFLAGS: 00010046
RAX: 0000000000000000  RBX: 00000000000000c8  RCX: 0000000000000000
RDX: 0000000000000000  RSI: 0000000000000000  RDI: 0000000000000000
RBP: ffffa50ac038bcd0   R8: 0000000000000000   R9: 0000000000000000
R10: 0000000000000000  R11: 0000000000000007  R12: ffff98889fa2dd40
R13: 0000000000000202  R14: ffff9881d422c380  R15: ffdf9881d422c380
ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
#7 [ffffa50ac038bcd8] obj_cgroup_uncharge at ffffffff817d5a13
#8 [ffffa50ac038bce8] memcg_slab_free_hook at ffffffff817a6c2c
#9 [ffffa50ac038bd48] kmem_cache_free at ffffffff817ae796
#10 [ffffa50ac038bd90] __d_free_external at ffffffff81812f50
#11 [ffffa50ac038bda8] rcu_do_batch at ffffffff8157c308
#12 [ffffa50ac038be20] rcu_core at ffffffff8157e4ca
#13 [ffffa50ac038be68] rcu_core_si at ffffffff8157e94e
#14 [ffffa50ac038be78] __softirqentry_text_start at ffffffff826000d5
#15 [ffffa50ac038bed8] run_ksoftirqd at ffffffff814d43a7
#16 [ffffa50ac038bee8] smpboot_thread_fn at ffffffff81502d60
#17 [ffffa50ac038bf10] kthread at ffffffff814fa08b
#18 [ffffa50ac038bf50] ret_from_fork at ffffffff814038cf

似乎它发生在 python 进程的 OOM 杀手之后。

我想这是内存问题,因为Postgres数据库有时会出现问题,例如

关系块 4557 中的无效页面

Linux Ubuntu 内核 恐慌

评论

0赞 Tsyvarev 7/21/2023
“有人可以帮我去削这个内核恐慌回溯吗” - 回溯中的哪一行你不能“去削”?如果您希望有人修复 Linux 内核中的该错误,请将其报告给相应的错误跟踪器。Stack Overflow 不是错误跟踪器。
0赞 polymetr 7/21/2023
@Tsyvarev。我的意思是是否有可能区分它是内核错误还是与硬件相关的概率问题。

答: 暂无答案