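I have an Ubuntu 14.04 server that occasionally emits "NOHZ: local_softirq_pending 08" errors to dmesg. This started after upgrading to kernel 4.4; previously it ran on a 3.16 kernel without problems. Here is an excerpt from the end of the log: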
[ 7.805258] audit: type=1400 audit(1484883362.092:11): apparmor="STATUS" operation="profile_replace" profile="unconfined" name="/sbin/dhclient" pid=1636 comm="apparmor_parser"
[ 10.605443] igb 0000:c1:00.0 eth0: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
[ 10.605545] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
[ 19.219187] ixgbe 0000:02:00.1 p4p2: NIC Link is Up 10 Gbps, Flow Control: None
[ 19.219368] IPv6: ADDRCONF(NETDEV_CHANGE): p4p2: link becomes ready
[ 52.010390] ip_tables: (C) 2000-2006 Netfilter Core Team
[ 52.089283] init: plymouth-upstart-bridge main process ended, respawning
[ 2857.027773] perf interrupt took too long (2542 > 2500), lowering kernel.perf_event_max_sample_rate to 50000
[ 7195.391731] perf interrupt took too long (5012 > 5000), lowering kernel.perf_event_max_sample_rate to 25000
[37277.461862] perf interrupt took too long (10050 > 10000), lowering kernel.perf_event_max_sample_rate to 12500
[239795.500056] NOHZ: local_softirq_pending 08
[579047.644110] NOHZ: local_softirq_pending 08
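[837865.916051] NOHZ: local_softirq_pending 08
It is a production database host with 32 cores under fairly heavy load.
I would like to know whether I should be concerned about these messages and, if so, how I would go about troubleshooting the problem.
The kernel details are as follows: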
[ 0.000000] Linux version 4.4.0-59-generic (buildd@lcy01-32) (gcc version 4.8.4 (Ubuntu 4.8.4-2ubuntu1~14.04.3) ) #80~14.04.1-Ubuntu SMP Fri Jan 6 18:02:02 UTC 2017 (Ubuntu 4.4.0-59.80~14.04.1-generic 4.4.35)
[ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.4.0-59-generic root=UUID=5db4a2c8-24f4-409b-b437-6120682cc518 ro noautogroup transparent_hugepage=never nomdmonddf nomdmonisw
Answered 2017-01-30 19:52:28
Add nohz=off to the kernel parameters at boot to disable dynamic ticks (NOHZ).
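On Ubuntu the parameter is usually added to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub, followed by `sudo update-grub` and a reboot. As a rough sketch (the helper names `has_param` and `with_param` are made up for illustration, not part of any tool), the following Python checks whether a kernel command line already carries the parameter and builds the new value if it does not:

```python
def has_param(cmdline: str, param: str) -> bool:
    """Return True if `param` appears as a whole token in a kernel command line."""
    return param in cmdline.split()


def with_param(cmdline: str, param: str) -> str:
    """Return the command line with `param` appended, unless it is already there."""
    if has_param(cmdline, param):
        return cmdline
    return (cmdline + " " + param).strip()


if __name__ == "__main__":
    # /proc/cmdline holds the parameters the running kernel was booted with.
    try:
        with open("/proc/cmdline") as f:
            running = f.read().strip()
    except OSError:
        running = ""  # not on Linux, or /proc is unavailable
    print("nohz=off active:", has_param(running, "nohz=off"))
```

After rebooting, running the script (or simply `cat /proc/cmdline`) should confirm that the parameter took effect.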
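This option causes RCU to attempt to accelerate grace periods in order to allow CPUs to enter dynticks-idle state more quickly. On the other hand, this option increases the overhead of the dynticks-idle checking, particularly on systems with large numbers of CPUs.
You seem to be affected by the emphasized part: the extra overhead on systems with large numbers of CPUs.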
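For reference, the number at the end of the message is a hexadecimal bitmask of the softirqs that were still pending when the CPU tried to stop its tick. A minimal sketch for decoding it, assuming the softirq numbering from include/linux/interrupt.h in a 4.4 kernel (the function name `decode_softirq_pending` is hypothetical):

```python
from typing import List

# Softirq numbering assumed from include/linux/interrupt.h (4.4 kernel);
# list index equals bit position in the pending mask.
SOFTIRQ_NAMES = [
    "HI", "TIMER", "NET_TX", "NET_RX", "BLOCK",
    "BLOCK_IOPOLL", "TASKLET", "SCHED", "HRTIMER", "RCU",
]


def decode_softirq_pending(hex_mask: str) -> List[str]:
    """Translate the hex mask from the NOHZ message into softirq names."""
    mask = int(hex_mask, 16)
    return [name for bit, name in enumerate(SOFTIRQ_NAMES) if mask & (1 << bit)]


if __name__ == "__main__":
    print(decode_softirq_pending("08"))
```

With the value from the log above, 08, only bit 3 is set, which in that numbering is NET_RX, the network receive softirq.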
Further reading:
https://askubuntu.com/questions/878030