我在dmesg中包含了以下垃圾邮件:
kernel:EDAC MC0: UE page 0x0, offset 0x0, grain 1073741824, row 3, labels ":": i3200 UE你知道什么是错的吗?
以下是加载的模块:
# lsmod | grep edac
i3200_edac 3330 0
edac_core 46581 2 i3200_edacedac-util不显示任何错误。
# edac-util -v
mc0: 0 Uncorrected Errors with no DIMM info
mc0: 0 Corrected Errors with no DIMM info
mc0: csrow0: 0 Uncorrected Errors
mc0: csrow0: ch0: 0 Corrected Errors
mc0: csrow0: ch1: 0 Corrected Errors
mc0: csrow1: 0 Uncorrected Errors
mc0: csrow1: ch0: 0 Corrected Errors
mc0: csrow1: ch1: 0 Corrected Errors
mc0: csrow2: 0 Uncorrected Errors
mc0: csrow2: ch0: 0 Corrected Errors
mc0: csrow2: ch1: 0 Corrected Errors
mc0: csrow3: 0 Uncorrected Errors
mc0: csrow3: ch0: 0 Corrected Errors
mc0: csrow3: ch1: 0 Corrected Errors
mc0: csrow4: 0 Uncorrected Errors
mc0: csrow4: ch0: 0 Corrected Errors
mc0: csrow4: ch1: 0 Corrected Errors
mc0: csrow5: 0 Uncorrected Errors
mc0: csrow5: ch0: 0 Corrected Errors
mc0: csrow5: ch1: 0 Corrected Errors
mc0: csrow6: 0 Uncorrected Errors
mc0: csrow6: ch0: 0 Corrected Errors
mc0: csrow6: ch1: 0 Corrected Errors
mc0: csrow7: 0 Uncorrected Errors
mc0: csrow7: ch0: 0 Corrected Errors
mc0: csrow7: ch1: 0 Corrected Errors发布于 2014-07-30 12:39:17
这似乎是一个内存错误,但不是致命的错误。
echo 0 > /sys/module/edac_core/parameters/edac_mc_log_ce 将防止控制台上的垃圾邮件,直到下次重新启动为止。
基本上,ce_errors是可纠正错误的缩写(也就是在ram之外没有“缺陷”)。
有关更多细节,请参见关于edac的内核文档和edac维基。
虽然我可能完全错了,但我们有一个服务器(ECC RAM),并且由于没有不可纠正的错误,而且memdisk没有显示任何问题--我让它在同一个内存中运行,更改输出,开始监视不可更正的错误,并且我们没有进一步的问题。
https://serverfault.com/questions/616599
复制相似问题