在当前状态下,我有一个ZFS池:
[root@SERVER-abc ~]# zpool status -v DATAPOOL
pool: DATAPOOL
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-8A
scan: resilvered 18.5M in 00:00:01 with 0 errors on Wed Jan 5 19:10:50 2022
config:`
NAME STATE READ WRITE CKSUM
DATAPOOL DEGRADED 0 0 0
raidz2-0 DEGRADED 0 0 0
gptid/14c707c6-f16c-11e8-b117-0cc47a2ba44e DEGRADED 0 0 17 too many errors
spare-1 ONLINE 0 0 17
gptid/168342c5-f16c-11e8-b117-0cc47a2ba44e ONLINE 0 0 0
gptid/1bfaa607-f16c-11e8-b117-0cc47a2ba44e ONLINE 0 0 0
gptid/1875501a-f16c-11e8-b117-0cc47a2ba44e ONLINE 0 0 30
gptid/1a16d37c-f16c-11e8-b117-0cc47a2ba44e ONLINE 0 0 29
spares
gptid/1bfaa607-f16c-11e8-b117-0cc47a2ba44e INUSE currently in use
errors: Permanent errors have been detected in the following files:
DATAPOOL/VMS/ubuntu_1804_LTS_ustrich-m6i87@auto-2022-01-04_11-41:<0x1>
<0x1080a>:<0x1>
<0x182a>:<0x1>
DATAPOOL/VMS/ubuntu_1804_LTS_ustrich-m6i87:<0x1>
<0x16fa>:<0x1>这是一个带有4+1备用驱动器的zpool。发生了一些事情,突然备用机与另一个驱动器自动配对为备用-1。
这对我来说是意外的,因为:
目标是拯救池,而不必从备份中获取大量数据,但在核心方面,我想了解发生了什么以及原因。以及如何像“最佳实践”那样处理这些情况。
坦克一堆!)
系统为: SuperMicro,TrueNAS-12.0-U4.1,zfs-2.0.4-3
编辑:将输出从zpool状态-x更改为zpool状态-v DATAPOOL
Edit2:到目前为止,我认为第一个168342c5似乎有一个错误,备用(1bfaa607)跳入其中。之后,14c707c6也降解了。
Edit3,附加问题:由于所有驱动器(备用1中的驱动器除外)似乎都有CKSUM错误--这说明了什么?电缆?HBA?所有的驱动器都同时失效了?
最新更新,在zpool clear和zpool scrub DATAPOOL之后,似乎很明显,已经发生了很多事情,没有办法拯救池:
pool: DATAPOOL
state: DEGRADED
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Thu Jan 6 16:18:05 2022
1.82T scanned at 1.55G/s, 204G issued at 174M/s, 7.82T total
40.8G resilvered, 2.55% done, 12:44:33 to go
config:
NAME STATE READ WRITE CKSUM
DATAPOOL DEGRADED 0 0 0
raidz2-0 DEGRADED 0 0 0
gptid/14c707c6-f16c-11e8-b117-0cc47a2ba44e DEGRADED 0 0 156 too many errors
spare-1 DEGRADED 0 0 0
gptid/168342c5-f16c-11e8-b117-0cc47a2ba44e DEGRADED 0 0 236 too many errors
gptid/1bfaa607-f16c-11e8-b117-0cc47a2ba44e ONLINE 0 0 0 (resilvering)
gptid/1875501a-f16c-11e8-b117-0cc47a2ba44e DEGRADED 0 0 182 too many errors
gptid/1a16d37c-f16c-11e8-b117-0cc47a2ba44e DEGRADED 0 0 179 too many errors
spares
gptid/1bfaa607-f16c-11e8-b117-0cc47a2ba44e INUSE currently in use我现在要检查所有的智能数据。
发布于 2022-01-06 10:40:56
这是一个四盘RAIDZ2吗?
您选择了这个布局而不是ZFS镜像吗?
你能显示zpool status -v的输出吗?
也请运行一个zpool clear并跟踪结果/进度。
https://serverfault.com/questions/1089146
复制相似问题