测试积点老人 发表于 2022-10-28 11:17:06

8805RAID 卡 RAID5 做 xfs 本地文件系统,业务压力稍大时会偶现 IO 归零,有时还...

现象:
业务过程中,RAID 的 IO 会偶现归零的情况
报错打印如下:
2022-10-26T13:48:02.114399+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,0,0)
2022-10-26T13:48:02.114510+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 07 71 a8 00 00 00 02 00 00 00
2022-10-26T13:48:02.115882+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(17) = ffff9600a1d40cc0 : 35808a40 Command = 503 XferState = 830ad Wait Time = 63 Sec
2022-10-26T13:48:02.118834+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,1,0)
2022-10-26T13:48:02.118946+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 04 5e 9a 00 00 00 02 00 00 00
2022-10-26T13:48:02.121773+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(260) = ffff9600a1d4c300 : 358840a0 Command = 503 XferState = 830ad Wait Time = 62 Sec
2022-10-26T13:48:02.121884+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,1,0)
2022-10-26T13:48:02.123269+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 04 5e 98 00 00 00 02 00 00 00
2022-10-26T13:48:02.126204+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(259) = ffff9600a1d4c240 : 35883880 Command = 503 XferState = 830ad Wait Time = 62 Sec
2022-10-26T13:48:02.126331+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,1,0)
2022-10-26T13:48:02.127660+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 04 5e 96 00 00 00 02 00 00 00
2022-10-26T13:48:02.130572+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(144) = ffff9600a1d46c00 : 35849220 Command = 503 XferState = 830ad Wait Time = 62 Sec
2022-10-26T13:48:02.130710+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,1,0)
2022-10-26T13:48:02.133508+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 06 6f d8 00 00 00 02 00 00 00
2022-10-26T13:48:02.133642+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(4) = ffff9600a1d40300 : 358020a0 Command = 503 XferState = 830ad Wait Time = 62 Sec
2022-10-26T13:48:02.136450+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Host adapter abort request (11,0,1,0)
2022-10-26T13:48:02.136592+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_eh_abort:Timed out Command: 8a 00 00 00 00 00 1d f3 02 00 00 00 02 00 00 00
2022-10-26T13:48:02.139355+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_fib_debug_print:FIB(139) = ffff9600a1d46840 : 35846980 Command = 503 XferState = 830ad Wait Time = 63 Sec
2022-10-26T13:48:02.149197+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_dev_reset:Host device reset request (11,0,0,0)
2022-10-26T13:48:02.149299+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_print_command_queue_states:outstanding cmnd: midlevel 0, lowlevel 0, error handler 0, firmware 0, kernel 0
2022-10-26T13:48:02.149385+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_dev_reset:Host device reset request (11,0,1,0)
2022-10-26T13:48:02.152169+08:00 node01 kernel: aacraid 0000:82:00.0: AAC1:aac_print_command_queue_states:outstanding cmnd: midlevel 0, lowlevel 0, error handler 0, firmware 0, kernel 0有没有见过的大佬,这是 RAID 卡坏了吗

bellas 发表于 2022-10-31 10:01:21

等大神

jingzizx 发表于 2022-10-31 17:57:21

测试一下读写
页: [1]
查看完整版本: 8805RAID 卡 RAID5 做 xfs 本地文件系统,业务压力稍大时会偶现 IO 归零,有时还...