硬盘配置为12LFF背板(0302A6VS),P460卡接两块F0/F1两块SATA SSD,F8-F11为4块NVMe。
客户批量报修10台左右的R5300 G6服务器SEL中有硬盘线缆告警:Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board,实际使用没有异常。
1、客户提供其中两台日志,发现在11月15日 9:40左右升级HDM版本后出现的Incorrect SATA cable connection告警。怀疑为软件误报。
告警信息:
1821 Info NA NA NA From BMC 2024-11-15 09:41:33 UTC+08:00 Reboot Cause: [BMC] [warm reset] BMC occurred warm reset because of updating BMC. 2024-11-15 09:40:41
1830 Minor Cable / Interconnect Cable Asserted From BMC 2024-11-15 09:41:43 UTC+08:00 Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board
1877 Info NA NA NA From BMC 2024-11-15 09:49:24 UTC+08:00 Reboot Cause: [BMC][cold reset] BMC occurred cold reset because of resetting BMC. 2024-11-15 09:48:44 UTC+8
1883 Minor Cable / Interconnect Cable Asserted From BMC 2024-11-15 09:49:30 UTC+08:00 Configuration Error-Incorrect cable connected / Incorrect interconnection---Incorrect SATA cable connection to the system board
升级固件记录:
%# 2024-11-15 09:39:49.771 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Update space preparation succeeded.
%# 2024-11-15 09:39:54.845 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update verification command successfully.
%# 2024-11-15 09:40:07.254 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update command successfully.
%# 2024-11-15 09:40:10.418 UTC+08:00 HDM210235A4GP********1S [BMC.update] 2667 [SUCCESS]: [root][redfish][10.x.x.x] Issued upgrade configuration. Module: HDM; Conf: Retain(Primary).
%# 2024-11-15 09:41:45.299 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: HDM; Location: Primary; Model: R5300 G6; Version: 2.03 -> 2.08; Update result: Succeeded.
%# 2024-11-15 09:51:41.206 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Update space preparation succeeded.
%# 2024-11-15 09:51:41.711 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update verification command successfully.
%# 2024-11-15 09:51:48.275 UTC+08:00 HDM210235A4GP********1S [BMC.update] 517 [SUCCESS]: [root][redfish][10.x.x.x] Issued the update command successfully.
%# 2024-11-15 09:51:49.783 UTC+08:00 HDM210235A4GP********1S [BMC.update] 3642 [SUCCESS]: [root][redfish][10.x.x.x] Issued upgrade configuration. Module: BIOS; Conf: Retain(BIOS and ME).
%# 2024-11-15 10:13:10.568 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: BIOS; Location: BIOS; Model: R5300 G6; Version: 6.00.25 -> 6.10.40; Update result: Succeeded.
%# 2024-11-15 10:13:10.593 UTC+08:00 HDM210235A4GP********1S [BMC.update] 710 [SUCCESS]: [root][redfish][10.x.x.x] Module: BIOS; Location: ME; Model: R5300 G6; Version: 6.00.25 -> 6.10.40; Update result: Succeeded.
2、cpld版本是V005,故障版本是V004;在V005版本说明书上有解决该问题。触发条件:特定机型配置才会有报错;004逻辑循环检测存在概率误报,线缆告警检测日志只会在BMC刚启动时上报一次,随后就不再重复上报。现场升级BMC后会重启HDM,这时候上报了报警 。
升级CPLD版本至V005及以上。
该案例暂时没有网友评论
✖
案例意见反馈
亲~登录后才可以操作哦!
确定你的邮箱还未认证,请认证邮箱或绑定手机后进行当前操作