An H3C S10508X-V core switch had a fault on fan FAN0. The service boards overheated past the threshold and were powered off automatically. After the fan module was replaced, the board faults still have not cleared. How should this be handled, and how can the service boards be brought back to a normal state?
Slave DEV/2/BOARD_STATE_FAULT: Board state changed to Fault on slot 12, type is LSUM1FAB08XE0.
%Apr 1 09:29:15:230 2026 Slave DEV/2/TEMPERATURE_SHUTDOWN: -Slot-14; Temperature is greater than the high-temperature shutdown threshold on slot 14 sensor hotspot 1. The slot will be powered off automatically.
%Apr 1 09:29:19:755 2026 Slave DRVPLAT/2/DrvDebug: Warning: slot 11 temperature is too high, power off it. please check it right now.
%Apr 1 09:29:19:755 2026 Slave DRVPLAT/4/DrvDebug: hotspot 1 in slot 11 temperature(109) is too high and the board will be shutdown.
%Apr 1 09:29:19:761 2026 Slave DEV/2/BOARD_STATE_FAULT: Board state changed to Fault on slot 11, type is LSUM1FAB08XE0.
%Apr 1 09:29:19:991 2026 Slave DEV/3/FAN_ABSENT: Fan 0 is absent.
%Apr 1 09:29:24:338 2026 Slave SHELL/6/SHELL_CMD: -Line=vty0-IPAddr=10.208.10.6-user=admin; Command is dis ip int brief
%Apr 1 09:29:28:878 2026 Slave DRVPLAT/2/DrvDebug: Warning: slot 14 temperature is too high, power off it. please check it right now.
%Apr 1 09:29:28:878 2026 Slave DRVPLAT/4/DrvDebug: hotspot 1 in slot 14 temperature(109) is too high and the board will be shutdown.
%Apr 1 09:29:28:885 2026 Slave DEV/2/BOARD_STATE_FAULT: Board state changed to Fault on slot 14, type is LSUM1FAB08XE0.
%Apr 1 09:29:43:366 2026 Slave DEV/2/TEMPERATURE_SHUTDOWN: -Slot-10; Temperature is greater than the high-temperature shutdown threshold on slot 10 sensor hotspot 1. The slot will be powered off automatically.
%Apr 1 09:29:55:080 2026 Slave DRVPLAT/2/DrvDebug: Warning: slot 10 temperature is too high, power off it. please check it right now.
%Apr 1 09:29:35:191 2026 Slave DEV/3/FAN_ABSENT: Fan 1 is absent.
%Apr 1 09:29:55:080 2026 Slave DRVPLAT/4/DrvDebug: hotspo
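According to the logs, fans FAN0 and FAN1 on the S10508X-V were absent, so several slots (10, 11, 12 and 14) overheated and triggered the automatic power-off protection. Replacing the fan module does not power the boards back on by itself; they have to be recovered manually.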
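To see at a glance which slots and fans are involved, the log can be scanned with a few regular expressions. The following is only a rough Python sketch: the function name `summarize` and the sample `LOG` text are made up for illustration, and the patterns are written against the message wording quoted in the question, not against any official log format definition.

```python
import re

# Sample lines taken from the log in the question (OCR damage repaired).
LOG = """\
%Apr 1 09:29:15:230 2026 Slave DEV/2/TEMPERATURE_SHUTDOWN: -Slot-14; Temperature is greater than the high-temperature shutdown threshold on slot 14 sensor hotspot 1. The slot will be powered off automatically.
%Apr 1 09:29:19:755 2026 Slave DRVPLAT/4/DrvDebug: hotspot 1 in slot 11 temperature(109) is too high and the board will be shutdown.
%Apr 1 09:29:19:761 2026 Slave DEV/2/BOARD_STATE_FAULT: Board state changed to Fault on slot 11, type is LSUM1FAB08XE0.
%Apr 1 09:29:19:991 2026 Slave DEV/3/FAN_ABSENT: Fan 0 is absent.
"""

# Patterns follow the wording of the messages quoted in the post (an assumption).
FAULT_RE = re.compile(r"BOARD_STATE_FAULT: Board state changed to Fault on slot (\d+)")
SHUTDOWN_RE = re.compile(r"TEMPERATURE_SHUTDOWN:.*?threshold on slot (\d+)")
HOT_RE = re.compile(r"in slot (\d+) temperature\((\d+)\)")
FAN_RE = re.compile(r"FAN_ABSENT: Fan (\d+) is absent")


def summarize(log_text):
    """Return (affected slots, hottest reading per slot, absent fans)."""
    slots, temps, fans = set(), {}, set()
    for line in log_text.splitlines():
        for pattern in (FAULT_RE, SHUTDOWN_RE):
            m = pattern.search(line)
            if m:
                slots.add(int(m.group(1)))
        m = HOT_RE.search(line)
        if m:
            slot, temp = int(m.group(1)), int(m.group(2))
            temps[slot] = max(temps.get(slot, 0), temp)
        m = FAN_RE.search(line)
        if m:
            fans.add(int(m.group(1)))
    return slots, temps, fans


if __name__ == "__main__":
    slots, temps, fans = summarize(LOG)
    print("affected slots:", sorted(slots))
    print("hottest readings:", temps)
    print("absent fans:", sorted(fans))
```

Run against the full log, this should give the same slot list as the summary above, which makes it easier to check that no faulted board is missed during recovery.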
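First log in to the switch (on the main processing unit; if an active/standby switchover has taken place, confirm which one is currently active) and run:
display fan
Confirm that every fan reports Normal. If any fan still shows Absent or Fault, keep checking whether the fan module is fully seated or is itself faulty. Then run:
display environment
Check whether the temperature of each slot has dropped back into the normal range (usually below 80℃). If the temperature is still too high (such as the 109℃ in the log), the airflow path may still be blocked or the cabinet may have a cooling problem. Fix the ambient temperature problem first; otherwise a board that is forced back on may overheat and power off again.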
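The two checks above amount to a simple gate before any board is powered on. Here is a minimal Python sketch of that gate. The function name `safe_to_power_on` and its inputs are hypothetical: the fan states and temperatures are assumed to have been read off the `display fan` and `display environment` output by hand, and the 80℃ limit is just the rule of thumb from this answer, not a value from the device's configured thresholds.

```python
def safe_to_power_on(fan_states, slot_temps_c, limit_c=80):
    """Return (ok, reasons) for the pre-power-on check.

    fan_states   -- dict of fan id -> state string, e.g. {0: "Normal"}
    slot_temps_c -- dict of slot number -> temperature in Celsius
    limit_c      -- rule-of-thumb upper bound (assumed, not device config)
    """
    reasons = []
    for fan, state in sorted(fan_states.items()):
        if state != "Normal":
            reasons.append(f"fan {fan} is {state}")
    for slot, temp in sorted(slot_temps_c.items()):
        if temp >= limit_c:
            reasons.append(f"slot {slot} is at {temp} C (limit {limit_c} C)")
    return (not reasons, reasons)


if __name__ == "__main__":
    ok, reasons = safe_to_power_on({0: "Normal", 1: "Absent"}, {10: 45, 14: 109})
    print("safe to power on:", ok)
    for reason in reasons:
        print(" -", reason)
```

Only when the function returns no reasons would it make sense to go on to the recovery steps.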
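Once the temperatures are back to normal, the boards still will not power on automatically; this has to be done manually.
Method 1: recover the board from the command line
Then run display device or display device slot to check whether the board state has changed to Normal.
Method 2: if the board still shows Fault, try powering it off and then on again
Afterwards, confirm that the board state has changed from Fault to Normal. If the state shows Master or Standby as normal, services should recover gradually. Keep monitoring the temperature and fan status to make sure no new alarms appear.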
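If a board never recovers (it keeps showing Fault), the prolonged high temperature may have damaged the hardware, and the board needs to be replaced.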
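The log shows Fan 0 is absent / Fan 1 is absent, with the boards in the Fault state. Fan absent means the switch does not detect the fan module, so check the fans first:
display fan
If any fan shows Absent / Abnormal, cooling has not been restored at all, and the boards will inevitably keep going into high-temperature protection.
# Assuming the faulty slots are 10, 11, 12 and 14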
slot 10 offline
slot 11 offline
slot 12 offline
slot 14 offline
# Wait 1 minute
slot 10 online
slot 11 online
slot 12 online
slot 14 online
display environment
display temperature all
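With several faulted slots, it is easy to mistype or skip one. The sequence above can be generated from a slot list, as in this small Python sketch. The helper name `power_cycle_commands` is hypothetical, and the `slot N offline` / `slot N online` strings are copied as written in this reply; they have not been checked against the command reference for this device and software version, so treat the output as a draft to review, not something to paste blindly.

```python
def power_cycle_commands(slots, wait_note="# Wait 1 minute"):
    """Build the offline / wait / online sequence for the given slots.

    The command strings mirror the ones quoted in the reply above
    (unverified against the device's command reference).
    """
    ordered = sorted(set(slots))  # de-duplicate and keep a stable order
    commands = [f"slot {s} offline" for s in ordered]
    commands.append(wait_note)
    commands.extend(f"slot {s} online" for s in ordered)
    return commands


if __name__ == "__main__":
    for line in power_cycle_commands([10, 11, 12, 14]):
        print(line)
```

Taking all boards offline first and bringing them online only after the pause keeps the same ordering as the reply.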
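If the Slave entries are abnormal as well, the standby engine is also raising high-temperature alarms: the airflow through the whole chassis is blocked, and this is not a problem with a single board.
Once display fan shows all fans as Normal and display temperature all shows all readings as normal, run:
slot x offline
slot x online
display device
The board only counts as recovered when its state goes from Fault to Normal.
display fan: make sure no fan is absent
display temperature: confirm the temperature has dropped to within 70℃
offline + online
display device: watch whether the board returns to Normal
display fan
display device
display temperature all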