Print

某据点S10506X设备整机重启

2025-12-24 发表

问题描述

设备发生warm reboot

过程分析

高端内存的数据都被清空了,包括只有1条启动记录,from-highmemory里的数据都没了。

现象为掉电重启情况,掉电的重启的是硬件感知记录的,记录到nvram中,然后软件启动的时候去读取,Warm reboot是未知原因重启,是软件感知记录的,由于电压波动,在掉电瞬间,软件可能先运行异常,记录了这个原因。因此,掉电重启也可能被记录为warm reboot,需要诊断中判断高级内存或异常重启记录是否有相关记录判断

 

  ===============printk log buffer info =============== 

[    0.000000] 0:---------- secondary log buffer [1] ----------

  ===============printk log buffer info on slot 7=============== 

[    0.000000] 0:---------- secondary log buffer [1] ----------

……

 

  ====local logbuffer chassis 0 slot 0 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 2 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 6 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 7 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 8 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 9 display from-highmemory==== 

  ====local logbuffer chassis 0 slot 10 display from-highmemory==== 

 

==== display hardware internal port peak-rate slot 0 ==== 

Port Ten-GigabitEthernet0/0/1:

Valid=0, has no record!

Port Ten-GigabitEthernet0/0/2:

Valid=0, has no record!

Port Ten-GigabitEthernet0/0/3:

Valid=0, has no record!

Port Ten-GigabitEthernet0/0/4:

Valid=0, has no record!

Port Ten-GigabitEthernet0/0/5:

Valid=0, has no record!

解决方法

建议改多路外部供电运行观察