近期业务有些卡,开log不时的能看到内存报警情况。
该如何排查问题?
DEVICE_NAME : S7003X
Software Ver : S7000X-7748P01
%Dec 24 09:42:40:220 2025 C SSHS/6/SSHS_DISCONNECT: SSH user D*** (IP: ***) disconnected from the server.
%Dec 24 09:42:42:930 2025 C DIAG/4/MEM_ALERT:
system memory info:
total used free shared buffers cached
Mem: 1946124 1635244 310880 0 68 202312
-/+ buffers/cache: 1432864 513260
Swap: 0 0 0
%Dec 24 09:42:44:395 2025 C SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is undo debugging all
%Dec 24 09:42:45:681 2025 C DIAG/1/MEM_EXCEED_THRESHOLD: Memory early-warning threshold has been exceeded.
Memory statistics are measured in KB:
Total Free FreeRatio
Mem: 1946124 298496 15%
Free-memory thresholds:
Minor: 10%
Severe: 7%
Critical: 4%
Normal: 13%
Early-warning: 16%
Secure: 18%
Process info(KB):
JID Used Name
8762 591856 ifmgr
8784 249840 laggd
8837 231228 ipstackd
8780 230540 aaad
8741 222380 syslogd
Slub info(KB):
Used Name
100116 osal-128
70414 osal-32
51884 osal-64
47376 osal-4092
29696 osal-1048572
%Dec 24 09:42:46:791 2025 C DIAG/1/MEM_BELOW_THRESHOLD: Memory usage has dropped below early-warning threshold.
[C]dis memory summary
Memory statistics are measured in KB:
Slot CPU Total Used Free Buffers Caches FreeRatio
0 0 1946124 1580856 365268 48 201792 19.5%
1 0 1946124 1532888 413236 48 190440 21.4%
2 0 919612 507664 411948 24 40672 45.1%
[C]dis process memory
JID Text Data Stack Dynamic Name
1 116 8532 36 200 scmd
2 0 0 0 0 [kthreadd]
3 0 0 0 0 [ksoftirqd/0]
5 0 0 0 0 [kworker/0:0H]
7 0 0 0 0 [rcu_sched]
8 0 0 0 0 [rcu_bh]
9 0 0 0 0 [migration/0]
10 0 0 0 0 [migration/1]
11 0 0 0 0 [ksoftirqd/1]
13 0 0 0 0 [kworker/1:0H]
15 0 0 0 0 [perf]
177 0 0 0 0 [writeback]
178 0 0 0 0 [ksmd]
180 0 0 0 0 [crypto]
181 0 0 0 0 [bioset]
183 0 0 0 0 [kblockd]
209 0 0 0 0 [edac-poller]
288 0 0 0 0 [kworker/0:1]
300 0 0 0 0 [kswapd0]
314 0 0 0 0 [kworker/1:1]
315 0 0 0 0 [vmstat]
390 0 0 0 0 [fsnotify_mark]
1743 0 0 0 0 [bioset]
1761 0 0 0 0 [bioset]
1808 0 0 0 0 [bioset]
1858 0 0 0 0 [bioset]
1916 0 0 0 0 [bioset]
1932 0 0 0 0 [bioset]
1958 0 0 0 0 [bioset]
2001 0 0 0 0 [bioset]
2610 0 0 0 0 [deferwq]
3507 0 0 0 0 [watchdog/0]
3536 0 0 0 0 [kworker/u4:0]
3552 0 0 0 0 [watchdog/1]
3578 0 0 0 0 [kstarvationtask]
3825 0 0 0 0 [irq/17-mmc0]
3920 0 0 0 0 [bioset]
3923 0 0 0 0 [spi32766]
3939 0 0 0 0 [bioset]
3940 0 0 0 0 [mmcqd/0]
3947 0 0 0 0 [bioset]
3949 0 0 0 0 [bioset]
3961 0 0 0 0 [mmcqd/0boot0]
3977 0 0 0 0 [bioset]
3996 0 0 0 0 [mmcqd/0boot1]
4008 0 0 0 0 [bioset]
4031 0 0 0 0 [mmcqd/0rpmb]
4340 0 0 0 0 [kworker/1:1H]
4344 0 0 0 0 [kworker/0:1H]
4403 0 0 0 0 [TMTH]
4404 0 0 0 0 [dGDB]
4405 0 0 0 0 [ctcTimer]
4432 0 0 0 0 [ctclnkMon-0]
4433 0 0 0 0 [dma_async_tx-0]
4434 0 0 0 0 [ctcPktRx0-0]
4435 0 0 0 0 [ctcPktRx1-0]
4436 0 0 0 0 [ctcPktRx2-0]
4437 0 0 0 0 [DmaStats1-0]
4438 0 0 0 0 [DmaInfo0-0]
4439 0 0 0 0 [DmaInfo2-0]
4440 0 0 0 0 [DmaInfo3-0]
4441 0 0 0 0 [dal_intr0]
4442 0 0 0 0 [RECV]
4443 0 0 0 0 [DIPC]
4444 0 0 0 0 [TXAT]
4445 0 0 0 0 [DVP]
4446 0 0 0 0 [DrvDiag]
4447 0 0 0 0 [IBC_TEST]
4448 0 0 0 0 [IBCEVENT]
4449 0 0 0 0 [DDEV]
4450 0 0 0 0 [DTIM]
4451 0 0 0 0 [DSNC]
4452 0 0 0 0 [DSYN]
4453 0 0 0 0 [MNET]
4454 0 0 0 0 [DPFT]
4455 0 0 0 0 [MDCT2]
4456 0 0 0 0 [MDCT]
4457 0 0 0 0 [LinkDelay]
4458 0 0 0 0 [LinkScan]
4459 0 0 0 0 [RrppLos]
4460 0 0 0 0 [Linkintr]
4461 0 0 0 0 [STAT]
4462 0 0 0 0 [FMSC]
4463 0 0 0 0 [DSWT]
4464 0 0 0 0 [DFWK]
4465 0 0 0 0 [RSF0]
4466 0 0 0 0 [RSF1]
4467 0 0 0 0 [RSF2]
4469 0 0 0 0 [BMDT]
4470 0 0 0 0 [DQIT]
4471 0 0 0 0 [ctctod]
4472 0 0 0 0 [PTPR]
4473 0 0 0 0 [TTOD]
4474 0 0 0 0 [QINQ]
4475 0 0 0 0 [L2AU/0]
4476 0 0 0 0 [L2HC/0]
4477 0 0 0 0 [L2SY/0]
4478 0 0 0 0 [L2DM/0]
4479 0 0 0 0 [L2US/0]
4480 0 0 0 0 [L2DRNI/0]
4481 0 0 0 0 [L2VV/0]
4482 0 0 0 0 [MacNotify/0]
4483 0 0 0 0 [DIPUC]
4484 0 0 0 0 [DROPUNKNOWN]
4485 0 0 0 0 [DSIPMC]
4486 0 0 0 0 [DTNL]
4487 0 0 0 0 [DBFD]
4488 0 0 0 0 [GLD1]
4489 0 0 0 0 [GLD2]
4490 0 0 0 0 [PCHK]
4491 0 0 0 0 [DAct]
4492 0 0 0 0 [DIAG]
4493 0 0 0 0 [DRSM]
4538 0 0 0 0 [lipc_topology]
4926 0 0 0 0 [LOAD]
4927 0 0 0 0 [LOADProc]
4928 0 0 0 0 [kdlipc]
4929 0 0 0 0 [krpc_event]
4930 0 0 0 0 [krpc_serv]
4931 0 0 0 0 [timesyncs]
4932 0 0 0 0 [lipc_portevt]
4933 0 0 0 0 [kevent]
4939 0 0 0 0 [kifupdown]
4942 0 0 0 0 [kifbufmon]
4949 0 0 0 0 [kiftcb]
4957 0 0 0 0 [L2BC/1]
4965 0 0 0 0 [kmac/1]
4983 0 0 0 0 [kipfs/1]
4993 0 0 0 0 [kip6fs/1]
7661 0 0 0 0 [mbuf_main]
7669 0 0 0 0 [rlink/1]
7686 0 0 0 0 [sock/1]
8260 156 8624 20 160 ntpd
8262 104 8456 16 36 dnsd
8290 0 0 0 0 [kworker/1:2]
8291 0 0 0 0 [kworker/u4:1]
8433 0 0 0 0 [kfcfib/1]
8473 0 0 0 0 [kifmon_th/1]
8729 0 0 0 0 [DMON]
8731 8 200 12 4 cioctld
8732 16 308 16 4 mdcagentd
8733 100 221392 24 268 fsd
8735 184 292 52 80 licd
8736 112 80320 20 4808 dbmd
8738 40 252 16 28 resmond
8739 104 83420 16 1504 diagd
8740 92 200 12 24 had
8741 168 221716 24 472 syslogd
8742 88 115508 8 548 diagaid
8743 212 221452 48 44 devd
8754 172 82200 20 32 edev
8757 48 292 16 12 licsynd
8758 40 260 16 72 cryptomgrd
8759 172 147788 16 72 mdcd
8762 36 590172 12 1636 ifmgr
8770 64 232 20 20 goldd
8771 24 148888 16 344 aaamngtd
8772 48 764 76 180 httpredrd
8773 12 8500 12 8 drvpdtd
8775 12904 23208 17 5269 comsh
8776 88 240 56 36 sysmand
8777 200 147928 68 276 lauthd
8779 76 8452 20 56 ttymgrd
8780 264 230000 16 264 aaad
8781 328 372 12 112 cfad
8784 492 248204 16 1128 laggd
8785 44 204 16 16 tranged
8786 312 82452 44 340 vland
8787 68 324 64 28 automount
8788 152 9100 12 712 ifmond
8795 752 20752 20 508 aclmgrd
8799 12 200 8 4 taed
8808 84 12440 24 688 qosd
8815 12 292 16 12 licxcvrd
8817 272 90800 12 412 ethd
8818 24 352 8 96 coppd
8826 40 336 16 24 wsald
8827 516 86124 160 7076 xmlcfgd
8831 56 224 20 20 hlthd
8832 652 22748 28 3312 dhcpd
8833 92 9300 24 360 dhcpcd
8834 248 416 16 160 dhcpspd
8835 20 200 16 44 hlthcase
8836 128 74064 12 124 ipcimd
8837 388 230312 20 508 ipstackd
8838 160 82160 8 80 l3vpnd
8839 360 82728 32 596 lldpd
8840 1384 41840 12 4704 ospfd
8841 40 30756 16 2408 routed
8842 580 159428 32 3388 snmpd
8843 180 4220 20 194 sshd
8844 520 91300 24 892 stpd
8846 108 312 12 20 telnetd
8847 108 312 16 8 telnetd
8856 168 12416 16 592 staticrtd
8858 0 0 0 0 [karp/1]
8861 0 0 0 0 [kwadj/1]
8862 0 0 0 0 [kfib/1]
8866 0 0 0 0 [loop0]
8873 0 0 0 0 [loop1]
8897 0 0 0 0 [loop2]
8906 0 0 0 0 [loop3]
8915 0 0 0 0 [loop4]
8930 0 0 0 0 [loop5]
8946 0 0 0 0 [loop6]
8960 0 0 0 0 [loop7]
8976 0 0 0 0 [bioset]
8981 0 0 0 0 [loop8]
9009 0 0 0 0 [bioset]
9021 0 0 0 0 [loop9]
9059 0 0 0 0 [bioset]
9069 0 0 0 0 [loop10]
9093 0 0 0 0 [bioset]
9103 0 0 0 0 [loop11]
9160 0 0 0 0 [bioset]
9173 0 0 0 0 [loop12]
9211 0 0 0 0 [bioset]
9214 0 0 0 0 [loop13]
9235 0 0 0 0 [bioset]
9242 0 0 0 0 [loop14]
9277 0 0 0 0 [bioset]
9286 0 0 0 0 [loop15]
9326 0 0 0 0 [bioset]
9331 0 0 0 0 [loop16]
9353 0 0 0 0 [bioset]
9364 0 0 0 0 [loop17]
9409 0 0 0 0 [bioset]
9424 0 0 0 0 [loop18]
9449 0 0 0 0 [bioset]
9454 0 0 0 0 [loop19]
9499 0 0 0 0 [bioset]
9508 0 0 0 0 [loop20]
9537 0 0 0 0 [bioset]
9544 0 0 0 0 [loop21]
9571 0 0 0 0 [bioset]
9581 0 0 0 0 [loop22]
9620 0 0 0 0 [bioset]
9626 0 0 0 0 [loop23]
9649 0 0 0 0 [bioset]
9654 0 0 0 0 [loop24]
9701 0 0 0 0 [bioset]
9708 0 0 0 0 [loop25]
9739 0 0 0 0 [bioset]
9744 0 0 0 0 [loop26]
9788 0 0 0 0 [bioset]
9813 0 0 0 0 [loop27]
9836 0 0 0 0 [bioset]
9841 0 0 0 0 [loop28]
9874 0 0 0 0 [bioset]
9885 0 0 0 0 [loop29]
9921 0 0 0 0 [bioset]
9927 0 0 0 0 [loop30]
9965 0 0 0 0 [bioset]
9971 0 0 0 0 [loop31]
10010 0 0 0 0 [bioset]
10013 0 0 0 0 [loop32]
10030 0 0 0 0 [bioset]
10062 0 0 0 0 [loop33]
10092 0 0 0 0 [bioset]
10097 0 0 0 0 [loop34]
10131 0 0 0 0 [bioset]
10139 0 0 0 0 [loop35]
10168 0 0 0 0 [bioset]
10176 0 0 0 0 [loop36]
10209 0 0 0 0 [bioset]
10216 0 0 0 0 [loop37]
10260 0 0 0 0 [bioset]
10266 0 0 0 0 [loop38]
10308 0 0 0 0 [bioset]
10311 0 0 0 0 [loop39]
10336 0 0 0 0 [bioset]
10341 0 0 0 0 [loop40]
10374 0 0 0 0 [bioset]
10377 0 0 0 0 [loop41]
10434 0 0 0 0 [bioset]
10437 0 0 0 0 [loop42]
10476 0 0 0 0 [bioset]
10479 0 0 0 0 [loop43]
10515 0 0 0 0 [bioset]
10521 0 0 0 0 [loop44]
10556 0 0 0 0 [bioset]
10561 0 0 0 0 [loop45]
10602 0 0 0 0 [bioset]
10605 0 0 0 0 [loop46]
10647 0 0 0 0 [bioset]
10650 0 0 0 0 [loop47]
10690 0 0 0 0 [bioset]
10695 0 0 0 0 [loop48]
10741 0 0 0 0 [bioset]
10745 0 0 0 0 [loop49]
10783 0 0 0 0 [bioset]
10787 0 0 0 0 [loop50]
10820 0 0 0 0 [bioset]
10826 0 0 0 0 [loop51]
10850 0 0 0 0 [bioset]
10855 0 0 0 0 [loop52]
10910 0 0 0 0 [bioset]
10914 0 0 0 0 [loop53]
10944 0 0 0 0 [bioset]
10952 0 0 0 0 [loop54]
10973 0 0 0 0 [bioset]
10977 0 0 0 0 [loop55]
11013 0 0 0 0 [bioset]
11024 0 0 0 0 [loop56]
11057 0 0 0 0 [bioset]
11060 0 0 0 0 [loop57]
11103 0 0 0 0 [bioset]
11109 0 0 0 0 [loop58]
11162 0 0 0 0 [bioset]
11167 0 0 0 0 [loop59]
11186 0 0 0 0 [bioset]
11189 0 0 0 0 [loop60]
11246 0 0 0 0 [bioset]
11249 0 0 0 0 [loop61]
11277 0 0 0 0 [bioset]
11283 0 0 0 0 [loop62]
11306 0 0 0 0 [bioset]
11312 0 0 0 0 [loop63]
11368 0 0 0 0 [bioset]
11375 0 0 0 0 [loop64]
11414 0 0 0 0 [bioset]
11417 0 0 0 0 [loop65]
11429 0 0 0 0 [bioset]
11438 0 0 0 0 [loop66]
11487 0 0 0 0 [bioset]
11497 0 0 0 0 [loop67]
11551 0 0 0 0 [MTMT]
11584 0 0 0 0 [krpc0_1_20607]
11619 0 0 0 0 [krpc0_1_21201]
15671 180 4252 32 406 sshd
15672 40 8512 36 92 login
15674 12 284 20 44 comshc
15675 12904 32424 28 4878 comsh
15774 12904 32424 26 4878 comsh
15775 12904 32492 26 4899 comsh
24797 0 0 0 0 [kworker/0:2]
28672 56 8424 20 20 sflowd
参考下这个案例
现场反馈一组堆叠的R7577P02版本的S7606X的LSUM1FAB06C3内存利用率较高,超过了百分之70,
Chassis 1 Slot 8:
Total Used Free Shared Buffers Cached FreeRatio
Mem: 949632 665580 284052 0 4 57628 30.4%
-/+ Buffers/Cache: 607948 341684
Swap: 0 0 0
Chassis 1 Slot 9:
Total Used Free Shared Buffers Cached FreeRatio
Mem: 949632 671136 278496 0 4 57628 29.8%
-/+ Buffers/Cache: 613504 336128
Swap: 0 0 0
Chassis 2 Slot 8:
Total Used Free Shared Buffers Cached FreeRatio
Mem: 949632 674144 275488 0 4 57632 29.5%
-/+ Buffers/Cache: 616508 333124
Swap: 0 0 0
Chassis 2 Slot 9:
Total Used Free Shared Buffers Cached FreeRatio
Mem: 949632 674352 275280 0 4 57632 29.5%
-/+ Buffers/Cache: 616716 332916
Swap: 0 0 0
网板LSUM1FAB06C内存太小只有1G,该网板空配置起来内存剩余就很小了,另外新版本由于特性增加,很多模块都有少量的占用增加,属于正常占用,加起来增加就多了,而单板内存又太小,导致单板剩余内存过少。
网板不需要下发mac、arp、路由等表项,不会影响业务。建议现场继续使用,如果内存出现告警,按照下面的方法调整告警阈值。
memory-threshold chassis 1 slot 4 minor 80 severe 64 critical 48 normal 96 early-warning 100 secure 140
如果设备已经进入early-warning状态(但依然处于normal状态),需要先关闭关闭预警检测 memory-threshold chassis 1 slot 4 minor 0 severe 0 critical 0 normal 0 early-warning 0 secure 0,然后再调整阀值
暂无评论
亲~登录后才可以操作哦!
确定你的邮箱还未认证,请认证邮箱或绑定手机后进行当前操作
举报
×
侵犯我的权益
×
侵犯了我企业的权益
×
抄袭了我的内容
×
原文链接或出处
诽谤我
×
对根叔社区有害的内容
×
不规范转载
×
举报说明
暂无评论