拓扑如下:
S6800侧接口配置:
#
interface Bridge-Aggregation11
description TO-MOD2-18-HWS6855
port link-type trunk
undo port trunk permit vlan 1
port trunk permit vlan 2001 3003
vtep access port
#
service-instance 2001
encapsulation s-vid 2001
xconnect vsi SDN_VSI_12001
#
service-instance 3003
encapsulation s-vid 3003
xconnect vsi SDN_VSI_13003
#
#
interface Eth-Trunk0
port link-type trunk
undo port trunk allow-pass vlan 1
port trunk allow-pass vlan 2001
#
===============dir /all /all-filesystems===============
Directory of flash:
0 -rw- 4845 Jun 18 2019 15:24:34 545_leaf.template
1 -rw- 135489536 Apr 17 2019 15:38:31 S6900F-CMW710-R2612P02.ipe
%@441758%Sep 2 14:48:40:378 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 CFGMAN/5/CFGMAN_CFGCHANGED: -EventIndex=10024-CommandSource=snmp-COnfigSource=startup-COnfigDestination=running; Configuration changed.
%@441759%Sep 2 14:48:43:838 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 VCF/6/VCF_DOWN_LINK: In phase 2.0.8, device with MAC address 7c1e-0627-38a2 discovered downlink interface FortyGigE1/0/52.
%@441760%Sep 2 14:48:44:410 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 VCF/6/VCF_DOWN_LINK: In phase 2.0.8, device with MAC address 7c1e-0627-38a2 discovered downlink interface FortyGigE1/0/51.
%@441761%Sep 2 14:48:48:841 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is system-view
%@441762%Sep 2 14:48:50:701 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is interface FortyGigE1/0/52
%@441763%Sep 2 14:48:50:892 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is system-view
%@441764%Sep 2 14:48:50:900 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is interface FortyGigE1/0/52
%@441765%Sep 2 14:48:50:905 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is undo lldp compliance admin-status cdp
%@441766%Sep 2 14:48:50:913 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is undo port trunk permit vlan 2001 3003
%@441767%Sep 2 14:48:50:952 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 SHELL/6/SHELL_CMD: -Line=-IPAddr=**-User=**; Command is port trunk permit vlan 1
1、
2、
设备上/var/log/fabric.log记录了设备的自动化日志(probe视图下执行 view /var/log/fabric.log > fbric.log 然后将flash下的fabric.log导出),通过日志可以看到故障时间点突然收到一个拓扑改变的事件,原因是发现对端设备系统名称改变了:
2019-09-02 14:48:43,545 topo.py[line:87] DEBUG [topology]--Notified--topology change {'FortyGigE1/0/52': {'add': [], 'del': [], 'mod': [{'PeerIP': None, 'Oldrole': 'computer', 'sysName': 'JM601-MOD2-18-U45-HWCE6855', 'sysdesc': '', 'Loopback1IP': '0.0.0.0', 'OldsysName': 'M2-18-U45-HWCE6855', 'mac': 'e0cc-7aec-bc21', 'PeerIntf': '40GE1/0/1', 'role': 'computer', 'LoopbackIP': '10.104.51.74', 'OldPeerIntf': '40GE1/0/1', 'OldPeerIP': None, 'OldLoopbackIP': '10.104.51.74', 'OldLoopback1IP': '0.0.0.0', 'Oldmac': 'e0cc-7aec-bc21', 'oldsysdesc': ''}]}}
2019-09-02 14:48:43,553 base.py[line:257] DEBUG [underlay]--Notified--topology change {'data': {u'FortyGigE1/0/52': {u'add': [], u'del': [], u'mod': [{u'PeerIP': None, u'Oldrole': u'computer', u'sysName': u'JM601-MOD2-18-U45-HWCE6855', u'sysdesc': u'', u'Loopback1IP': u'0.0.0.0', u'OldsysName': u'M2-18-U45-HWCE6855', u'mac': u'e0cc-7aec-bc21', u'PeerIntf': u'40GE1/0/1', u'role': u'computer', u'LoopbackIP': u'10.104.51.74', u'OldPeerIntf': u'40GE1/0/1', u'OldPeerIP': None, u'OldLoopbackIP': u'10.104.51.74', u'OldLoopback1IP': u'0.0.0.0', u'Oldmac': u'e0cc-7aec-bc21', u'oldsysdesc': u''}]}}, 'event': 9} module [<underlay.allocation.updown_link.UpDownLinkIF object at 0xffe8a87990>, <underlay.allocation.aggregation.AggrAllocation object at 0xffe8a87d10>, <underlay.allocation.irf.IRFAllocation object at 0xffe8a87cd0>, <underlay.underlay_route.bgp object at 0xffe8a87c10>]
2019-09-02 14:48:43,557 topo.py[line:87] DEBUG [topology]--Notified--topology change {'FortyGigE1/0/51': {'add': [], 'del': [], 'mod': [{'PeerIP': None, 'Oldrole': 'computer', 'sysName': 'JM601-MOD2-18-U45-HWCE6855', 'sysdesc': '', 'Loopback1IP': '0.0.0.0', 'OldsysName': 'M2-18-U45-HWCE6855', 'mac': 'e0cc-7aec-bc21', 'PeerIntf': '40GE2/0/1', 'role': 'computer', 'LoopbackIP': '10.104.51.74', 'OldPeerIntf': '40GE2/0/1', 'OldPeerIP': None, 'OldLoopbackIP': '10.104.51.74', 'OldLoopback1IP': '0.0.0.0', 'Oldmac': 'e0cc-7aec-bc21', 'oldsysdesc': ''}]}}
2019-09-02 14:48:43,559 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/52 ; return
和现场沟通确认,对端华为设备确实是在故障时间点做了sysname的修改操作。
经确认当前设备的自动化实现确实如此,当对端sysname变化的时候,认为聚合对端的设备发生了改变,所以会重新走自动聚合。(但实际上此处有待优化,因为修改系统名称是常常有的事,已提需求建议后续如果仅仅是sysname发生变化,不触发重新聚合过程)。
以下是设备成员口离开聚合和重新创建了聚合口2047的过程日志:
离开原有聚合组:
2019-09-02 14:48:43,687 command_executor.py[line:37] DEBUG Execute command: system-view ; interface Bridge-Aggregation11 ; port link-type trunk ; vtep access port
2019-09-02 14:48:43,845 base.py[line:36] DEBUG AggrAllocation <process_neighbor_mod:642>: Port FortyGigE1/0/52 process neighbor {u'PeerIP': None, u'Oldrole': u'computer', u'sysName': u'JM601-MOD2-18-U45-HWCE6855', u'sysdesc': u'', u'Loopback1IP': u'0.0.0.0', u'OldsysName': u'M2-18-U45-HWCE6855', u'mac': u'e0cc-7aec-bc21', u'PeerIntf': u'40GE1/0/1', u'role': u'computer', u'LoopbackIP': u'10.104.51.74', u'OldPeerIntf': u'40GE1/0/1', u'OldPeerIP': None, u'OldLoopbackIP': u'10.104.51.74', u'OldLoopback1IP': u'0.0.0.0', u'Oldmac': u'e0cc-7aec-bc21', u'oldsysdesc': u''} modify
2019-09-02 14:48:43,854 base.py[line:36] DEBUG AggrAllocation <process_neighbor_del:608>: Port FortyGigE1/0/52 process neighbor {u'PeerIP': None, u'Oldrole': u'computer', u'sysName': u'M2-18-U45-HWCE6855', u'sysdesc': u'', u'Loopback1IP': u'0.0.0.0', u'OldsysName': u'M2-18-U45-HWCE6855', u'mac': u'e0cc-7aec-bc21', u'PeerIntf': u'40GE1/0/1', u'role': u'computer', u'LoopbackIP': u'10.104.51.74', u'OldPeerIntf': u'40GE1/0/1', u'OldPeerIP': None, u'OldLoopbackIP': u'10.104.51.74', u'OldLoopback1IP': u'0.0.0.0', u'Oldmac': u'e0cc-7aec-bc21', u'oldsysdesc': u''} delete
2019-09-02 14:48:43,859 base.py[line:36] DEBUG AggrAllocation <process_delete_aggregation:584>: Remove member port FortyGigE1/0/52 from [u'FortyGigE5/0/52', u'FortyGigE5/0/51', u'FortyGigE1/0/51', u'FortyGigE1/0/52']
2019-09-02 14:48:43,863 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/52 ; undo port link-aggregation group
2019-09-02 14:48:44,068 base.py[line:36] DEBUG AggrAllocation <process_delete_aggregation:589>: Remove member FortyGigE1/0/52's config
重新加入新增聚合组:
%@441795%Sep 2 14:48:58:493 2019 XM-JM601-MOD2-45U-TOR-H3CS6900 VCF/6/VCF_AGGR_CREATE: In phase 2.0.5, device with MAC address 7c1e-0627-38a2 created aggregation group 2047. The member port list is FortyGigE1/0/52,FortyGigE1/0/51.
2019-09-02 14:48:45,156 command_executor.py[line:37] DEBUG Execute command: system ; interface bridge-aggregation 2047
2019-09-02 14:48:45,205 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/52 ; default
2019-09-02 14:48:51,039 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/52 ; lldp compliance admin-status cdp txrx ;
2019-09-02 14:48:51,093 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/52 ; port link-aggregation group 2047 force
2019-09-02 14:48:51,359 aggregation.py[line:248] DEBUG add_members result: ['system', 'System View: return to User View with Ctrl+Z.', '[XM-JM601-MOD2-45U-TOR-H3CS6900]', ' interface FortyGigE1/0/52', '[XM-JM601-MOD2-45U-TOR-H3CS6900-FortyGigE1/0/52]', ' port link-aggregation group 2047 force', '[XM-JM601-MOD2-45U-TOR-H3CS6900-FortyGigE1/0/52]']
2019-09-02 14:48:51,363 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/51 ; default
2019-09-02 14:48:57,185 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/51 ; lldp compliance admin-status cdp txrx ;
2019-09-02 14:48:57,238 command_executor.py[line:37] DEBUG Execute command: system ; interface FortyGigE1/0/51 ; port link-aggregation group 2047 force
但是第二个疑点,为什么重走了自动聚合过程,业务仍然无法恢复?看日志聚合2047的配置理论上是能够让服务器重新上线的:
2019-09-02 15:43:07,203 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; port link-type trunk
2019-09-02 15:43:07,349 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; undo port trunk permit vlan 1
2019-09-02 15:43:07,463 topo.py[line:87] DEBUG [topology]--Notified--topology change {'FortyGigE5/0/51': {'add': [{'PeerIP': None, 'sysName': 'JM601-MOD2-18-U45-HWCE6855', 'sysdesc': '', 'Loopback1IP': '0.0.0.0', 'mac': 'e0cc-7aec-bc21', 'PeerIntf': '40GE1/0/2', 'role': 'computer', 'LoopbackIP': '10.104.51.74'}], 'del': [], 'mod': []}}
2019-09-02 15:43:07,468 topo_collect.py[line:822] DEBUG ifIndex 703 peerChassisID e0cc-7aec-bc21 peerPortID 40GE1/0/2 lldpAgent 1 event 1
2019-09-02 15:43:07,498 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; port trunk permit vlan 2001 3003
2019-09-02 15:43:07,644 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; vtep access port
2019-09-02 15:43:07,706 aggregation.py[line:226] DEBUG do config ['port link-type trunk', 'undo port trunk permit vlan 1', 'port trunk permit vlan 2001 3003'] on aggr 2047
2019-09-02 15:43:07,711 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; port link-type trunk
2019-09-02 15:43:07,875 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; undo port trunk permit vlan 1
2019-09-02 15:43:08,023 command_executor.py[line:37] DEBUG Execute command: system ; interface Bridge-Aggregation2047 ; port trunk permit vlan 2001 3003
2019-09-02 15:43:08,172 command_executor.py[line:37] DEBUG Execute command: system ; interface bridge-aggregation 2047 ; link-aggregation mode dynamic
该案例暂时没有网友评论
✖
案例意见反馈
亲~登录后才可以操作哦!
确定你的邮箱还未认证,请认证邮箱或绑定手机后进行当前操作