• 全部
  • 经验案例
  • 典型配置
  • 技术公告
  • FAQ
  • 漏洞说明
  • 全部
  • 全部
  • 大数据引擎
  • 知了引擎
产品线
搜索
取消
案例类型
发布者
是否解决
是否官方
时间
搜索引擎
匹配模式
高级搜索

某局点S6850 聚合口协议闪断问题

  • 0关注
  • 1收藏 1988浏览
粉丝:3人 关注:1人

问题描述

两台S6850搭建DRNI,通过drbagg401和华为设备对接。现场10日出现bagg401闪断,具体查看对应时间的日志为成员口收到的lacp异常因此未选中。但是对端查看日志也是差不多的报错,两边都说对端发的有问题。由于只是之前的一次闪断,到今天也没有再次出现过,没法debug或者抓包看报文。

过程分析

1)对应端口并未物理down,只是协议lagg down,两台设备打印的报错还不一样,具体协议down的原因设备侧日志提示如下:

 

DR1:这台设备报的是无D标志位和key不对

 

%@6141%Oct 10 07:32:03:818 2022 DR1 LAGG/6/LAGG_INACTIVE_OPERSTATE: Member port HGE1/0/27 of aggregation group BAGG401 changed to the inactive state, because the peer port did not have the Synchronization flag.

%@6142%Oct 10 07:32:03:818 2022 DR1 LAGG/6/LAGG_INACTIVE_PARTNER_KEY_WRONG: Member port HGE1/0/28 of aggregation group BAGG401 changed to the inactive state, because the operational key of the peer port was different from that of the reference port.

%@6143%Oct 10 07:32:03:822 2022 DR1 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/27 changed to down.

%@6144%Oct 10 07:32:03:822 2022 DR1 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/28 changed to down.

 

%@6148%Oct 10 07:32:03:855 2022 DR1 IFNET/3/PHY_UPDOWN: Physical state on the interface Bridge-Aggregation401 changed to down.

%@6149%Oct 10 07:32:03:855 2022 DR1 IFNET/5/LINK_UPDOWN: Line protocol state on the interface Bridge-Aggregation401 changed to down.

 

其中27down的原因是对端发的lacp报文中没有携带可聚合的标志位;28down的原因为对端端口的操作key和参考端口不同。

 

DR2:这台提示的是

 

%@13479%Oct 10 07:32:03:835 2022 DR2 LAGG/6/LAGG_INACTIVE_OTHER: Member port HGE1/0/27 of aggregation group BAGG401 changed to the inactive state, because other reason.

%@13480%Oct 10 07:32:03:835 2022 DR2 LAGG/6/LAGG_INACTIVE_OTHER: Member port HGE1/0/28 of aggregation group BAGG401 changed to the inactive state, because other reason.

%@13481%Oct 10 07:32:03:837 2022 DR2 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/27 changed to down.

%@13482%Oct 10 07:32:03:838 2022 DR2 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/28 changed to down.

%@13483%Oct 10 07:32:03:842 2022 DR2 DRNI/6/DRNI_IFEVENT_DR_NOSELECTED: Local DR interface Bridge-Aggregation401 in DR group 401 does not have Selected member ports because the aggregate interface went down. Please check the aggregate link status.

%@13484%Oct 10 07:32:03:842 2022 DR2 DRNI/6/DRNI_IFEVENT_DR_GLOBALDOWN: The state of DR group 401 changed to down.

%@13485%Oct 10 07:32:03:880 2022 DR2 IFNET/3/PHY_UPDOWN: Physical state on the interface Bridge-Aggregation401 changed to down.

2)基于上述信息来看似乎是对端设备发送的lacp报文存在问题,但是对端huawei设备的日志看也是指向我们设备发出的lacp存在异常:

 

对端设备打印日志如下:

Oct 10 2022 07:32:03+08:00 HW %%01LACP/3/LAG_DOWN_REASON_PDU(l)[258]:The member of the LACP mode Eth-Trunk interface went down because the local device received changed LACP PDU from partner. (TrunkName=Eth-Trunk27, PortName=100GE1/0/3, Reason=PartnerSyncFalse, OldParam=b1Synchronization:1, NewParam=b1Synchronization:0)

Oct 10 2022 07:32:03+08:00 HW %%01LACP/3/LAG_DOWN_REASON_PDU(l)[259]:The member of the LACP mode Eth-Trunk interface went down because the local device received changed LACP PDU from partner. (TrunkName=Eth-Trunk27, PortName=100GE0/0/4, Reason=PartnerSyncFalse, OldParam=b1Synchronization:1, NewParam=b1Synchronization:0)

Oct 10 2022 07:32:03+08:00 HW %%01LACP/3/OPTICAL_FIBER_MISCONNECT(l)[260]:The member of the LACP mode Eth-Trunk interface received an abnormal LACPDU, which may be caused by optical fiber misconnection. (TrunkName=Eth-Trunk27, PortName=100GE0/0/3, LocalParam=ActorOperPortKey:6993, PDUParam=PartnerKey:1089)

3)查看本端最新聚合信息如下,本地设备的maca8c9-8a34-c4e1,对端是b008-7565-4900

Aggregate Interface: Bridge-Aggregation401

Creation Mode: Manual

Aggregation Mode: Dynamic

Loadsharing Type: Shar

Management VLANs: None

System ID: 0xa, a8c9-8a36-c4e1

Local:

  Port         Status   Priority     Index   Oper-Key         Flag

HGE1/0/27(R)  S        32768    16392    40401         {ACDEF}

  HGE1/0/28    S        32768    16393    40401         {ACDEF}

Remote:

  Actor          Priority   Index    Oper-Key  SystemID             Flag  

  HGE1/0/27     32768    2        6993     0x8000, b008-7565-4900  {ACDEF}

  HGE1/0/28     32768    40       6993     0x8000, b008-7565-4900  {ACDEF}

 

System ID

设备ID(由系统的LACP优先级和系统的MAC地址共同构成)

 

4)查看选中端口收到的聚合报文,聚合震荡时收到了异常报文:

通过probe视图下的display system internal link-aggregation lacp packet interface te x/0/x count 20命令可以查看到设备收到的报文,中间有个错误报文

该异常报文的解析为:

SystemID对端为32768,本端为32768

SystemMAC对端为b008-7565-4900,本端为5825-7570-a3c0

详细如下:

[ZJHZ-IXP22-NET-PE-H3C-S6850-49-probe]display system internal link-aggregation lacp packet interface h 1/0/27 count 20

Data and Time: 10/10 07:32:03.841

Packet description:

Local:  SystemID=32768 SystemMAC=b008-7565-4900 Key=6993 Index=2 Priority=32768 Flag=13

Remote: SystemID=10 SystemMAC=a8c9-8a36-c4e1 Key=40401 Index=16392 Priority=32768 Flag=5     //正常对端发的lacp应该是这个

Data and Time: 10/10 07:32:03.807

Packet description:

Local:  SystemID=32768 SystemMAC=b008-7565-4900 Key=1089 Index=27 Priority=32768 Flag=61

Remote: SystemID=32768 SystemMAC=5825-7570-a3c0 Key=7745 Index=54 Priority=32768 Flag=61           //端口震荡时,对端设备报文发串了,把发给5825-7570-a3c0的报文发给了我们

 

 

display system internal link-aggregation lacp packet interface te 1/0/18 count 20

对应时间点报文无问题

[ZJHZ-IXP22-NET-PE-H3C-S6850-50-probe]display system internal link-aggregation lacp packet interface h 1/0/27 count 20

Aggregate interface: Bridge-Aggregation401

Data and Time: 10/10 07:32:04.003

Packet description:

Local:  SystemID=32768 SystemMAC=b008-7565-4900 Key=6993 Index=39 Priority=32768 Flag=61

Remote: SystemID=10 SystemMAC=a8c9-8a36-c4e1 Key=40401 Index=32776 Priority=32768 Flag=13

 

但是其他时间点也有异常报文,对应聚合端口也有震荡

Data and Time: 09/28 09:06:20.939

Packet description:

Local:  SystemID=32768 SystemMAC=b008-7565-4900 Key=1089 Index=27 Priority=32768 Flag=61

Remote: SystemID=32768 SystemMAC=5825-7570-a3c0 Key=7745 Index=54 Priority=32768 Flag=61

%@13458%Sep 28 09:06:20:961 2022 ZJHZ-IXP22-NET-PE-H3C-S6850-50 LAGG/6/LAGG_INACTIVE_PARTNER_KEY_WRONG: Member port HGE1/0/27 of aggregation group BAGG401 changed to the inactive state, because the operational key of the peer port was different from that of the reference port.

%@13459%Sep 28 09:06:20:964 2022 ZJHZ-IXP22-NET-PE-H3C-S6850-50 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/27 changed to down.

 

Data and Time: 09/24 08:45:57.722

Packet description:

Local:  SystemID=32768 SystemMAC=b008-7565-4900 Key=1089 Index=31 Priority=32768 Flag=61

Remote: SystemID=32768 SystemMAC=5825-7570-a3c0 Key=7745 Index=58 Priority=32768 Flag=61

%@13330%Sep 24 08:45:57:730 2022 ZJHZ-IXP22-NET-PE-H3C-S6850-50 LAGG/6/LAGG_INACTIVE_PARTNER_KEY_WRONG: Member port HGE1/0/27 of aggregation group BAGG401 changed to the inactive state, because the operational key of the peer port was different from that of the reference port.

%@13331%Sep 24 08:45:57:732 2022 ZJHZ-IXP22-NET-PE-H3C-S6850-50 IFNET/5/LINK_UPDOWN: Line protocol state on the interface HundredGigE1/0/27 changed to down.

解决方法

1、排查对端设备发送异常lacp报文的原因。

该案例对您是否有帮助:

您的评价:1

若您有关于案例的建议,请反馈:

0 个评论

该案例暂时没有网友评论

编辑评论

举报

×

侵犯我的权益 >
对根叔知了社区有害的内容 >
辱骂、歧视、挑衅等(不友善)

侵犯我的权益

×

泄露了我的隐私 >
侵犯了我企业的权益 >
抄袭了我的内容 >
诽谤我 >
辱骂、歧视、挑衅等(不友善)
骚扰我

泄露了我的隐私

×

您好,当您发现根叔知了上有泄漏您隐私的内容时,您可以向根叔知了进行举报。 请您把以下内容通过邮件发送到zhiliao@h3c.com 邮箱,我们会尽快处理。
  • 1. 您认为哪些内容泄露了您的隐私?(请在邮件中列出您举报的内容、链接地址,并给出简短的说明)
  • 2. 您是谁?(身份证明材料,可以是身份证或护照等证件)

侵犯了我企业的权益

×

您好,当您发现根叔知了上有关于您企业的造谣与诽谤、商业侵权等内容时,您可以向根叔知了进行举报。 请您把以下内容通过邮件发送到 zhiliao@h3c.com 邮箱,我们会在审核后尽快给您答复。
  • 1. 您举报的内容是什么?(请在邮件中列出您举报的内容和链接地址)
  • 2. 您是谁?(身份证明材料,可以是身份证或护照等证件)
  • 3. 是哪家企业?(营业执照,单位登记证明等证件)
  • 4. 您与该企业的关系是?(您是企业法人或被授权人,需提供企业委托授权书)
我们认为知名企业应该坦然接受公众讨论,对于答案中不准确的部分,我们欢迎您以正式或非正式身份在根叔知了上进行澄清。

抄袭了我的内容

×

原文链接或出处

诽谤我

×

您好,当您发现根叔知了上有诽谤您的内容时,您可以向根叔知了进行举报。 请您把以下内容通过邮件发送到zhiliao@h3c.com 邮箱,我们会尽快处理。
  • 1. 您举报的内容以及侵犯了您什么权益?(请在邮件中列出您举报的内容、链接地址,并给出简短的说明)
  • 2. 您是谁?(身份证明材料,可以是身份证或护照等证件)
我们认为知名企业应该坦然接受公众讨论,对于答案中不准确的部分,我们欢迎您以正式或非正式身份在根叔知了上进行澄清。

对根叔知了社区有害的内容

×

垃圾广告信息
色情、暴力、血腥等违反法律法规的内容
政治敏感
不规范转载 >
辱骂、歧视、挑衅等(不友善)
骚扰我
诱导投票

不规范转载

×

举报说明

提出建议

    +

亲~登录后才可以操作哦!

确定

亲~检测到您登陆的账号未在http://hclhub.h3c.com进行注册

注册后可访问此模块

跳转hclhub

你的邮箱还未认证,请认证邮箱或绑定手机后进行当前操作