Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add lag member failed on mellanox platform #191

Closed
lguohan opened this issue Apr 14, 2017 · 8 comments · Fixed by sonic-net/sonic-buildimage#540
Closed

add lag member failed on mellanox platform #191

lguohan opened this issue Apr 14, 2017 · 8 comments · Fixed by sonic-net/sonic-buildimage#540
Assignees
Labels

Comments

@lguohan
Copy link
Contributor

lguohan commented Apr 14, 2017

Apr 14 19:40:46 str-msn2700-04 NOTICE orchagent: :- addLagMember: Add member Ethernet120 to LAG PortChannel03 lid:20000000005a1 pid:100000000000f
Apr 14 19:40:46 str-msn2700-04 ERR syncd: :- handle_generic: failed to create -1
Apr 14 19:40:46 str-msn2700-04 ERR syncd: :- processEvent: failed to execute api: create, key: SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005b8, status: SAI_STATUS_FAILURE
Apr 14 19:40:46 str-msn2700-04 ERR syncd: :- processEvent: field: SAI_LAG_MEMBER_ATTR_LAG_ID, value: oid:0x20000000005a1
Apr 14 19:40:46 str-msn2700-04 ERR syncd: :- processEvent: field: SAI_LAG_MEMBER_ATTR_PORT_ID, value: oid:0x100000000000f
Apr 14 19:40:46 str-msn2700-04 NOTICE syncd: :- exit_and_notify: sending switch_shutdown_request notification to OA
Apr 14 19:40:46 str-msn2700-04 NOTICE orchagent: :- handle_switch_shutdown_request: switch shutdown request
Apr 14 19:40:46 str-msn2700-04 ERR orchagent: :- on_switch_shutdown_request: Syncd stopped
Apr 14 19:40:46 str-msn2700-04 INFO swss.sh[2648]: terminate called without an active exception

@lguohan
Copy link
Contributor Author

lguohan commented Apr 14, 2017

Please find the attached sai record.

sairedis.2017-04-14.10_30_11.766709.txt

@lguohan
Copy link
Contributor Author

lguohan commented Apr 14, 2017

it looks like we are recreating the LAG_MEMBER for the same lag_id and port_id pair.

2017-04-14.10:30:34.146625|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a6|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x20000000005a1|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000f


2017-04-14.19:40:46.847206|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005b8|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x20000000005a1|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000f

@stcheng
Copy link
Contributor

stcheng commented Apr 14, 2017

2017-04-14.10:30:15.085076|c|SAI_OBJECT_TYPE_LAG:oid:0x200000000059d|NULL=NULL
2017-04-14.10:30:15.176111|c|SAI_OBJECT_TYPE_LAG:oid:0x200000000059e|NULL=NULL
2017-04-14.10:30:15.226323|c|SAI_OBJECT_TYPE_LAG:oid:0x20000000005a1|NULL=NULL
2017-04-14.10:30:15.237719|c|SAI_OBJECT_TYPE_LAG:oid:0x20000000005a2|NULL=NULL
2017-04-14.10:30:34.041837|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a5|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x20000000005a2|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x1000000000010
2017-04-14.10:30:34.146625|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a6|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x20000000005a1|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000f
2017-04-14.10:30:34.316485|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a7|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x200000000059e|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000e
2017-04-14.10:30:34.589054|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a8|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x200000000059d|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000d
2017-04-14.19:40:44.314727|r|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005a6
2017-04-14.19:40:46.847206|c|SAI_OBJECT_TYPE_LAG_MEMBER:oid:0x1a0000000005b8|SAI_LAG_MEMBER_ATTR_LAG_ID=oid:0x20000000005a1|SAI_LAG_MEMBER_ATTR_PORT_ID=oid:0x100000000000f

The SAI function call looks fine. After the creation of all LAGs, it first removes one LAG member and then re-create it. I will double check with the latest build.

@lguohan
Copy link
Contributor Author

lguohan commented Apr 14, 2017

@marian-pritsak

@stcheng
Copy link
Contributor

stcheng commented Apr 14, 2017

@liatgrozovik hey Liat, could you also help to take a look at this? I notice that attaching/detaching members of a LAG will cause creation failure.

@marian-pritsak
Copy link
Collaborator

marian-pritsak commented Apr 19, 2017

Tried on HEAD.220-68fcc36

Commands:

teamdctl PortChannel0 port remove Ethernet0
teamdctl PortChannel0 port add Ethernet0

syncd does not crash, LAG is still working after member is added back to LAG

@stcheng
Copy link
Contributor

stcheng commented Apr 21, 2017

@marian-pritsak i tried also and it seems that everything is working well. I will double check nightly test to see if it is still an issue or I will close it.

@lguohan
Copy link
Contributor Author

lguohan commented Apr 24, 2017

what is the root cause for this?

andriymoroz-mlnx added a commit to andriymoroz-mlnx/sonic-buildimage_old that referenced this issue Apr 26, 2017
lguohan pushed a commit to sonic-net/sonic-buildimage that referenced this issue Apr 26, 2017
EdenGri pushed a commit to EdenGri/sonic-swss that referenced this issue Feb 28, 2022
Signed-off-by: Chenchen Qi <chenchen.qcc@alibaba-inc.com>
oleksandrivantsiv pushed a commit to oleksandrivantsiv/sonic-swss that referenced this issue Mar 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants