[Chassis] Update lossy profile to restrict buffer usage in congestion state #20132
@saksarav-nokia @kenneth-arista please help review.
@vmittal-msft, is this also applicable to the 202205 chassis? If so, please add the label "chassis for 202205 branch". Also update your MSFT ADO with the proper branch request tags.
Hi @judyjoseph, can you please review this? I have no experience in RDMA buffer tuning.
Why is the alpha value different for 400g/100g LCs? If a 400g port is configured at 100g speed, will it still use the 400g port alpha?
It differs so that different port speeds get different queue limits: a 100g port gets a lower limit than a 400g port. If a 400g port is configured as 100g, the alpha is set based on 100g.
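To make the answer above concrete, here is a minimal sketch of selecting the lossy-profile alpha from the configured port speed. The values -4 and -5 come from the PR description; the dictionary, function name, and the idea of a Python lookup are illustrative assumptions only, since the actual selection happens in the `buffers_defaults_t2.j2` templates, which are not shown here.

```python
# Hypothetical lookup table: alpha values are taken from the PR description
# (-4 for 400G, -5 for 100G); the names below are illustrative, not SONiC APIs.
LOSSY_ALPHA_BY_SPEED_GBPS = {400: -4, 100: -5}


def lossy_alpha(configured_speed_gbps: int) -> int:
    """Return the lossy-profile alpha for the port's *configured* speed."""
    return LOSSY_ALPHA_BY_SPEED_GBPS[configured_speed_gbps]


# A 400G-capable port that is configured at 100G is looked up as a 100G port.
print(lossy_alpha(400))
print(lossy_alpha(100))
```

The point of keying on the configured speed rather than the hardware capability is that a 400g port running at 100g ends up with the 100g limit, as stated in the reply.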
@vmittal-msft, please add a reference to the qos tests run with these new configs, or to any sonic-mgmt PR with test changes needed to support this.
Yes. I have asked the Nokia team to refer to this PR when opening the sonic-mgmt PR.
device/nokia/x86_64-nokia_ixr7250e_36x400g-r0/Nokia-IXR7250E-36x100G/1/buffers_defaults_t2.j2
device/nokia/x86_64-nokia_ixr7250e_36x400g-r0/Nokia-IXR7250E-36x100G/0/buffers_defaults_t2.j2
d5a61b1 to 079daa2
… state (sonic-net#20132) Why I did it This change is to restrict lossy queue buffer usage in case of congestion state. Work item tracking Microsoft ADO (29315559): How I did it Updated alpha from 0 to -4 (400g) & -5 (100g) port speed. This configuration is applied on system port and will be using HWSKU port speed settings. How to verify it It is verified using sonic-mgmt tests and running ok.
…ss_pkt count for lossy profile (#14585) Since the dynamic_th-alpha changed from 0 to -4 (400g) & 100g port speed for egress lossy profile. PR #sonic-net/sonic-buildimage#20132 Corresponding changes made in J2C+ qos yaml for t2 -broadcom-dnx
Cherry-pick PR to 202405: #20318
…hresholds (#15448)
* [QoS]qos_yaml j2C+ changes for new _vsq thresholds (#13069) What is the motivation for this PR? The new MMU settings to enhance performance for RDMA traffic in production. Hence the qos_params needs to be tweaked according to the set buffer profiles. However, the existing sonic-mgmt LossyQueueTest doesn't fairly verify the buffer threshold for headroom for Lossy traffic. As per the new vsq profile setting the XOFF FADT threshold/PG is way lesser than the Nominal headroom, which limits it to not utilize the headroom buffer completely and send pause frames before reaching the MAX headroom limit. Either the test case needs to be improvised by adding more source ports or a new test case should be added to verify the Lossy queue traffic at PG level
* [Chassis][Voq] Updating J2C+ qos yaml for 400G and 100G profile _egress_pkt count for lossy profile (#14585) Since the dynamic_th-alpha changed from 0 to -4 (400g) & 100g port speed for egress lossy profile. PR #sonic-net/sonic-buildimage#20132 Corresponding changes made in J2C+ qos yaml for t2 -broadcom-dnx
* [Qos]qos_yaml updated for 400G
Why I did it
This change restricts lossy queue buffer usage in the congestion state.
Work item tracking
How I did it
Updated alpha from 0 to -4 for 400g and to -5 for 100g port speeds. This configuration is applied on the system port and uses the HWSKU port speed settings.
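As a rough illustration of why a lower alpha restricts buffer usage, the sketch below assumes the common dynamic-threshold rule in which a queue may grow up to `2**alpha` times the currently free shared buffer. This PR does not spell out the formula used on this platform, so both the rule and the free-buffer figure are assumptions, and `dynamic_limit` is a hypothetical helper.

```python
def dynamic_limit(free_buffer_bytes: int, alpha_exp: int) -> float:
    """Queue limit under the assumed rule: limit = 2**alpha_exp * free buffer."""
    return free_buffer_bytes * 2.0 ** alpha_exp


# Illustrative free shared buffer size, not a measured value from any device.
free = 1_000_000

# Old value (0) versus the new values for 400G (-4) and 100G (-5).
for alpha in (0, -4, -5):
    print(alpha, dynamic_limit(free, alpha))
```

Under that assumption, moving from 0 to -4 cuts the per-queue limit to 1/16 of the free buffer, and -5 cuts it to 1/32, which is consistent with the stated goal of limiting lossy queues during congestion.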
How to verify it
It was verified using the sonic-mgmt tests, which ran OK.