-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
retest OpenMPI, jucx, infinileap and disni when SoftiWarp or SoftRoCe module is loaded #47
Comments
also try again to run RMA put/get tests in jucx |
https://www.reflectionsofthevoid.com/2020/07/software-rdma-revisited-setting-up.html Reflections Of The Void Software RDMA revisited setting up SoftiWARP on Ubuntu 20.04.pdf Requires the use of an actually connected network interface:
|
ucx does not support iWarp as it seems: openucx/ucx#2507 They have some commits for it but say it is untested since 2017? Also seems as if the code does not support iWarp because it checks for only Infiniband? |
Alternative is rdma_rxe (similar to UDP, though seems to keep packet order?): https://enterprise-support.nvidia.com/s/article/howto-configure-soft-roce (though outdated linux-rdma/rdma-core@0d2ff0e)
Hadronio works with Soft-RoCe, our Jucx integration requires that the listener does not get closed (which is now the default). This here suggests Soft-RoCe can improve performance of normal networks cards as well: https://www.reflectionsofthevoid.com/2011/08/soft-roce-alternative-to-soft-iwarp.html |
where stream is used during establish connection and tag is used afterwards
Soft-RoCe Checklist:
RoCe hangs might be due to unreliability of the protocol: zrlio/disni#37 (comment) SoftiWarp Checklist:
|
finished |
https://github.com/zrlio/softiwarp
software based infiniband (similar to TCP/SCTP)
The text was updated successfully, but these errors were encountered: