Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ibv_reg_mr_iova2 failed with error bad address #115

Open
ThkerLee opened this issue Aug 17, 2023 · 3 comments
Open

ibv_reg_mr_iova2 failed with error bad address #115

ThkerLee opened this issue Aug 17, 2023 · 3 comments

Comments

@ThkerLee
Copy link

evn:

  1. ubuntu2204
  2. cuda_12.2.1_535.86.10
  3. MLNX_OFED_LINUX-23.04-1.1.3.0
  4. nccl2_2.18.3
  5. cx7 fireware 28.37.1014
    ib_write_bw is sucessed:

ib_write

nccl topo:
nccl topo

error message:

image

This error is happened on mellox cx7

@ThkerLee
Copy link
Author

env problem

@nnurlan008
Copy link

Hi @ThkerLee,

I have a similar problem where I need to assign GPU buffer to completion queue in ibv_create_cq. However, it gives me bad address error with GPU address but succeeds with CPU address. Can you please explain how you solved the issue you mentioned?

Many thanks

@candyzone
Copy link

Hi @ThkerLee,

I have a similar problem where I need to assign GPU buffer to completion queue in ibv_create_cq. However, it gives me bad address error with GPU address but succeeds with CPU address. Can you please explain how you solved the issue you mentioned?

Many thanks

Any update? when I use ib_read interface from GPU address, it hangs. But CPU address, it works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants