Amd gpu support #49

Merged: 1 commit, Jun 22, 2023


lars-t-hansen (Collaborator) commented Jun 19, 2023

This parses the output of rocm-smi to generate GPU usage data for AMD cards, as is already done for NVIDIA. It's a bit of a hack but has been reliable in my testing: I have a machine with two AMD cards and some test code that uses both of them.
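For readers unfamiliar with the approach, here is a minimal sketch of what parsing such tool output can look like. It is not the code from this patch. The line format `GPU[<index>] : GPU use (%): <number>`, the `SAMPLE` text, and the function name `parse_gpu_use` are all assumptions made for illustration; the real rocm-smi output and the fields the patch extracts may differ.

```rust
// Hypothetical sample text. The exact rocm-smi output format is an
// assumption here, not something taken from the patch.
const SAMPLE: &str = "\
======= ROCm System Management Interface =======\n\
GPU[0]    : GPU use (%): 12\n\
GPU[1]    : GPU use (%): 87\n\
================================================\n";

/// Extract (device index, percent busy) pairs from lines of the assumed
/// form `GPU[<index>] : GPU use (%): <number>`. Lines that do not match,
/// such as banners and separators, are skipped rather than treated as errors.
fn parse_gpu_use(output: &str) -> Vec<(usize, f64)> {
    let mut result = Vec::new();
    for line in output.lines() {
        // Every data line is assumed to start with "GPU[".
        let Some(rest) = line.trim().strip_prefix("GPU[") else { continue };
        // Split the device index off at the closing bracket.
        let Some((index, rest)) = rest.split_once(']') else { continue };
        // Keep only the lines that carry the utilization field.
        let Some((_, value)) = rest.split_once("GPU use (%):") else { continue };
        // Drop the line if either number fails to parse.
        if let (Ok(i), Ok(v)) = (index.trim().parse::<usize>(), value.trim().parse::<f64>()) {
            result.push((i, v));
        }
    }
    result
}
```

Calling `parse_gpu_use(SAMPLE)` yields one pair per card. Skipping unrecognized lines keeps the parser tolerant of banner changes, but it also means a change in the tool's output format would silently produce no data, which is part of why this kind of parsing is a hack.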

I think that our nervousness about running nvidia-smi (see #44) is going to be even more acute for rocm-smi.

(The base rev here is not the tip of my work queue but the underlying main refactoring of ps.rs, #45. A little rebasing work will be required to land this if all the other patches land first.)

@lars-t-hansen lars-t-hansen requested a review from bast June 19, 2023 09:39
bast (Member) commented Jun 21, 2023

When I try to rebase this on top of main, src/ps.rs conflicts. Could you please have a look? Or we can do it together, or I can try it :-)

lars-t-hansen (Collaborator, Author) commented

I'll have a look.

lars-t-hansen (Collaborator, Author) commented

Rebased patch tested on ML4 with AMD cards, seems to work OK.

bast (Member) commented Jun 22, 2023

Thanks a lot!

@bast bast merged commit e840067 into NordicHPC:main Jun 22, 2023
1 check passed
@lars-t-hansen lars-t-hansen deleted the amd_gpu branch June 23, 2023 10:44