Skip to content

[QST] Why are multiple executors per GPU not supported? #6493

Answered by revans2
lw-cxn asked this question in General
Discussion options

You must be logged in to vote

It is possible to have multiple executors share a GPU. The technical limitations vary from cluster to cluster, and are mostly around resource constraints. We do it as a part of our integration tests in local mode to speed up testing. We can get away with this because the amount of memory needed for each application is very small, and because we know exactly what is going to be using the GPU and take steps to avoid overloading the GPU.

As https://nvidia.github.io/spark-rapids/docs/FAQ.html#why-are-multiple-executors-per-gpu-not-supported pointed out Spark does not support scheduling partial GPUs. This means that if you want to use the GPU but not ask Spark to hand out the GPUs to your proc…

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@lw-cxn
Comment options

@revans2
Comment options

@lw-cxn
Comment options

Answer selected by lw-cxn
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
2 participants
Converted from issue

This discussion was converted from issue #6489 on September 02, 2022 14:17.