
Memory used across all GPUs #180

Closed
mys007 opened this issue Jun 15, 2015 · 6 comments

@mys007
Contributor

mys007 commented Jun 15, 2015

The current implementation allocates hundreds of MB of GPU memory on every GPU present in the system (at least 102MB per device as reported by nvidia-smi), just upon a simple require 'cutorch'. This doesn't change with subsequent calls such as cutorch.setDevice(). Is there any technical reason for this behavior?

@soumith
Member

soumith commented Jun 15, 2015

we only allocate a tiny amount of scratch space, less than 2MB combined afaik.
The rest comes from NVIDIA's driver, which allocates a minimum amount of memory per process using each GPU (possibly for p2p scratch buffers etc.).

@soumith soumith closed this as completed Jun 15, 2015
@mys007
Contributor Author

mys007 commented Jun 15, 2015

Hmm, but the end effect is the same: there is 100MB less GPU memory for each process. It should be possible to start cutorch selectively on just a chosen subset of GPUs.

@soumith
Member

soumith commented Jun 15, 2015

CUDA_VISIBLE_DEVICES=0,2 th [yourscript.lua]

where you are telling it to use device 0 and device 2.
The devices are 0-indexed.
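As a concrete sketch (the script name is hypothetical), the variable is set for a single process invocation, and the selected devices are renumbered from zero inside that process:

```shell
# Make only physical GPUs 0 and 2 visible to child processes;
# inside a process they are renumbered as CUDA devices 0 and 1.
export CUDA_VISIBLE_DEVICES=0,2

# th yourscript.lua   # (commented out: requires Torch and GPUs)

echo "$CUDA_VISIBLE_DEVICES"
```

Because the driver only initializes a context on visible devices, the per-GPU memory overhead described above is paid only for the listed GPUs.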

@mys007
Contributor Author

mys007 commented Jun 15, 2015

Great, thanks!

@eriche2016

eriche2016 commented May 12, 2016

So if you use CUDA_VISIBLE_DEVICES=0 th [yourscript.lua], it means that only one GPU is available and the others are invisible to you, so there is no point in calling cutorch.setDevice(id), which switches the default GPU, right? If so, can you offer some guidelines on when to use CUDA_VISIBLE_DEVICES and when to use cutorch.setDevice(id)?

@fmassa
Contributor

fmassa commented May 12, 2016

@eriche2016 it makes sense to use it in the context of multiple GPUs. For example, with CUDA_VISIBLE_DEVICES=0,2 you select which GPUs you want to use (here 0 and 2), and with cutorch.setDevice you select on which of the visible devices your tensors will be created. This is how it's done in DataParallelTable.
If you only want to use one GPU, then there is no benefit in using cutorch.setDevice instead of CUDA_VISIBLE_DEVICES.
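A minimal Lua sketch of that interplay (assumes a CUDA machine with cutorch installed, launched as `CUDA_VISIBLE_DEVICES=0,2 th thisscript.lua`; the tensor sizes are arbitrary):

```lua
require 'cutorch'

-- Only the visible GPUs are counted, so this prints 2.
print(cutorch.getDeviceCount())

-- cutorch device IDs are 1-indexed over the *visible* devices.
cutorch.setDevice(1)                -- physical GPU 0
local a = torch.CudaTensor(100)     -- allocated on physical GPU 0

cutorch.setDevice(2)                -- physical GPU 2
local b = torch.CudaTensor(100)     -- allocated on physical GPU 2
```

So CUDA_VISIBLE_DEVICES decides which GPUs the process may touch at all (and pay driver overhead for), while cutorch.setDevice picks among those for each allocation.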
