Replies: 1 comment
Hey @ariG23498, you can use:

```python
def put_model(model, device):
    state = nnx.state(model)
    state = jax.device_put(state, device)
    nnx.update(model, state)
```
Hey folks!
Is there a one-stop solution for onloading and offloading a model to and from any accelerator device (GPU, TPU)?
I am working on a diffusion pipeline that has 4 models in total (2 text encoders, 1 flow model, and an autoencoder). I would like to juggle between loading and offloading the models for better memory management.
Any help would be great!