Hello everyone, practitioner here,

I am looking to train a serious non-LLM model, and the training is expected to be very demanding, so I am looking for maximum speed.

I know that Google's TPUs are said to be among the fastest for training and inference - around 197 TFLOPS at only $0.60/hr with interruptible (spot) pricing.

Is this library TPU-optimized? Is it much faster than other existing libraries? What should I compare it against (aside from what is mentioned in the blog post)?

Thanks,
@MRiabov
I haven't thoroughly tested this on TPUs; however, I don't think there are any hardware-specific details in this implementation. Since it is pure JAX, XLA should compile it for TPUs as-is, and it should run faster than libraries that are not pure JAX.
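If you want to sanity-check TPU support yourself, here is a minimal sketch (not part of this library) for confirming that JAX sees the TPU and timing a jitted workload; the matmul, shapes, and dtype below are arbitrary placeholders, and you would substitute a training step from this library to benchmark it against alternatives:

```python
import time

import jax
import jax.numpy as jnp

# On a TPU VM this should list TpuDevice entries; on CPU/GPU it falls back accordingly.
print(jax.devices())

# Placeholder workload: a jitted matmul + nonlinearity, just to time compiled throughput.
@jax.jit
def step(x, w):
    return jnp.tanh(x @ w)

key_x, key_w = jax.random.split(jax.random.PRNGKey(0))
x = jax.random.normal(key_x, (4096, 4096), dtype=jnp.bfloat16)
w = jax.random.normal(key_w, (4096, 4096), dtype=jnp.bfloat16)

# The first call triggers XLA compilation; exclude it from the timing.
step(x, w).block_until_ready()

start = time.perf_counter()
for _ in range(100):
    out = step(x, w)
out.block_until_ready()  # dispatch is async, so wait before stopping the clock
print(f"{(time.perf_counter() - start) / 100 * 1e3:.3f} ms per step")
```

Comparing the per-step time of the same model across libraries (on the same hardware, after the compilation warm-up) is a more reliable signal than peak-TFLOPS figures.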