Question about training model #150

Firegreat123 · 2022-04-19T02:24:02Z

Hi, I'm using neural tangents to construct training models architecture. Can model be saved as a file?

sschoenholz · 2022-04-19T15:00:36Z

Hey! Great question. There are a few ways that you can go about saving models.

The simplest way is to note that NT params are made of standard python datastructures (tuples and lists) along with JAX arrays, which will be serialized to standard numpy arrays. Thus, one option is to use pickle to save the whole params tree, another is to flatten the tree, save using numpy.save or numpy.savez, and then save the tree structure using pickle.

For more details and sample code for this approach check out the thread over on Haiku: google-deepmind/dm-haiku#18

Another option that's a little bit more complicated is to use jax2tf to convert the model to tensorflow and then save the model as a SavedModel. This has the advantage that it's hermetic (so that you don't need to keep the code to construct the model intact).

See here for more details: https://github.com/google/jax/tree/main/jax/experimental/jax2tf

In general, I would probably opt to save the model as numpy arrays during training and then if I wanted to have a longer term storage option to use the model on downstream tasks look into the SavedModel pipeline.

Firegreat123 · 2022-04-20T02:03:14Z

Thank you, really appreciate.

romanngg added the question Further information is requested label Apr 19, 2022

Firegreat123 closed this as completed Apr 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about training model #150

Question about training model #150

Firegreat123 commented Apr 19, 2022

sschoenholz commented Apr 19, 2022

Firegreat123 commented Apr 20, 2022

Question about training model #150

Question about training model #150

Comments

Firegreat123 commented Apr 19, 2022

sschoenholz commented Apr 19, 2022

Firegreat123 commented Apr 20, 2022