Add shape-based lazy init to `LinenToNNX` (prev `LinenWrapper`) #4081
Conversation
Force-pushed from dd5936c to 2c8963d.
Codecov Report — Attention: Patch coverage is

Additional details and impacted files:

```
@@           Coverage Diff            @@
##            main    #4081    +/-   ##
=========================================
  Coverage    0.00%    0.00%
=========================================
  Files         106      108        +2
  Lines       13582    14045      +463
=========================================
- Misses      13582    14045      +463
```

☔ View full report in Codecov by Sentry.
flax/nnx/nnx/bridge/wrappers.py (Outdated)

```python
_rngs['params'] = _rngs['default']
del _rngs['default']
```
Suggested change:

```diff
-_rngs['params'] = _rngs['default']
-del _rngs['default']
+_rngs['params'] = _rngs.pop('default')
```
Now that I think about it, we have to make `Rngs` implement `MutableMapping` for either of these to work.
The `_rngs` here is a plain dict rather than an instance of the `Rngs` class, so this already works.
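For illustration, a minimal standalone sketch of the pop-based rename on a plain dict (the key value is made up):

```python
import jax

# _rngs is a plain dict, so pop() removes 'default' and returns its value,
# renaming the key in a single expression.
_rngs = {'default': jax.random.PRNGKey(0)}
_rngs['params'] = _rngs.pop('default')
assert set(_rngs) == {'params'}
```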
flax/nnx/nnx/bridge/wrappers.py (Outdated)

```python
"""To trigger init of all `LinenToNNX` module variables and return a wholesome state."""
assert callable(module)
_ = module(*args, **kwargs)
return nnx.split(module)
```
In this function we should leverage the fact that `Object._object__state._initializing` still exists and set it via something like:

```python
def _set_initializing(initializing: bool):
  for _, value in graph.iter_graph(module):
    if isinstance(value, Object):
      value._object__state._initializing = initializing
```

and use the value of `_initializing` to choose between `init` and `apply` when calling the Linen Modules.
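For illustration, a standalone sketch of the two calls the flag would select between, using a plain Linen module (module and shapes are illustrative):

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

module = nn.Dense(features=4)
x = jnp.ones((1, 3))

# When _initializing is True, the wrapper would call init, which creates
# the variables from the example input's shape:
variables = module.init({'params': jax.random.PRNGKey(0)}, x)

# When _initializing is False, it would call apply with the stored state:
y = module.apply(variables, x)
```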
Added `_set_initializing` to the `LinenToNNX` wrapper. Note that we can't check the top-level module's `._object__state._initializing`, because the top-level module might be a pure NNX module whose `._object__state._initializing` is always False.
I'd do something like this:

```python
def _set_initializing(module, initializing: bool):
  for _, value in graph.iter_graph(module):
    if isinstance(value, Object):
      value._object__state._initializing = initializing

def shaped_init(module: Module, *args, **kwargs):
  """To trigger init of all `LinenToNNX` module variables and return a wholesome state."""
  module = graph.clone(module)  # create a copy
  _set_initializing(module, True)
  assert callable(module)
  try:
    _ = module(*args, **kwargs)
  finally:
    _set_initializing(module, False)
  return nnx.split(module)
```
Done, also renamed it to `lazy_init` as discussed offline.
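A sketch of how the renamed entry point might be used, assuming the wrapper takes the Linen module plus an `nnx.Rngs`, and that `lazy_init` still returns `nnx.split(module)` as in the snippet above (import path and signatures are assumptions):

```python
import jax.numpy as jnp
import flax.linen as nn
from flax import nnx

# Hypothetical usage: wrap a Linen module, then dry-run it once so all of
# its variables are materialized from the example input's shape.
model = nnx.bridge.LinenToNNX(nn.Dense(features=4), rngs=nnx.Rngs(0))
graphdef, state = nnx.bridge.lazy_init(model, jnp.ones((1, 3)))
```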
flax/nnx/nnx/bridge/wrappers.py (Outdated)

```python
# Shape-based lazy init of the flax variables
if not rngs:
  rngs = self.rngs
if not hasattr(self, 'states'):
```
Use `self._object__state.initializing` instead, see above.

Suggested change:

```diff
-if not hasattr(self, 'states'):
+if self._object__state.initializing:
```
Done.
```python
rngs = self.rngs
if self._object__state.initializing:
  _rngs = (
    {name: stream.key.raw_value for name, stream in rngs.items()}
```
We need to generate new keys so Linen Modules get new RNG state every time.

Suggested change:

```diff
-{name: stream.key.raw_value for name, stream in rngs.items()}
+{name: stream() for name, stream in rngs.items()}
```
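The distinction: `stream.key.raw_value` is the same base key on every forward pass, while calling the stream folds in a count that advances each time. A minimal sketch with `nnx.Rngs`:

```python
from flax import nnx

rngs = nnx.Rngs(params=0)
k1 = rngs.params()  # calling the stream folds a counter into the base key
k2 = rngs.params()  # so every call yields a fresh, distinct key
```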
Done.
flax/nnx/nnx/bridge/wrappers.py (Outdated)

```python
if 'params' not in _rngs and 'default' in _rngs:
  _rngs['params'] = _rngs.pop('default')

variables = self.module.init(_rngs, *args, **kwargs)
```
We could use `init_with_output` to avoid calling the forward pass twice.
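For reference, Linen's `init_with_output` runs the forward pass once and returns both the outputs and the freshly initialized variables (module and shapes here are illustrative):

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

module = nn.Dense(features=4)
x = jnp.ones((1, 3))

# One forward pass yields both the outputs and the new variables.
y, variables = module.init_with_output({'params': jax.random.PRNGKey(0)}, x)
```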
Done.
Force-pushed from e9fa1e0 to 3f33ad8.
- Renamed the wrappers to `LinenToNNX` and `NNXToLinen` to minimize confusion.
- Moved `LinenToNNX` variable initialization to `__call__` to realize lazy init. This allows it to be a submodule of an NNX module, which doesn't have input args during initialization (see the sketch after this list).
- Added `nnx.shaped_init` to do a dry run of `__call__` and initialize the whole state & full graphdef.
- Made `LinenToNNX` states nested & closer to NNX, aka. each `VariableState` is created for every jax Array, not every collection.
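As a sketch of the submodule pattern described above (class name, shapes, and the exact wrapper/`lazy_init` signatures are assumptions based on this thread):

```python
import jax.numpy as jnp
import flax.linen as nn
from flax import nnx

class Mixed(nnx.Module):
  def __init__(self, rngs: nnx.Rngs):
    # The Linen submodule is wrapped without input shapes; its variables
    # are only created on the first __call__ (shape-based lazy init).
    self.dense = nnx.bridge.LinenToNNX(nn.Dense(features=4), rngs=rngs)
    self.head = nnx.Linear(4, 2, rngs=rngs)

  def __call__(self, x):
    return self.head(self.dense(x))

model = Mixed(rngs=nnx.Rngs(0))
# One dry run initializes the whole state & full graphdef.
graphdef, state = nnx.bridge.lazy_init(model, jnp.ones((1, 3)))
```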