global imports #126

hyukim17 · 2015-08-20T14:25:46Z

I know that dill can have issues pickling global variables, but I was wondering if the following case is intended. My use case for this is setting up a job on a given machine, pickling the instructions, sending it over to remote machines, unpickling the objects, and running the jobs there.

Run the following in one file.

import dill
import numpy as np

def test():
  return np

if __name__ == "__main__":
  with open("out", "w") as f:
    f.write(dill.dumps(test))

Run the next in a different file. Running in a different file is necessary to emulate the environment I am using, where the numpy import is not included at the target machine.

import dill

with open("out") as f:
  contents = f.read()

res = dill.loads(contents)

print(res())

This yields:

NameError: global name 'np' is not defined

However, the code works if, instead of the first file, I define the function test in a separate file (so that it is in a formal namespace, not "main", load and pickle it from a separate runner, the numpy import also gets packaged and unpickled successfully.

In other words, if we have a file

import numpy as np

def test():
  return np

and we have a separate file

from source import test

with open("out", "w") as f:
  f.write(dill.dumps(test))

then the unpickling is fine.

As it turns out, setting dill.settings["recurse"] = True fixes this particular example, but in my actual script, which is more complicated, doing so raises:

RuntimeError: maximum recursion depth exceeded in cmp

Can I get global imports such that:

I do not have to define a separate file for the function I am pickling
I do not trigger maximum recursion depth?

The text was updated successfully, but these errors were encountered:

mmckerns · 2015-08-25T15:22:30Z

The answer to (1) is: use dill.settings['recurse']=True. This will likely become the default setting, however it currently is not.

I can't answer (2) as posed. Can you give an example that raises the recursion error? That would help see what needs to be done.

mmckerns · 2015-09-05T18:32:37Z

as far as I can tell, this is a duplicate of #123

jessanmen1 · 2018-06-20T23:18:40Z

Just wanted to add that dill.settings['recurse']=True solved an issue like the described before while dilling scikit-learn pipelines. Thanks.

mmckerns added the duplicate label Sep 5, 2015

mmckerns closed this as completed Sep 5, 2015

mmckerns modified the milestone: dill-0.2.5 Feb 6, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

global imports #126

global imports #126

hyukim17 commented Aug 20, 2015

mmckerns commented Aug 25, 2015

mmckerns commented Sep 5, 2015

jessanmen1 commented Jun 20, 2018

global imports #126

global imports #126

Comments

hyukim17 commented Aug 20, 2015

mmckerns commented Aug 25, 2015

mmckerns commented Sep 5, 2015

jessanmen1 commented Jun 20, 2018