
slim_handler reduce /tmp use and enable significantly larger deployments #1022

Closed · wants to merge 3 commits
Conversation

@olirice (Contributor) commented Jul 28, 2017

Description

Updated slim_handler behavior.

Previously:

  • Download project.zip from s3 to /tmp
  • Unzip project.zip to /tmp/project
  • Do not delete project.zip

Now:

  • Download project.zip into a SpooledTemporaryFile
    • The SpooledTemporaryFile is held in memory
    • Its max_size is based on the Lambda function's memory, read from an environment variable
    • If max_size is exceeded, it transparently spills over to /tmp and continues (safe)
    • It is deleted automatically once it falls out of scope
  • Unzip project.zip to /tmp

GitHub Issues

#1020
#961
#881

@coveralls commented Jul 28, 2017


Coverage decreased (-0.04%) to 74.019% when pulling 4475b7a on olirice:master into a21c973 on Miserlou:master.

@Miserlou (Owner)

This is really great! Is there any chance you can also update the README file or write a blog post about this for the release?

Poking @mcrowson for review as well.

@coveralls commented Jul 30, 2017


Coverage decreased (-0.04%) to 74.019% when pulling c07ab31 on olirice:master into a21c973 on Miserlou:master.

@olirice (Contributor, Author) commented Jul 30, 2017

Great, README updated.

The existing blog post on Large Applications already says that Zappa supports projects up to 500 MB.

Zappa deployments now support up to 500M of zipped up Python projects. Simply set “slim_handler”: true in zappa_settings.json and your large projects can now serve up requests from Lambda without a server.

What are you looking for in a blog post?

I could write a walkthrough with code snippets to create and deploy a 500 MB project, explaining the memory considerations throughout: "Let's Deploy a 500 MB Project on Lambda" or something similar.
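For reference, the settings change the blog post describes is a one-liner in zappa_settings.json. The stage name, app path, and bucket below are illustrative; only "slim_handler": true is the setting under discussion:

```json
{
    "production": {
        "app_function": "my_app.app",
        "s3_bucket": "my-zappa-bucket",
        "slim_handler": true
    }
}
```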

@mcrowson (Collaborator)

Love the idea, I just don't think it makes a tremendous difference. It still ends up spilling onto disk, and you have both the zip size and the unzipped folder to worry about. The only change here is that we now have /tmp space plus RAM space in which to put the zip file and the unzipped contents.

Setting the RAM size to 650M to account for all of it is a huge cost increase for the whole application just to get everything to fit. I still think a streaming approach is the right long-term solution, but this might help projects that just needed an extra 100M or so. For project zips over 300M, though, there still might not be enough room for the fully unzipped contents in addition to the zip file.

@olirice (Contributor, Author) commented Jul 31, 2017

@mcrowson
Agreed. Unfortunately, Python's zipfile expects random access to the archive (the central directory sits at the end of the file), so a streaming unzip isn't possible.

@Miserlou If I can change the project upload format to .tar.gz for slim_handler projects, then a streaming unzip is entirely feasible.

Objections?

@mcrowson (Collaborator)

Totally on the same page. I say give it a go: package with the gzipped tarball and stream-unzip that way.

@olirice (Contributor, Author) commented Aug 1, 2017

@mcrowson streaming a .tar.gz from S3 works great:

import tarfile

# Resources
# parse_s3_url is a Zappa helper that splits an s3://bucket/key URL
remote_bucket, remote_file = parse_s3_url(project_zip_path)
s3 = boto_session.resource('s3')

# S3 file object
remote_project = s3.Object(remote_bucket, remote_file)

# remote_project byte stream (note: _raw_stream is a private attribute
# of botocore's StreamingBody)
raw_stream = remote_project.get()['Body']._raw_stream

# Create a tarfile from the byte stream. Note mode='r|gz' (sequential
# streaming), not 'r:gz' (random access).
# For the mode syntax see https://docs.python.org/2/library/tarfile.html
remote_archive = tarfile.open(None, 'r|gz', fileobj=raw_stream)

# Extract as usual
remote_archive.extractall(path=project_folder)

It looks like both the CLI and the core need updating to implement this feature. That's a little more than I intended to bite off.

Do you know of any contributors who might consider updating the client-side tools?

If not, how about merging the SpooledTemporaryFile PR so those of us who insist on abusing Lambda have a simple (albeit costly) solution for 500 MB deployments until I have time to loop back around and implement the streaming solution?

@mcrowson (Collaborator) commented Aug 1, 2017

I think @mbeacom offered to do the whole of it as well over on #881

@dswah commented Aug 2, 2017

I've already found this code to be super useful!

I'd like to deploy my app with this, but I don't want to manually copy over handler.py every time I reinstall Zappa...

What's the status of this PR?

@olirice (Contributor, Author) commented Aug 2, 2017

@dswah I believe @mcrowson is waiting on an update regarding @mbeacom's streaming gzip implementation over on #881 before recommending that this (not as good) solution be merged.

@mbeacom, if you had any trouble wrestling that S3 file object into a stream (I did), there's a code snippet above that might be helpful.

@olirice (Contributor, Author) commented Aug 7, 2017

Closing in favor of the better solution at #1037.
