-
-
Notifications
You must be signed in to change notification settings - Fork 382
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
slow .zip uncompression between s3 #742
Comments
Can't see anything obviously wrong with your source. Try and work out where the bottleneck is first. Is it:
You can reduce the effects of the first two by running on EC2. For 3., there's probably not much you can do. You can also consider buying a larger hard drive to work on this locally, 180GB is well within range even for a laptop these days ;) |
I know why this is slow. The ZipFile module is doing a smart_open when it processes a Performance can be improved by changing smart_open's I'll open a PR to improve this. |
Any update here? |
The PR is stuck because some of the tests are not passing. We are awaiting feedback from the author. |
Hi All,
Just busy right now but the change has been working great so far in my
extended testing.
I’ll get to this when free.
Rusty
…On Tue, Jan 10, 2023 at 08:10 Michael Penkov ***@***.***> wrote:
The PR is stuck because some of the tests are not passing. We are awaiting
feedback from the author.
—
Reply to this email directly, view it on GitHub
<#742 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAFSWJN4HAJFD36ED73ASW3WRVNUBANCNFSM6AAAAAASIZ2ZP4>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
I think this can be closed now that #782 is merged. |
this code is writing at around 275k records per minute, any idea how to speed it up?
my file is 6GB compressed, 180GB uncompressed. so it won't fit on local disk or memory. I notice only 200MB of memory is being consumed
The text was updated successfully, but these errors were encountered: