Is `pyiceberg.Table` thread-safe? #1305
Comments
The write process relies on the catalog's atomic swap to guarantee serializable isolation. Multiple threads can call |
I tried to run the following code: (multiple processes to
but got
I have too many rows to write to the same Iceberg table. How can I speed it up? Thanks |
This is not a thread issue. This is expected when you have concurrent writers. The writer should retry the commit. See #1084 for a similar issue. And #269 as the issue to track this feature.
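For anyone landing here, a minimal sketch of what "retry the commit" could look like on the caller's side. This is not PyIceberg's built-in behaviour, only an illustration. The helper `commit_with_retry` and its parameters are hypothetical names. The commented usage assumes the conflict surfaces as `pyiceberg.exceptions.CommitFailedException` and that `table.refresh()` reloads the latest table metadata before the next attempt.

```python
import random
import time
from typing import Callable, Tuple, Type


def commit_with_retry(
    do_commit: Callable[[], None],
    refresh: Callable[[], object],
    retry_on: Tuple[Type[BaseException], ...],
    max_attempts: int = 5,
    base_delay: float = 0.1,
) -> int:
    """Call do_commit; on a conflict, refresh and retry. Returns attempts used."""
    for attempt in range(1, max_attempts + 1):
        try:
            do_commit()
            return attempt
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the conflict to the caller
            refresh()  # pick up the metadata committed by the other writer
            # Exponential backoff with jitter so concurrent writers spread out.
            time.sleep(base_delay * (2 ** (attempt - 1)) * random.uniform(0.5, 1.5))
    raise ValueError("max_attempts must be >= 1")


# Assumed usage with PyIceberg (not executed here):
#   from pyiceberg.exceptions import CommitFailedException
#   commit_with_retry(
#       lambda: table.append(table_data),
#       table.refresh,
#       (CommitFailedException,),
#   )
```

Retrying only makes the conflict survivable. Every writer still commits separately, so heavy contention on one table will stay slow.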
You can try writing batches as parquet files, and then collect them all in the main method. Use the |
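A rough sketch of that "write in parallel, commit once" shape, in case it helps. The function `write_then_commit_once` and the `write_batch` / `commit_all` callables are hypothetical placeholders, not PyIceberg API. The comments guess that the workers would write Parquet files with `pyarrow.parquet.write_table` and that the single commit could go through `table.add_files(file_paths=...)`. Treat both as assumptions, since the comment above does not show which call was meant.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

B = TypeVar("B")


def write_then_commit_once(
    batches: Iterable[B],
    write_batch: Callable[[B], str],
    commit_all: Callable[[List[str]], None],
    max_workers: int = 4,
) -> List[str]:
    """Write every batch concurrently, then make exactly one commit."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Workers only write data files; none of them touches the table.
        paths = list(pool.map(write_batch, batches))
    # A single commit from the main thread, so there is no writer conflict.
    commit_all(paths)
    return paths


# Assumed wiring with PyIceberg and PyArrow (not executed here):
#   import pyarrow.parquet as pq
#
#   def write_batch(item):
#       path, arrow_table = item
#       pq.write_table(arrow_table, path)
#       return path
#
#   write_then_commit_once(
#       items, write_batch, lambda paths: table.add_files(file_paths=paths)
#   )
```

The point of the design is that the expensive part (writing files) runs in parallel, while the table sees one commit instead of one per worker.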
Question
Is `pyiceberg.Table` thread-safe? Can we call `table.append(table_data)` in multiple threads or processes to write to the same table in parallel?