Skip to content

Releases: delta-io/delta-rs

python-v0.5.1

20 Jul 02:09
5ad31d0
Compare
Choose a tag to compare
  • added columns argument to to_pyarrow_table method to support projections on PyArrow Table conversion @zijie0
  • added to_pandas shortcut method to convert a DeltaTable directly to pandas dataframe @bramrodenburg

rust-v0.4.0

19 Jul 00:20
Compare
Choose a tag to compare
  • added primitive writer API
  • added new DeltaTable method get_file_paths_by_partitions
  • added new DeltaTable method get_active_add_actions
  • added new DeltaTable method update_incremental
  • renamed log_bytes_from_actions to log_entry_from_actions
  • avoided clone in DeltaTable's get_files method
  • renamed camelCased fields to snake_case thanks to @nfx
  • added multi-writer support for S3 backend
  • optimized vacuum operation
  • added checkpoint writer
  • added s3-rustls feature
  • added checkpoint lambda function
  • made lock optional when creating s3 backend
  • added missing partition component in parquet path thanks to @viirya
  • started using delta statistics (row nums, null count, total bytes) in datafusion integration, thanks to @Dandandan @viirya
  • avoid unnecessary object head in DeltaTable's get_latest_version method thanks to @viirya
  • fixed file content write flush in fs storage backend

python-v0.5.0

02 Jun 19:59
d922b31
Compare
Choose a tag to compare
  • Optimize vacuum operation
  • Manage empty delta table in Python binding
  • Improve Python binding development
  • Add repair of failed/expired rename in S3 backend
  • Fix Rename camelCased fields to snake_case
  • Introduce the primitive writer API which will put a file and create the "add" action
  • Add commit_version method to DeltaTransaction
  • Add tests and Err restructure for commit_version
  • Inline JSON action join when creating delta log entry
  • Avoid commit loop call from commit_version
  • Rename log_bytes_from_actions to log_entry_from_actions
  • Add get_file_paths_by_partitions in Python bindings

python-v0.4.8

10 May 07:31
a3b27b3
Compare
Choose a tag to compare
  • Bump the version of pyarrow needed for Python bindings
  • Honor AWS_REGION env var for S3 endpoint override
  • Add valueContainsNull for the MapType in DeltaTable schema
  • Add Vacuum command in DeltaTable
  • Add "element" as the arrow list field name
  • Add Date, StructArray, and Map for Pyarrow types in Python schema bindings
  • Make list object streams send

python-v0.4.7

28 Apr 19:14
e9dc51b
Compare
Choose a tag to compare
  • Add createdTime in Metadata
  • Update arrow dependency to 4.0.0
  • Add dry_run vacuum command
  • Accommodate AWS_ENDPOINT_URL in the python binding to allow alternatives to S3
  • Add support for hive style partitioning when reading a table with to_pyarrow_dataset
  • Allow to_pyarrow_table() to take an optional list of partitions

python-v0.4.6

15 Apr 15:51
8aa9551
Compare
Choose a tag to compare
  • Add documentation for Python bindings
  • Change the default Delta timestamp to Nanoseconds for Arrow
  • Add pyarrow floatingpoint in Python bindings
  • Add Metadata in Python bindings

python-v0.4.5

05 Apr 20:50
c6d49ae
Compare
Choose a tag to compare
  • Add the functionality of filtering partitions when reading a partitioned DeltaTable
  • Improve the Python documentation using docstring
  • Enable date column support for delta to arrow schema conversion
  • Fix the struct data type generation in the Python schema method
  • Add the schema when reading DeltaTable parquet files with pyarrow in Python
  • Support custom endpoint URL for S3 backend

python-v0.4.4

23 Mar 20:46
38be44c
Compare
Choose a tag to compare

fix glibc linking for linux build

python-v0.4.3

21 Mar 20:49
Compare
Choose a tag to compare

fix windows build

python-v0.4.2

19 Mar 19:40
Compare
Choose a tag to compare
  • better AWS S3 credential handling, follow the standard credential chain and added support for EKS service account (@fvaleye @ryanlyn)
  • minor bug fix in arrow schema timestamp field conversion @nevi-me