Release AWS Data Wrangler 2.9.0 · aws/aws-sdk-pandas

Caveats

⚠️ For platforms without PyArrow 4 support (e.g. MWAA, EMR, Glue PySpark Job):

➡️ pip install pyarrow==2 awswrangler

Documentation

Added S3 Select tutorial #748
Clarified wr.s3.to_csv docs #730

Enhancements

Enable server-side predicate filtering using S3 Select 🚀 #678
Support VersionId parameter for S3 read operations #721
Enable prefix in output S3 files for wr.redshift.unload_to_files #729
Add option to skip commit on wr.redshift.to_sql #705
Move integration test infrastructure to CDK 🎉 #706

Bug Fix

Wait until athena query results bucket is created #735
Remove explicit Excel engine configuration #742
Fix bucketing types #719
Change end_time to UTC #720

Thanks

We thank the following contributors/users for their work on this release:

@maxispeicher, @kukushking, @jaidisido

P.S. The AWS Lambda Layer file (.zip) and the AWS Glue file (.whl) are available below. Just upload it and run or use them from our S3 public bucket!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AWS Data Wrangler 2.9.0

Caveats

Documentation

Enhancements

Bug Fix

Thanks