AWS Data Wrangler 2.9.0
Caveats
⚠️ For platforms without PyArrow 4 support (e.g. MWAA, EMR, Glue PySpark Job):
➡️ pip install pyarrow==2 awswrangler
Documentation
- Added S3 Select tutorial #748
- Clarified wr.s3.to_csv docs #730
Enhancements
- Enable server-side predicate filtering using S3 Select 🚀 #678
- Support VersionId parameter for S3 read operations #721
- Enable prefix in output S3 files for wr.redshift.unload_to_files #729
- Add option to skip commit on wr.redshift.to_sql #705
- Move integration test infrastructure to CDK 🎉 #706
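As a rough illustration of the S3 Select enhancement, the sketch below pushes a simple predicate down to S3 so that only matching rows are returned as a DataFrame. It assumes the feature is exposed as wr.s3.select_query with the keyword arguments shown; the helper names build_select_kwargs and select_rows, the bucket path, and the column name are hypothetical and not part of the release. Check the S3 Select tutorial (#748) for the authoritative usage.

```python
from typing import Any, Dict


def build_select_kwargs(path: str, column: str, min_value: float) -> Dict[str, Any]:
    """Assemble keyword arguments for wr.s3.select_query (hypothetical helper)."""
    # S3 Select exposes the object under the name "s3object"; "s" is an alias for it.
    # The values are interpolated directly, so only pass trusted input in this sketch.
    sql = f'SELECT * FROM s3object s WHERE s."{column}" > {min_value}'
    return {
        "sql": sql,
        "path": path,
        "input_serialization": "Parquet",  # assumed: the object is a Parquet file
        "input_serialization_params": {},  # no extra options needed for Parquet here
    }


def select_rows(path: str, column: str, min_value: float):
    """Run the filter server-side and return the matching rows."""
    # Imported lazily so the kwargs helper can be exercised without AWS access.
    import awswrangler as wr

    return wr.s3.select_query(**build_select_kwargs(path, column, min_value))


if __name__ == "__main__":
    # Hypothetical bucket, key, and column; printing the kwargs needs no credentials.
    print(build_select_kwargs("s3://my-bucket/trips.parquet", "trip_distance", 10))
```

Calling select_rows requires valid AWS credentials and an existing object at the given path. Because the filtering happens inside S3, less data crosses the network than when reading the whole file and filtering locally.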
Bug Fix
- Wait until Athena query results bucket is created #735
- Remove explicit Excel engine configuration #742
- Fix bucketing types #719
- Change end_time to UTC #720
Thanks
We thank the following contributors/users for their work on this release:
@maxispeicher, @kukushking, @jaidisido
P.S. The AWS Lambda Layer file (.zip) and the AWS Glue file (.whl) are available below. Just upload them and run, or use them from our public S3 bucket!