Error index out of bounds when scanning multiple CSV files from S3 with .scan_csv
#18053
Closed
2 tasks done
.scan_csv
#18053
Checks
Reproducible example
Log output
Issue description
Fail scenario: Upload a few csv files to
your_bucket_containing_multiple_csv_files
and then run the code.Success scenario: Retry with just one csv in the bucket.
Success scenario: Retry with the same (multiple) csv files in a local folder instead of S3 (and remove
storage_options
from code).Expected behavior
A lazy frame is successfully created and usable.
Installed versions
--------Version info---------
Polars: 1.4.1
Index type: UInt32
Platform: Windows-10-10.0.19045-SP0
Python: 3.11.4 (tags/v3.11.4:d2340ef, Jun 7 2023, 05:45:37) [MSC v.1934 64 bit (AMD64)]
----Optional dependencies----
adbc_driver_manager:
cloudpickle:
connectorx:
deltalake:
fastexcel:
fsspec: 2024.3.1
gevent:
great_tables:
hvplot:
matplotlib:
nest_asyncio: 1.6.0
numpy: 1.26.4
openpyxl:
pandas: 2.2.1
pyarrow: 15.0.2
pydantic:
pyiceberg:
sqlalchemy:
torch:
xlsx2csv:
xlsxwriter:
The text was updated successfully, but these errors were encountered: