You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
File "C:\Program Files\Python38\lib\site-packages\polars\lazyframe\frame.py", line 1940, in collect
returnwrap_df(ldf.collect())
ComputeError: column 'b_old' not available in schema Schema:
name: a, data type: Int64
name: b, data type: Boolean
Issue description
Some lazy processing throws exceptions when the equivalent nonlazy code runs fine.
In the reproducible example above, if I remove the lazy processing, the code returns a dataframe with zero rows, as expected:
DF3_nofilter is a dataframe where b_old definitely exists, so I don't know why the original code is complaining that b_old isn't in the schema, and, more surprisingly, why it asks about b_old at all when the filtering and dropping operations mention only b, not b_old.
Checks
Reproducible example
Log output
Issue description
Some lazy processing throws exceptions when the equivalent nonlazy code runs fine.
In the reproducible example above, if I remove the lazy processing, the code returns a dataframe with zero rows, as expected:
If I take the lazy code but do not process the filtering code or anything after, it runs fine:
DF3_nofilter is a dataframe where
b_old
definitely exists, so I don't know why the original code is complaining thatb_old
isn't in the schema, and, more surprisingly, why it asks aboutb_old
at all when the filtering and dropping operations mention onlyb
, notb_old
.Expected behavior
Output of the lazy and nonlazy processing should be the same, and the lazy processing should not throw an exception.
Installed versions
The text was updated successfully, but these errors were encountered: