-
Notifications
You must be signed in to change notification settings - Fork 998
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
get_historical_features fails with dask error for file offline store #2865
Comments
In fact, the last version that works is 0.18.1 |
The problem was: source = FileSource(
path=str(source_path),
event_timestamp_column="timestamp",
created_timestamp_column="timestamp",
) When both timestamp columns are the same, it breaks. Once I changed to: source = FileSource(
path=str(source_path),
timestamp_field="timestamp",
) it's no longer an issue. I will leave this ticket open, and let the maintainers decide if this is expected behavior or if there's something to be done to fix it or add some explicit asserts. |
Thanks for the details @elshize - this definitely smells like a bug we need to fix! |
I was unable to reproduce this issue locally - for posterity this is my setup:
@elshize can you see if this is still an issue for you and reopen this if that's the case? |
actually I was able to repro this; the source of the issue was reusing the same column for |
Yes, the problem was reusing the column. I shared that when I earlier in the comment, sorry if it wasn't entirely clear. |
Expected Behavior
where
feature_store
is a feature store with file offline store andfs_columns
is a list of column names, anddf
is a Pandas data frame, should work.Current Behavior
It currently raises an error inside of dask:
Stacktrace:
Steps to reproduce
Here is my feature store definition:
Specifications
Possible Solution
This works fine in at least version 0.18.1, but I think it fails for any >0.20
It might have something to do with adding Dask requirement, maybe the version is insufficient? I used to use 2022.2 before, but the requirement is now for 2022.1.1. But this is just a guess, really.
The text was updated successfully, but these errors were encountered: