You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
How to use featuretools at the test time? It seems featuretools' feature definitions do not store train time statistics to accurately apply primitives like 'PERCENTILE' at the test time
#2697
Open
nitinmnsn opened this issue
Mar 23, 2024
· 0 comments
print(fm)
val PERCENTILE(val)
index
1 1 0.2
2 2 0.4
3 3 0.6
4 4 0.8
5 5 1.0
So far everything is expected
Now, when I get an example with the value, say, 3, at the test time. I would want it translated to 0.6 as per the training data. But, that is not what happens
So, metadata in feature definitions in fl that is the output of ft.dfs does not store train time stats needed to compute the features at the test time. This would throw any machine-learning model into a tailspin
What is the canonical way to apply featuretools at the test time?
The text was updated successfully, but these errors were encountered:
Creating a github issue for better attention. I have a StackOverflow question for the same as well
I would demonstrate the issue with an example:
Let us say we want to use the primitive 'PERCENTILE'
Imports:
For training (create a simple data with one column and let featuretools compute a percentile feature on top of it):
output:
So far everything is expected
Now, when I get an example with the value, say, 3, at the test time. I would want it translated to 0.6 as per the training data. But, that is not what happens
output:
So, metadata in feature definitions in
fl
that is the output offt.dfs
does not store train time stats needed to compute the features at the test time. This would throw any machine-learning model into a tailspinWhat is the canonical way to apply featuretools at the test time?
The text was updated successfully, but these errors were encountered: