-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About time in results dataframe #647
Comments
Hi @toncho11, The “time” corresponds to the average time it takes to train and test the pipeline on one CV fold. This “time” column allows you to compare the different pipelines together, not plan how long an experiment will take. |
Hi @PierreGtch . Thanks for your answer. It would be great if this is documented in the evaluation classes! |
Thank you @PierreGtch! Confirming all this was very important! I just wanted to correct my previous query. It should be:
|
Also time is reported in seconds. For example 0.18 in the time column means 180 milliseconds (average time per fold). |
I think the documentation of MOABB is not very clear on the "time" column. Please point me to a source if I am wrong.
I have 2 pipelines that take 53 minutes to run on many datasets and subjects. I do "WithinSession". So each line in the results is a session. The "time" column comprises of both the training and classification (not obvious).
But the time is not by session time, but by fold (thanks @gcattan). It is line
moabb/moabb/evaluations/evaluations.py
Line 262 in 0ee8eb6
print(results.groupby("pipeline").mean("score")[["score", "time"]])
because the mean of the "time" column is better estimation than the total time?
The text was updated successfully, but these errors were encountered: