You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One comment in the workshop was that we should be looking into different binning schemes for the country- and grid-level.
I think this is a fair comment. The current binning scheme is equal for both levels (and hard coded in evaluate_submissions.py, you can vary it using CompetitionEvaluation.calculate_metrics):
At the country-level, stopping the binning at 1000 does sound a bit small. We would want models that could differentiate between a conflict with 1000 deaths and 100 000 deaths. Differentiating between 1-2, 3-5, 6-10, and 11-25 at the country-level might be a really unfair challenge.
At the priogrid-level, it could also be argued whether it is important to punish a model predicting 3 fatalities for not predicting 1 or 6. The issue is of course always there on the bin edges, but they are particularly visible when the binning range is small.
The text was updated successfully, but these errors were encountered:
We have been discussing this at length, and the logic of the scheme is that it is a roughly even distribution on a log scale, something I find to be optimal (and for that reason the challenge is not unfair; the vast majority of observations also fall in the lower bins). But I agree we might want to have more bins at the right end. Calculating the number of actuals in each bin would be good as part of a justification.
I would not mind specifying a supplementary version of this with another binning scheme, of course.
One comment in the workshop was that we should be looking into different binning schemes for the country- and grid-level.
I think this is a fair comment. The current binning scheme is equal for both levels (and hard coded in evaluate_submissions.py, you can vary it using
CompetitionEvaluation.calculate_metrics
):bins = [0, 0.5, 2.5, 5.5, 10.5, 25.5, 50.5, 100.5, 250.5, 500.5, 1000.5]
At the country-level, stopping the binning at 1000 does sound a bit small. We would want models that could differentiate between a conflict with 1000 deaths and 100 000 deaths. Differentiating between 1-2, 3-5, 6-10, and 11-25 at the country-level might be a really unfair challenge.
At the priogrid-level, it could also be argued whether it is important to punish a model predicting 3 fatalities for not predicting 1 or 6. The issue is of course always there on the bin edges, but they are particularly visible when the binning range is small.
The text was updated successfully, but these errors were encountered: