You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This document provides a detailed summary of the performance metrics for models used in our AI-enabled school mapping work.
Key Metrics
Area Under the Precision-Recall Curve (AUPRC): Optimizes the model's precision and recall performance at all thresholds
F2 Score: Emphasizes recall more than precision, prioritizing the identification of all schools.
The models are trained to maximize the AUPRC, ensuring robust performance at all possible thresholds.
Using the best (non-ensemble) model, best Class Activation Mapping (CAM) method is selected based on the largest confidence drop following perturbation of the top 10% of pixels, as evaluated using the ROAD framework.
CAMs are generated only for images with probability scores exceeding the threshold that optimizes the validation set F2 score. The final F2 scores on the test set are reported below.