Events grouped by pre-event GRS tier. A well-calibrated model shows predicted and observed rates that track each other across bins. In-sample retrospective metric.
| GRS Tier | Events | Crises (sev ≥ 4) | Predicted Rate | Observed Rate | Gap |
|---|---|---|---|---|---|
| Low (< 15) | 2 | 1 | 0.0% | 50.0% | 50.0pp |
| Moderate (15-30) | 6 | 3 | 26.5% | 50.0% | 23.5pp |
| Elevated (30-45) | 27 | 22 | 44.1% | 81.5% | 37.4pp |
| High (45-60) | 8 | 8 | 61.8% | 100.0% | 38.2pp |
| Critical (60+) | 4 | 4 | 85.3% | 100.0% | 14.7pp |
How well does GRS flag events across different crisis types? In-sample retrospective metric.
| Category | Events | Flagged | TPR | Avg GRS | Avg Severity |
|---|---|---|---|---|---|
| Conflict | 17 | 17 | 100.0% | 47.4 | 4.6 |
| Economic | 9 | 7 | 77.8% | 36 | 4 |
| Political | 14 | 10 | 71.4% | 36 | 3.6 |
| Social | 7 | 5 | 71.4% | 35.7 | 4.4 |
