Spatial Multi-Event Contingency Table

Frédéric Atger
Météo-France
frederic.atger@meteo.fr

The spatial multi-event contingency table methodology is well suited for verifying high resolution forecasts since it gives credit to forecasts that are "close" to the truth in some way but need not be exactly correct.

The performance of a set of deterministic forecasts is often represented by a simple 2 x 2 contingency table that represents the joint distribution of forecasts and observations for a specified event criterion or threshold (for example, rain exceeding 1 mm/h).

Observed yes Observed no

Forecast yes hits false alarms

Forecast no misses correct negatives

Now, for the same observed event criterion, consider a range of K thresholds on the forecasts (for example, forecast rain exceeding 1 mm/h, 2 mm/h, 5 mm/hr, etc). These can be viewed as possible decision thresholds for taking action, such as issuing a warning. Instead of the contingency table having only a single event category it now contains multiple categories corresponding to the K forecast thresholds.

Observed yes Observed no

Forecast >= threshold₁ hits₁ false alarms₁

Forecast < threshold₁ misses₁ correct negatives₁

Forecast >= threshold₂ hits₂ false alarms₂

Forecast < threshold₂ misses₂ correct negatives₂

... ... ...

... ... ...

Forecast >= threshold_K hits_K false alarms_K

Forecast < threshold_K misses_K correct negatives_K

By using multiple thresholds, a deterministic forecast system can be evaluated across a range of possible decision thresholds (instead of just one) using ROC and relative value. This enables a fairer comparison against ensemble prediction systems or other probabilistic forecasts.

For an ensemble prediction system with M members, for each forecast threshold k there are now M probability categories (at least 1 member >= threshold_k, at least 2 members >= threshold_k, etc.), yielding a total of KxM categories.

An alternative to multiple intensity thresholds is multiple "closeness" thresholds, for example, forecast event within 10 km of the location of interest, within 20 km, 30 km, etc. Forecasters conceptually interpret high resolution model output in this way. The verification results can therefore be used to assess the performance of high resolution forecasts where the exact spatial matching of forecast and observed events is difficult or unimportant.

Other forecast decision criteria are possible, depending on the application.

Decision criteria can be combined to produce multi-dimensional contingency tables. The spatial multi-category contingency table described by Atger (2001) is a good example. In the case below, the number of categories would be JxK for single-model forecasts, and JxKxM for ensemble prediction systems.

Forecast within distance₁ ... Forecast within distance_J

Observed yes Observed no ... ... Observed yes Observed no

Forecast >= threshold₁ hits₁₁ false alarms₁₁ ... ... hits_J1 false alarms_J1

Forecast < threshold₁ misses₁₁ correct negatives₁₁ ... ... misses_J1 correct negatives_J1

Forecast >= threshold₂ hits₁₂ false alarms₁₂ ... ... hits_J2 false alarms_J2

Forecast < threshold₂ misses₁₂ correct negatives₁₂ ... ... misses_J2 correct negatives_J2

... ... ... ... ... ... ...

... ... ... ... ... ... ...

Forecast >= threshold_K hits_1K false alarms_1K ... ... hits_JK false alarms_JK

Forecast < threshold_K misses_1K correct negatives_1K ... ... misses_JK correct negatives_JK

Reference:

Atger, F., 2001: Verification of intense precipitation forecasts from single models and ensemble prediction systems. Nonlin. Proc. Geophys., 8, 401-417. Click here to get the PDF (295 Kb).

	Observed yes	Observed no
Forecast yes	*hits*	*false alarms*
Forecast no	*misses*	*correct negatives*

	Observed yes	Observed no
Forecast >= threshold₁	*hits₁*	*false alarms₁*
Forecast < threshold₁	*misses₁*	*correct negatives₁*
Forecast >= threshold₂	*hits₂*	*false alarms₂*
Forecast < threshold₂	*misses₂*	*correct negatives₂*
...	...	...
...	...	...
Forecast >= threshold_K	*hits_K*	*false alarms_K*
Forecast < threshold_K	*misses_K*	*correct negatives_K*

	Forecast within distance₁		...		Forecast within distance_J
	Observed yes	Observed no	...	...	Observed yes	Observed no
Forecast >= threshold₁	*hits₁₁*	*false alarms₁₁*	...	...	*hits_J1*	*false alarms_J1*
Forecast < threshold₁	*misses₁₁*	*correct negatives₁₁*	...	...	*misses_J1*	*correct negatives_J1*
Forecast >= threshold₂	*hits₁₂*	*false alarms₁₂*	...	...	*hits_J2*	*false alarms_J2*
Forecast < threshold₂	*misses₁₂*	*correct negatives₁₂*	...	...	*misses_J2*	*correct negatives_J2*
...	...	...	...	...	...	...
...	...	...	...	...	...	...
Forecast >= threshold_K	*hits_1K*	*false alarms_1K*	...	...	*hits_JK*	*false alarms_JK*
Forecast < threshold_K	*misses_1K*	*correct negatives_1K*	...	...	*misses_JK*	*correct negatives_JK*