This report describes the accuracy of the forecasts issued by several forecasting agencies at the San Francisco International Airport (San Francisco, California, United States) weather station. The error metrics are computed by comparing the forecasts issued during December 2011 to what actually happened, as recorded by the weather station.
| Agency | Full Name | Number of Forecasts | Forecast Length |
|---|---|---|---|
| NOAA | National Weather Service | 30 | 6.8 days |
| Met.no | Norwegian Meteorological Institute | 43 | 9.4 days |
| WWO | World Weather Online | 27 | 4.8 days |
| WXC | Weather Central | 31 | 15.4 days |
In the table above, the number of forecasts listed in the third column is the number of forecasts that we have in our archive. Each agency may have issued more forecasts during the month, depending on their forecasting schedule.
The table below reports the three-day hourly root mean square error (RMSE) for each of the forecasting agencies. For reference, we include the baseline error rate resulting from forecasting each variable to be the average value of that variable from the same time in past years. For each variable, the agency with the smallest error (winner) is indicated in bold.
| NOAA | Met.no | WWO | WXC | Baseline | Winner | |
|---|---|---|---|---|---|---|
| Temperature | 3.7°F | 3.1°F | 4.6°F | 3.5°F | 4.3°F | Met.no |
| Cloud Cover | 33% | 41% | 43% | 37% | 46% | NOAA |
| Wind Speed | 4 mph | 5 mph | 4 mph | 5 mph | 6 mph | WWO |
| Wind Direction | 82° | 88° | 83° | 88° | 82° | NOAA |
| Dew Point | 5.8°F | 9.6°F | 6.0°F | 7.5°F | 8.1°F | NOAA |
| Humidity | 14% | 20% | 15% | 18% | 19% | NOAA |
Precipitation is a bit different from the other weather variables in that it consists of discrete events separated by periods of inactivity, and is not well represented by a smoothly varying number, as for example temperature. The hourly RMSE is consequently not a very good measure of forecast accuracy. Instead, we break the forecasts up into 24-hour chunks (days), and categorize each forecast as either predicting precipitation or not on each day. We then compare those yes-or-no predictions to what actually happened and count the cases where the forecasts were correct or incorrect in predicting precipitation or not.
The table below reports several precipitation forecast error metrics over the first three days of the forecasts.
| NOAA | Met.no | WWO | WXC | Winner | |
|---|---|---|---|---|---|
| Overall Error Rate | 13% | 14% | 15% | 18% | NOAA |
| Predicted but Not Observed* | 31% | 34% | 52% | 53% | NOAA |
| Observed but Not Predicted** | 11% | 11% | 9% | 9% | WWO |
| * as a fraction of all days where precipitation was predicted | |||||
| ** as a fraction of all days where precipitation was not predicted | |||||
Depending on your preferences, each of these error rates may be important.
In addition to the RMSE reported above, we also present two additional error metrics on the three-day forecast. The first of these (shown below) is the maximum absolute error (MAE). This provides a measure of how far off each agency was at their worst.
| NOAA | Met.no | WWO | WXC | Winner | |
|---|---|---|---|---|---|
| Temperature | 17.0°F | 9.5°F | 15.7°F | 11.7°F | Met.no |
| Cloud Cover | 92% | 100% | 100% | 100% | NOAA |
| Wind Speed | 21 mph | 17 mph | 14 mph | 17 mph | WWO |
| Wind Direction | 180° | 180° | 177° | 180° | WWO |
| Dew Point | 23.9°F | 32.8°F | 22.5°F | 26.5°F | WWO |
| Humidity | 62% | 55% | 56% | 62% | Met.no |
The second (shown below) is the mean error, a measure of how biased the forecasts were. A negative bias means the forecasting agency predicted a lower value on average than what was observed; a positive bias means they predicted higher values.
| NOAA | Met.no | WWO | WXC | Winner | |
|---|---|---|---|---|---|
| Temperature | 1.2°F | -0.2°F | 0.6°F | 0.0°F | WXC |
| Cloud Cover | -12% | -11% | -16% | -12% | Met.no |
| Wind Speed | 0 mph | 1 mph | 1 mph | -1 mph | NOAA |
| Wind Direction | 1° | -2° | 14° | -0° | WXC |
| Dew Point | 2.7°F | -5.1°F | 0.1°F | -4.4°F | WWO |
| Humidity | 3% | -9% | 0% | -6% | WWO |
While the RMSE is probably the most generally useful error metric for understanding the accuracy of the various forecasting agencies, these additional error metrics provide additional depth.
The graphs below show how the forecasting error evolves over time. They are useful for understanding how far out a forecast should be trusted, and provide a generally more nuanced picture than the three-day error rates reported above.
In the graphs below, we provide the baseline as a reference. The baseline is the error rate resulting from forecasting each variable to be the average from the same time in past years. If the forecasts consistently perform worse than the baseline a certain number of days out, we recommend looking at the averages more than the forecast to get a sense of what that aspect of the weather will be like.
For the same reasons listed in the Three-Day Forecast Accuracy section above, the daily precipitation forecast error graphs differ from those of the other variables above.