f u t u r e ra
Docs: forecasting / Toc / Statistics: errors

SUMMARY

-Error metrics

Error metrics for the forecast of a time serie. Measuring the accuracy and magnitude of errors of the forecast.



summary properties error metrics
MAE MSE RMSE MAPE SMAPE MASE
Simplicity HighMedium MediumHigh MediumMedium
Interpretability HighLow HighHigh HighLow
Scale-Invariant No No No No No Yes
Penalizes large errors No Yes Yes No No No
Outlier sensitivity Medium High High Medium Medium Medium
Symmetric Yes Yes Yes No Yes Yes
Original scale Yes No Yes No No No
Sensitivity to division by 0 No No No Yes Yes Yes
We have an example of data points / observations with the forecast.
# 1 2 3 4 5 6 7 8 9 10 11 12 13
actual 20 20 25 28 29 28 31 34 35 36 41 45
predicted 20 20 24 27 29 28 30 33 35 36 40 44
predictedyi: predicted value i-th data point / observation
actualyi: true value for the i-th data point / observation
n: number of data points / observations


Mean Absolute Error (MAE)
MAE measures the average size of the error between predicted and actual values - taking the absolute difference between the predicted and actual values ​​and averaging.

MAE = (  ∑ absolute( predictedyi - actualyi )  )  / n  
- Advantages:
simplicity, easy to interpret and less sensitive to outliers.

- Disadvantages:
does not distinguish between overestimation and underestimation
does not provide information about the direction or magnitude of individual errors
Mean Squared Error (MSE)
The average squared difference between the predicted and actual values. Used metric for evaluating the accuracy of forecasting models.

MSE = (  ∑ (predictedyi - actualyi)2  )  / n  
- Advantages:
Simplicity, capture large and small errors and penalizes large errors.

- Disadvantages:
emphasizes large errors
does not differentiate between overestimation and underestimation
Root Mean Squared Error (RMSE)

The square root of the MSE.
RMSE = MSE RMSE = ((  ∑ (predictedyi - actualyi)2  )  / n  )
- Advantages:
Penalizes large errors.

- Disadvantages:
sensitive to outliers
doesn't provide insights into the direction or pattern of forecast errors
doesn't distinguish between over- and under-predictions
Mean Absolute Percentage Error (MAPE)

The average percentage difference between the predicted values and the actual values - the absolute difference between the predicted and actual values, dividing it by the actual value, and then averaging the results.
MAPE = ((  ∑ ((predictedyi - actualyi) / actualyi)  )  / n  ) x 100
- Advantages:
errors in terms of percentage - easier to compare with different series.

- Disadvantages:
sensitive to extreme values
difficulty with zero or near-zero values
is not a symmetric metric- the direction of the error affects MAPE
Symmetric Mean Absolute Percentage Error (SMAPE)

Measures the percentage difference between the observed and predicted values. The absolute difference between the actual and forecasted values, dividing it by the average of the absolute values of the actual and forecasted values, and then multiplying by 100 to express it as a percentage.

SMAPE = ((  ∑ ( absolute(predictedyi - actualyi) / ( absolute(predictedyi) + absolute(actualyi)))  )  / n  ) x 100
- Advantages:
handle zero values and its symmetry (overestimation and underestimation errors are treated equally)

- Disadvantages:
sensitive to extreme values
difficulty with zero or near-zero values for actual and predicted values

Mean Absolute Scaled Error (MASE)

measures the relative forecast accuracy compared to the naïve or benchmark model - dividing the mean absolute error (MAE) of the model’s forecasts by the MAE .

MASE = MAE / ((∑absolute(predictedyi - actualyi-1) ) / (n-1)) i start with 2

- Advantages:
is scale-independent
is robust against outliers

- Disadvantages:
sensitive to zero or near-zero values in the denominator
assumes the naïve model is accurate (not always true)

BETA