Outliers, or unusual data points, can seriously affect the performance of modeling and forecasting. The HUrob
method is designed to eliminate their effect. This method utilizes the reflection-based principal component analysis (RAPCA) algorithm of Hubert, Rousseeuw, and Verboven (2002) to obtain projection-pursuit estimates of principal components and their associated scores. The integrated squared error provides a measure of the accuracy of the principal component approximation for each year (Hyndman and Ullah (2007)). Outlying years result in a larger integrated squared error than the critical value obtained by assuming normality of \(\epsilon_t(x)\) (see Hyndman and Ullah (2007) for details). By assigning zero weight to outliers, the HUrob
method can then be used to model and forecast mortality rates without the possible influence of outliers.
How the HUrob
Method Works
The HUrob
method uses an advanced technique called reflection-based principal component analysis (RAPCA). This technique helps to identify the main patterns in the data while filtering out those outliers that could lead to inaccurate predictions.
Here’s a simplified explanation:
- Principal Components: Think of these as the main patterns in your data. The method tries to capture these patterns and ignore the noise or outliers.
- Outliers: These are unusual data points that don’t fit the general pattern. If left unchecked, they can distort the model’s predictions.
By assigning zero weight to outliers, the HUrob
method effectively ignores these problematic data points, making the model’s predictions more robust and trustworthy.
Using the functional data analysis paradigm of Ramsay and Silverman (2005), Hyndman and Ullah (2007) proposed a nonparametric method for modeling and forecasting log mortality rates. This approach extends the Lee-Carter (LC) method in four ways:
- The log mortality rates are smoothed prior to modeling.
- Functional principal components analysis (FPCA) is used.
- More than one principal component is used in forecasting.
- The forecasting models for the principal component scores are typically more complex than the RWD model.
The log mortality rates are smoothed using penalized regression splines. To emphasize that age, \(x\), is now considered a continuous variable, we write \(m_t(x)\) to represent mortality rates for age \(x \in [x_1, x_p]\) in year \(t\). We then define \(z_t(x) = \log m_t(x)\) and write:
\[
z_t(x_i) = f_t(x_i) + \sigma_t(x_i) \epsilon_{t,i}, \quad i = 1, \dots, p, \quad t = 1, \dots, n,\ (2)
\]
where \(f_t(x_i)\) denotes a smooth function of \(x\) as before; \(\sigma_t(x_i)\) allows the amount of noise to vary with \(x_i\) in year \(t\), thus rectifying the assumption of homoskedastic error in the LC model; and \(\epsilon_{t,i}\) is an independent and identically distributed standard normal random variable.
Given continuous age \(x\), functional principal components analysis (FPCA) is used in the decomposition. The set of age-specific mortality curves is decomposed into orthogonal functional principal components and their uncorrelated principal component scores. That is:
\[
f_t(x) = a(x) + \sum_{j=1}^{J} b_j(x) k_{t,j} + e_t(x),
\]
where \(a(x)\) is the mean function estimated by \(\hat{a}(x) = \frac{1}{n} \sum_{t=1}^{n} f_t(x)\); \(\{b_1(x), \dots, b_J(x)\}\) is a set of the first \(J\) functional principal components; \(\{k_{t,1}, \dots, k_{t,J}\}\) is a set of uncorrelated principal component scores; \(e_t(x)\) is the residual function with mean zero; and \(J < n\) is the number of principal components used. Note that we use \(a(x)\) rather than \(a_x\) to emphasize that \(x\) is treated as a continuous variable.
Multiple principal components are used because the additional components capture non-random patterns that are not explained by the first principal component (Booth, Maindonald, and Smith (2002); Renshaw and Haberman (2003); Koissi, Shapiro, and Högnäs (2006)). Hyndman and Ullah (2007) found \(J = 6\) to be larger than the number of components actually required to produce white noise residuals, and this is the default value. The conditions for the existence and uniqueness of \(k_{t,j}\) are discussed by Cardot, Ferraty & Sarda (2003).
Although Lee and Carter (1992) did not rule out the possibility of a more complex time series model for the \(k_t\) series, in practice, an RWD model has typically been employed in the LC method. For higher-order principal components, which are orthogonal by definition to the first component, other time series models arise for the principal component scores. For all components, the HUrob
method selects the optimal time series model using standard model-selection procedures (e.g., AIC). By conditioning on the observed data \(I = \{z_1(x), \dots, z_n(x)\}\) and the set of functional principal components \(B = \{b_1(x), \dots, b_J(x)\}\), the \(h\)-step-ahead forecast of \(z_{n+h}(x)\) can be obtained by:
\[
\hat{z}_{n+h|n}(x) = E[z_{n+h}(x) | I, B] = \hat{a}(x) + \sum_{j=1}^{J} b_j(x) \hat{k}_{n+h|n,j},
\]
where \(\hat{k}_{n+h|n,j}\) denotes the \(h\)-step-ahead forecast of \(k_{n+h,j}\) using a univariate time series model, such as the optimal ARIMA model selected by the automatic algorithm of Hyndman et al. (2008), or an exponential smoothing state space model (Hyndman et al. (2008)).
Because of the orthogonality of all components, it is easy to derive the forecast variance as:
\[
\hat{v}_{n+h|n}(x) = \text{var}[z_{n+h}(x) | I, B] = \sigma^2_a(x) + \sum_{j=1}^{J} b_j^2(x) u_{n+h|n,j} + v(x) + \sigma^2_t(x),
\]
where \(\sigma^2_a(x)\) is the variance of \(\hat{a}(x)\); \(u_{n+h|n,j}\) is the variance of \(k_{n+h,j} | k_{1,j}, \dots, k_{n,j}\) (obtained from the time series model); \(v(x)\) is the variance of \(e_t(x)\); and \(\sigma_t(x)\) is defined in Equation (2). This expression is used to construct prediction intervals for future mortality rates in R.
Key Enhancements Over Traditional Methods
The HUrob
method improves upon traditional methods like the Lee-Carter (LC) model in several ways:
- Smoothing: Log mortality rates are smoothed using techniques that stabilize the data, reducing the impact of random fluctuations.
- Principal Components Analysis (PCA): Instead of relying on just one principal component, this method uses multiple components to capture more of the complexity in the data.
- Advanced Time Series Models: For predicting future trends, the
HUrob
method uses sophisticated models like ARIMA, which are more flexible and accurate than simpler models.
Putting It All Together
In simple terms, the HUrob
method is a powerful tool for making more accurate predictions about future mortality rates. By carefully managing outliers and using advanced statistical techniques, this method helps actuaries and analysts make better-informed decisions, whether they are pricing life insurance products, setting aside reserves, or planning for the future.