Methods
This section explains how each method (PDP, ALE, RHALE) defines and estimates the global effects and the heterogeneity.
Notation
- \( x_s \) is the feature of interest and \( \mathbf{x}_c \) the rest, so that \( \mathbf{x} = (x_s, \mathbf{x}_c) \).
- \( f^m(x_s) \) or \( m(x_s) \) is the global effect of feature \( x_s \) using method \( m \), for example \(f^{PDP}(x_s)\) or \(PDP(x_s)\).
- \(f_c^m(x_s)\) or \(m_c(x_s)\) is the centered global effect: $$ f_c^m(x_s) = f^m(x_s) - c, \text{ where } c = \frac{\int_{x_{s,\text{min}}}^{x_{s,\text{max}}} f^m(x_s) \, dx_s}{x_{s,\text{max}} - x_{s,\text{min}}} $$ The normalizer \( c \) is the mean value of the global effect over the range of \( x_s \).
- \( h^m(x_s) \) is the heterogeneity function of the effect of \( x_s \) using method \( m \).
- \( H^m_{x_s} \) is the heterogeneity value of the effect of \( x_s \) using method \( m \): $$ H_{x_s}^m = \frac{\int_{x_{s,\text{min}}}^{x_{s,\text{max}}} h^m(x_s) \, dx_s}{x_{s,\text{max}} - x_{s,\text{min}}}. $$
PDP
Global effect
The Partial Dependence Plot (PDP) is defined as the average of the predictions over the rest of the features:
and the centered global effect is \(PDP_c(x_s) = PDP(x_s) - c\).
Heterogeneity
The ICE plot is the effect of the feature \( x_s \) for each instance \( i \):
The heterogeneity function is the mean squared difference between the centered-ICE plot and the centered-PDP plot:
and the heterogeneity value is \( H_{x_s}^{PDP} \).
ALE
Global effect
ALE, first, partitions the range \( [x_{s,\text{min}}, x_{s,\text{max}}] \) into \( K \) intervals (bins), each containing \( \mathcal{S}_k \) instances (the ones with \( x_s \) in the \( k \)-th bin).
Each instance has a (local) effect that is computed as the difference between the prediction of the model after setting \(x_s\) to the right and the left boundary of the bin:
On bin \(k\), the bin-effect is the average of the local effects:
and the ALE effect at \(x_s\) is the sum of the bin-effects up to the bin containing \(x_s\):
The centered global effect is \( ALE_c(x_s) = ALE(x_s) - c \).
Heterogeneity
The bin-variance, \(\text{Var}_k\), is the variance of the local effects in the bin \(k\):
The heterogeneity function \(h^{ALE}(x_s)\) equals to the bin-variance of the bin containing \(x_s\):
The heterogeneity value is the mean of these variances: