Model with conditional interaction
In this example, we show global effects of a model with conditional interactions using PDP, ALE, and RHALE. In particular, we:
- show how to use
effector
to estimate the global effects using PDP, ALE, and RHALE - provide the analytical formulas for the global effects
- test that (1) and (2) match
We will use the following model:
where the features \(x_1, x_2, x_3\) are independent and uniformly distributed in the interval \([-1, 1]\).
The model has an interaction between \(x_1\) and \(x_2\) caused by the terms: \(f_{1,2}(x_1, x_2) = -x_1^2 \mathbb{1}_{x_2 <0} + x_1^2 \mathbb{1}_{x_2 \geq 0}\). This means that the effect of \(x_1\) on the output \(y\) depends on the value of \(x_2\) and vice versa. Therefore, there is no golden standard on how to split the effect of \(f_{1,2}\) to two parts, one that corresponds to \(x_1\) and one to \(x_2\). Each global effect method has a different strategy to handle this issue. Below we will see how PDP, ALE, and RHALE handle this interaction.
In contrast, \(x_3\) does not interact with any other feature, so its effect can be easily computed as \(e^{x_3}\).
import numpy as np
import matplotlib.pyplot as plt
import effector
np.random.seed(21)
model = effector.models.ConditionalInteraction()
dataset = effector.datasets.IndependentUniform(dim=3, low=-1, high=1)
x = dataset.generate_data(1_000)
PDP
Effector
Let's see below the PDP effects for each feature, using effector
.
pdp = effector.PDP(x, model.predict, dataset.axis_limits)
pdp.fit(features="all", centering=True)
for feature in [0, 1, 2]:
pdp.plot(feature=feature, centering=True, y_limits=[-2, 2])
PDP states that:
- \(x_1\) has a zero average effect on the model output
- \(x_2\) has a constant effect when \(x_2 <0\) or \(x_2 >0\), but when moving from \(x_2^-\) to \(x_2^+\) it adds (on average) \(+\frac{2}{3}\) units to \(y\)
- \(x_3\) has an effect of \(e^{x_3}\)
Derivations
How PDP leads to these explanations? Are they meaningfull? Let's have some analytical derivations. If you don't care about the derivations, skip the following three cells and go directly to the coclusions.
For \(x_1\):
For \(x_2\):
For \(x_3\):
Tests
def compute_centering_constant(func, start, stop, nof_points):
x = np.linspace(start, stop, nof_points)
y = func(x)
return np.mean(y)
def pdp_ground_truth(feature, xs):
if feature == 0:
ff = lambda x: np.zeros_like(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 1:
ff = lambda x: -1 / 3 * (x < 0) + 1 / 3 * (x >= 0)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 2:
ff = lambda x: np.exp(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
xx = np.linspace(-1, 1, 100)
y_pdp = []
for feature in [0, 1, 2]:
y_pdp.append(pdp_ground_truth(feature, xx))
plt.figure()
plt.title("PDP effects (ground truth)")
color_pallette = ["blue", "red", "green"]
for feature in [0, 1, 2]:
plt.plot(
xx,
y_pdp[feature],
color=color_pallette[feature],
linestyle="--",
label=f"feature {feature + 1}"
)
plt.legend()
plt.xlim([-1.1, 1.1])
plt.ylim([-2, 2])
plt.show()
# make a test
xx = np.linspace(-1, 1, 100)
for feature in [0, 1, 2]:
y_pdp = pdp.eval(feature=feature, xs=xx, centering=True)
y_gt = pdp_ground_truth(feature, xx)
np.testing.assert_allclose(y_pdp, y_gt, atol=1e-1)
Conclusions
Are the PDP effects intuitive?
- For \(x_1\) the effect is zero. The terms related to \(x_1\) are \(-x_1^2 \mathbb{1}_{x_2 <0}\) and \(x_1^2 \mathbb{1}_{x_2 \geq 0}\). Both terms involve an interaction with \(x_2\). Since \(x_2 \sim \mathcal{U}(-1,1)\), almost half of the instances have \(x_2^i < 0\) and the the other half \(x_2^i \geq 0\), so the the two terms cancel out.
- For \(x_2\), the effect is constant when \(x_2 < 0\) or \(x_2>0\) but has a positive jump of \(\frac{2}{3}\) when moving from \(x_2^-\) to \(x_2^+\). It makes sense; when \(x_2 < 0\) the active term is $-(x_1^i)^2 \mathbb{1}_{x_2 < 0} $ which adds a negative quantity to the output and when \(x_2 \geq 0\) the active term is \((x_1^i)^2 \mathbb{1}_{x_2 \geq 0}\) that adds something postive. Therefore in the transmission we observe a non-linearity.
- For \(x_3\), the effect is \(e^{x_3}\), as expected, since only the this term corresponds to \(x_3\) and has no interaction with other variables.
ALE
Effector
Let's see below the PDP effects for each feature, using effector
.
ale = effector.ALE(x, model.predict, axis_limits=dataset.axis_limits)
ale.fit(features="all", centering=True, binning_method=effector.axis_partitioning.Fixed(nof_bins=31))
for feature in [0, 1, 2]:
ale.plot(feature=feature, centering=True, y_limits=[-2, 2])
ALE states that: - \(x_1\) has a zero average effect on the model output (same as PDP) - \(x_2\) has a constant effect when \(x_2 <0\) or \(x_2 >0\), but when moving from \(x_2^-\) to \(x_2^+\) it adds (on average) \(+\frac{2}{3}\) units to \(y\) (same as PDP) - \(x_3\) has an effect of \(e^{x_3}\) (same as PDP)
Derivations
For all bins, except the central, it holds that bin limits are either both negative or both positive, so the effects cancel out. For central bin, i.e., the one from \(-\frac{2}{K}\) to \(\frac{2}{K}\), the effect is \(\frac{(2x^i_1)^2}{| \mathcal{S}_k |} = \frac{2}{3} \frac{K}{2} = \frac{K}{3}\). Therefore, the ALE effect is:
Tests
def ale_ground_truth(feature, xs):
if feature == 0:
ff = lambda x: np.zeros_like(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 1:
K = 51
ff = lambda x: -1/3 * (x < 0) + 1/3 * (x >= 0)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 2:
ff = lambda x: np.exp(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
xx = np.linspace(-1, 1, 100)
y_ale = []
for feature in [0, 1, 2]:
y_ale.append(ale_ground_truth(feature, xx))
plt.figure()
plt.title("ALE effects (ground truth)")
color_pallette = ["blue", "red", "green"]
for feature in [0, 1, 2]:
plt.plot(
xx,
y_ale[feature],
color=color_pallette[feature],
linestyle="--",
label=f"feature {feature + 1}"
)
plt.legend()
plt.xlim([-1.1, 1.1])
plt.ylim([-2, 2])
plt.show()
xx = np.linspace(-1, 1, 100)
for feature in [1]:# [0, 1, 2]:
y_ale = ale.eval(feature=feature, xs=xx, centering=True)
y_gt = ale_ground_truth(feature, xx)
# hack to remove the effect at undefined region
if feature == 1:
K = 31
ind = np.logical_and(xx > -1/K, xx < 1/K)
y_ale[ind] = 0
y_gt[ind] = 0
np.testing.assert_allclose(y_ale, y_gt, atol=1e-1)
Conclusions
Are the ALE effects intuitive?
ALE effects are identical to PDP effects which, as discussed above, can be considered intutive.
RHALE
Effector
Let's see below the RHALE effects for each feature, using effector
.
rhale = effector.RHALE(x, model.predict, model.jacobian, axis_limits=dataset.axis_limits)
rhale.fit(features="all", centering=True)
for feature in [0, 1, 2]:
rhale.plot(feature=feature, centering=True, y_limits=[-2, 2])
RHALE states that: - \(x_1\) has a zero average effect on the model output (same as PDP) - \(x_2\) has a zero average effect on the model output (different than PDP and ALE) - \(x_3\) has an effect of \(e^{x_3}\) (same as PDP)
Derivations
def rhale_ground_truth(feature, xs):
if feature == 0:
ff = lambda x: np.zeros_like(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 1:
K = 31
ff = lambda x: np.zeros_like(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
elif feature == 2:
ff = lambda x: np.exp(x)
z = compute_centering_constant(ff, -1, 1, 1000)
return ff(xs) - z
xx = np.linspace(-1, 1, 100)
y_rhale = []
for feature in [0, 1, 2]:
y_rhale.append(rhale_ground_truth(feature, xx))
plt.figure()
plt.title("RHALE effects (ground truth)")
color_pallette = ["blue", "red", "green"]
for feature in [0, 1, 2]:
plt.plot(
xx,
y_rhale[feature],
color=color_pallette[feature],
linestyle="-" if feature == 0 else "--",
label=f"feature {feature + 1}"
)
plt.legend()
plt.xlim([-1.1, 1.1])
plt.ylim([-2, 2])
plt.show()
for feature in [0, 1, 2]:
y_ale = rhale.eval(feature=feature, xs=xx, centering=True)
y_gt = rhale_ground_truth(feature, xx)
np.testing.assert_allclose(y_ale, y_gt, atol=1e-1)
Conclusions
Are the RHALE effects intuitive?
RHALE does not add something new, compared to ALE and PDP, for features \(x_1\) and \(x_3\). For \(x_2\), however, it does not capture the abrupt increase by \(+\frac{2}{3}\) units when moving from \(x_2^-\) to \(x_2^+\), which can be considered as an error mode of RHALE. In fact, RHALE requires a differentiable black box model, and since \(f\) is not differentiable with respect to \(x_2\), that is why we get a slightly misleading result.