Performative Validity of Recourse Explanations¶

Conference: NeurIPS 2025 arXiv: 2506.15366 Code: None Area: Causal Inference Keywords: Algorithmic recourse, performative effects, causal inference, counterfactual explanations, distribution shift

TL;DR¶

This paper formally analyzes the "performative" effects of recourse explanations — when a large number of rejected applicants act on recourse recommendations, their collective behavior induces distribution shift that renders recourse invalid after model retraining — and proves that only Improvement-based Causal Recourse (ICR), which intervenes solely on causal variables, preserves "performative validity" under broad conditions.

Background & Motivation¶

1. State of the Field¶

In high-stakes machine learning decision settings (loan approvals, graduate admissions, job interview screening, etc.), rejected applicants require "recourse explanations" — guidance on how to modify their features to obtain a favorable evaluation. Existing recourse methods fall into three main categories: Counterfactual Explanations (CE), Causal Recourse (CR), and Improvement-based Causal Recourse (ICR).

2. Limitations of Prior Work¶

Prior work has focused on the effect of external distribution shifts (temporal drift, geographic variation, etc.) on recourse validity, while overlooking a fundamental phenomenon: recourse itself causes distribution shift. When a large number of applicants act on the same set of recourse recommendations, their collective behavior alters the data distribution, and the decision boundary shifts as the model is retrained.

3. Root Cause¶

The "performativity" of recourse gives rise to a self-defeating paradox: recourse recommendations may invalidate themselves. For instance, when a system advises rejected job applicants to increase their GitHub activity, many applicants use automated commit tools to fabricate activity; after model retraining, the model no longer trusts that feature, and applicants who followed the advice are rejected again — wasting time and effort with no benefit.

4. Paper Goals¶

To formally characterize the conditions under which recourse explanations remain valid under their own performative effects — i.e., when does recourse avoid being invalidated by the distribution shift it induces?

5. Starting Point¶

From a causal graph perspective, recourse actions are modeled as causal interventions, and the paper analyzes the conditional independence relationship between the recourse action variable \(A\) and the post-recourse label \(L^p\). Through d-separation analysis, two root causes of performative failure are identified: (1) the action is influenced by effect variables; and (2) the action intervenes on effect variables.

6. Core Idea¶

Performative failure of recourse fundamentally stems from dependence on non-causal (effect) variables — only ICR, which recommends interventions solely on causal variables, can guarantee performative validity.

Method¶

Overall Architecture¶

The paper establishes a theoretical framework for "performative recourse," consisting of the following components:

Pre- to post-recourse distribution modeling: The post-recourse applicant distribution is modeled as a mixture of the original distribution and the post-action distribution: \(P(L^m, X^m) = \alpha P(L,X) + (1-\alpha)P(L^p, X^p \mid \hat{L}=0)\)
Structural Causal Model (SCM): The data-generating process is assumed to be described by an acyclic SCM, and recourse actions are causal interventions \(do(a)\)
Recourse action variable \(A\): A dedicated action variable is introduced to model recourse decisions and recommendations as causal paths from pre-recourse features \(X\) to action \(A\)
Performative Validity: Requires that for all accepted points \(x\) and all mixture weights \(\alpha\), the updated model satisfies \(\hat{L}^m(x) \geq \hat{L}(x)\)

Key Designs¶

Module 1: Sufficient Conditions for Performative Validity (Proposition 5.1)¶

Function: Establishes the equivalence "non-informative action \(\Rightarrow\) performative validity"
Mechanism: If observing whether an intervention was performed does not help predict the post-recourse label \(L^p\) (given post-recourse features \(X^p\)), then the original optimal model and the post-recourse optimal model agree on all originally accepted points
Design Motivation: This transforms the performative validity problem into a conditional independence test, amenable to analysis via d-separation on causal graphs

Module 2: Identification of Two Failure Sources (Theorem 5.2)¶

Function: Precisely characterizes the two paths in the causal graph that lead to \(A \not\perp L^p \mid X^p\)
Mechanism: In the augmented causal graph (containing both pre- and post-recourse variables), d-connected paths between \(A\) and \(Y^p\) can only pass through two critical edges: (1) \(X_{de(Y)} \to A\) (effect variables influencing the action) and (2) \(A \to X_E^p\) (the action intervening on effect variables)
Design Motivation: Decomposes the abstract conditional independence problem into two operationalizable checking conditions

Module 3: Two Recovery Assumptions¶

Assumption 5.4 (Noise resampling): If the unobserved causal influences (\(U_Y\), \(U_E\)) are independent across pre- and post-recourse periods (e.g., daily weather, mood), then recourse methods that intervene only on causal variables guarantee performative validity (Proposition 5.5).

Assumption 5.6 (Invertible aggregated noise): If the effect and noise in the structural equations can be aggregated and inverted (e.g., linear additive noise, multiplicative noise), then intervening only on causal variables guarantees performative validity even when noise is held fixed (Theorem 5.7).

Module 4: Comparative Analysis of Three Recourse Methods¶

Method	Intervention Target	Optimization Objective	Performative Validity
CE (Counterfactual Explanations)	Arbitrary features (including effect variables)	Flip prediction \(\hat{L}(x')=1\)	❌ May fail
CR (Causal Recourse)	Causal interventions (including effect variables)	\(P(\hat{L}(X^p)=1 \mid x, do(a)) \geq t_r\)	❌ May fail
ICR (Improvement-based Causal Recourse)	Causal variables only	\(P(L^p=1 \mid x, do(a)) \geq t_r\)	✅ Valid under broad conditions

Loss & Training¶

This paper is a theoretical work and involves no training. The core contribution consists of theorem proofs:

Corollary 5.9: Under conditions that exclude the first failure source (Assumption 5.4 or 5.6), only ICR guarantees performative validity; both CE and CR may fail.

Key Experimental Results¶

Main Results¶

Experiments are conducted on 5 synthetic data settings and 2 real-world data settings to validate the theory:

Data Setting	CE Acceptance Rate Change	CR Acceptance Rate Change	ICR Acceptance Rate Change
LAdd (linear additive noise)	Large decrease	Large decrease	No change
LMult (linear multiplicative noise)	Large decrease	Large decrease	No change
NLAdd (nonlinear additive noise)	Large decrease	Large decrease	No change
NLMult (nonlinear multiplicative noise)	Large decrease	Large decrease	No change
LCubic (polynomial noise)	Large decrease	Large decrease	Small positive change
GPA (university admissions)	≈−80%	≈−80%	No change
Credit (credit scoring)	Significant decrease	Significant decrease	No change

Ablation Study¶

Conditional distribution shift analysis (Q1) — comparing pointwise differences in conditional probabilities before and after recourse:

Data Setting	CE Cond. Prob. Change	CR Cond. Prob. Change	ICR Cond. Prob. Change
NLAdd	−70% ~ −100%	−70% ~ −100%	0%
LAdd	Significant negative shift	Significant negative shift	0%
LMult	Significant negative shift	Significant negative shift	0%
LCubic	Significant negative shift	Significant negative shift	0% ~ +60%

Key Findings¶

CE and CR invariably produce negative conditional distribution shifts and severe performative failure — acceptance rates drop substantially across all 7 data settings
ICR maintains performative validity in all settings — acceptance rates remain nearly unchanged, with results far exceeding the scope of theoretical guarantees
The only exception: In the LCubic setting, ind. ICR exhibits a slight positive increase (0–60%) in conditional probability, which is in fact beneficial to applicants
Real-world data validation: Acceptance rates for CE/CR plummet by approximately 80% on the GPA and Credit datasets, further corroborating the theoretical predictions

Highlights & Insights¶

Conceptual innovation: "Performative validity" is a highly insightful new concept that naturally bridges the performative prediction literature with the algorithmic recourse literature, revealing a broadly overlooked practical problem
Precise causal graph analysis: D-separation analysis decomposes the abstract distribution shift problem into two concrete causal paths, providing clear and actionable guidance
Strong practical recommendation: The paper explicitly advocates the practical recommendation to "avoid CE and CR, and use only ICR," offering important guidance for the XAI community
Strong theory–experiment agreement: Experimental results perfectly validate the theoretical analysis; ICR performs well even beyond the regime covered by the theoretical guarantees

Limitations & Future Work¶

Strong causal knowledge assumption: ICR requires a complete causal graph and SCM, which are difficult to obtain in practice; the paper itself acknowledges the need to explore extensions under incomplete causal knowledge
Single-step recourse limitation: Only one recommendation–action–reevaluation cycle is considered; in practice, applicants may apply repeatedly
Absence of the model provider's perspective: The analysis is conducted solely from the applicant's viewpoint, without considering the possibility that model providers may strategically exploit recourse for guided manipulation
Discrete noise in experimental settings: Synthetic experiments use discrete noise with finite support to facilitate pointwise comparison of conditional distributions, which may limit the generalizability of the results
Lack of discussion regarding violations of causal sufficiency: The framework assumes no unobserved confounders, which often does not hold in practice

Performative Prediction (Perdomo et al., 2020): Performative validity is a natural generalization of performative stability, but requires a weaker condition — it does not require the model to remain unchanged, only that the updated model does not reject those who were originally accepted
Strategic Classification (Hardt et al., 2016): Connects recourse with strategic manipulation. The paper's central insight — "gaming vs. improvement" — is directly inherited from the strategic classification literature's distinction between interventions on causal vs. non-causal variables
Robust Recourse (Upadhyay et al., 2021): Existing robust recourse focuses on exogenous distribution shifts; this paper is the first to address endogenous shifts (induced by recourse itself)
ICR (König et al., 2023): This paper provides new theoretical support for ICR — not only is ICR preferable from the model provider's perspective, but from the standpoint of performative validity, ICR is the only reliable choice

Rating¶

Novelty: ⭐⭐⭐⭐⭐ — The concept of "performative validity" is novel, and the causal analysis identifying two failure sources is highly original
Experimental Thoroughness: ⭐⭐⭐⭐ — Theory is comprehensively validated across 5 synthetic and 2 real-world datasets, though experimental settings involve some simplifications
Writing Quality: ⭐⭐⭐⭐⭐ — The paper is clearly structured; the running example (GitHub activity vs. a master's degree) is woven throughout the paper and is highly illuminating
Value: ⭐⭐⭐⭐ — Offers direct practical guidance for XAI/recourse practice, though real-world deployment is constrained by the availability of causal knowledge