Testing Distributional Implications of LATE
Many randomized encouragement designs have imperfect compliance, where only a fraction of people comply with their assignment. Examples include phone-bank get-out-the-vote (GOTV) campaigns and draft lotteries like the Vietnam Draft Lottery. In these settings, it is common to use instrumental variable (IV) regression for analysis.
Instrumental variables identify the Local Average Treatment Effect (LATE) under three standard conditions: random assignment, exclusion, and monotonicity. The exclusion restriction embodies a key intuition: we don't expect GOTV efforts over the phone to affect people we aren't able to reach, nor do we expect people who were merely selected in the Vietnam Draft Lottery to hold different attitudes towards minorities solely because they were drafted; whatever effects we see, we expect them to result from actual service.
This leads to a sharp testable implication. When the instrument Z flips, only compliers can change treatment status, so only compliers can contribute to the treatment effect. The distribution of the intention-to-treat (ITT) effect must therefore follow a specific pattern: a lumpy one, with movement reflecting complier-only effects and nothing else.
We turn this insight into testable distributional equalities. Under the LATE assumptions, the difference in outcome CDFs across instrument values must equal the complier share times the difference in complier potential-outcome CDFs. Formally, for all y: $F_{Y|Z=1}(y) - F_{Y|Z=0}(y) = p_C[F_{1C}(y) - F_{0C}(y)]$, where $p_C$ is the complier share and $F_{1C}$, $F_{0C}$ are the complier potential-outcome CDFs under treatment and control. This extends LATE from a statement about means to a family of restrictions across the entire distribution.
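To make the equality concrete, here is a minimal simulation sketch. The DGP is a hypothetical one of our own (three principal strata with a 30% complier share, a constant unit treatment effect, and exclusion holding); it checks that the empirical CDF difference across instrument values matches the complier-weighted right-hand side:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

# Hypothetical DGP: three principal strata, no defiers, exclusion holds.
strata = rng.choice(["c", "nt", "at"], size=n, p=[0.3, 0.5, 0.2])
z = rng.integers(0, 2, size=n)
d = np.where(strata == "c", z, (strata == "at").astype(int))  # compliers follow Z

y0 = rng.normal(0, 1, size=n)  # baseline (untreated) outcome
y = y0 + 1.0 * d               # treatment shifts the outcome by +1

def ecdf(sample, grid):
    """Empirical CDF of `sample` evaluated at each point of `grid`."""
    return np.searchsorted(np.sort(sample), grid, side="right") / len(sample)

grid = np.linspace(-3, 4, 50)

# Left-hand side: difference in outcome CDFs across instrument values.
lhs = ecdf(y[z == 1], grid) - ecdf(y[z == 0], grid)

# Right-hand side: complier share times difference in complier CDFs.
comp = strata == "c"
p_c = 0.3
rhs = p_c * (ecdf(y0[comp] + 1.0, grid) - ecdf(y0[comp], grid))

max_gap = np.max(np.abs(lhs - rhs))  # small, up to sampling error
```

Both sides agree up to sampling error. Under an exclusion violation, never-takers' or always-takers' outcomes would also respond to Z and the equality would break.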
These distributional restrictions provide leverage for detecting violations of the underlying assumptions. Exclusion violations shift the entire distribution, including regions where compliers are absent. Defiers create opposing movements that disturb the expected monotonicity of the CDF difference. The ITT effect distribution should exhibit concentrated movement where compliers lie and zero movement elsewhere—deviations from this pattern signal assumption failures.
We operationalize these insights through two main tests. The first is a uniform test of the distributional equality using either Kolmogorov-Smirnov or Cramér-von Mises statistics. We estimate complier CDFs using Abadie-style weighting, enforce shape restrictions through monotone rearrangement, and handle covariates via cross-fitting. The second is a GMM test that focuses on specific quantiles, which proves useful when violations are expected to concentrate in the tails of the distribution.
A complementary falsification test exploits heterogeneity in compliance propensity. In covariate regions where compliance approaches zero, LATE predicts null ITT effects. We test this implication by examining weighted conditional mean differences across the instrument, with weights concentrated on low-compliance regions. This provides a direct test of exclusion using observable variation.
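One way such a falsification check might be sketched (the quantile binning and the median-threshold weighting rule here are illustrative assumptions, not the exact estimator):

```python
import numpy as np

def low_compliance_itt(y, d, z, x, n_bins=10):
    """Falsification sketch: ITT difference in means, weighted toward covariate
    bins with low estimated compliance. Should be near zero under exclusion."""
    edges = np.quantile(x, np.linspace(0, 1, n_bins + 1))
    bins = np.clip(np.digitize(x, edges[1:-1]), 0, n_bins - 1)
    pc_hat = np.empty(n_bins)
    itt = np.empty(n_bins)
    for b in range(n_bins):
        m = bins == b
        pc_hat[b] = d[m & (z == 1)].mean() - d[m & (z == 0)].mean()
        itt[b] = y[m & (z == 1)].mean() - y[m & (z == 0)].mean()
    # Put weight only on bins whose compliance falls below the median
    # (an illustrative rule for concentrating on low-compliance regions).
    w = np.maximum(0.0, np.median(pc_hat) - pc_hat)
    if w.sum() == 0:
        w[:] = 1.0  # fall back to equal weights if compliance is flat
    w /= w.sum()
    return float(w @ itt)

# Usage on a hypothetical DGP where compliance requires x > 0:
rng = np.random.default_rng(1)
n = 50_000
x = rng.uniform(-1, 1, n)
z = rng.integers(0, 2, n)
comply = rng.uniform(size=n) < np.maximum(0.0, x)
d = np.where(comply, z, 0)
y = rng.normal(size=n) + d  # exclusion holds: Z matters only through D
stat = low_compliance_itt(y, d, z, x)  # close to zero, as LATE predicts
```

A direct effect of Z on Y would show up as a nonzero weighted ITT even in regions where essentially no one complies.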
Simulations validate the approach across realistic scenarios. Under the null with complier shares ranging from 10% to 30%, the tests maintain nominal size. Exclusion violations generate detectable distributional distortions, with power approaching one for moderate direct effects (γ = 0.5). When defiers are present, detection power increases with the defier share: at n = 2000, a 5% defier share yields 14% power and a 10% share yields 31%.
The method's power depends predictably on sample size and complier share. Doubling the sample size from 2000 to 4000 approximately doubles power for small exclusion violations. The relationship with complier share proves more complex—very low or very high complier shares reduce power against defier alternatives, as the distributional signature becomes harder to distinguish from sampling variation.
These tests complement existing LATE diagnostics. While covariate balance tests check randomization and first-stage F-statistics assess instrument strength, our approach directly examines the exclusion and monotonicity assumptions that are typically untestable. The distributional perspective reveals violations that mean-based tests might miss—for instance, when positive and negative exclusion violations cancel in expectation but distort distributional shape.
In applications where exclusion holds approximately but not exactly, the magnitude of distributional distortions provides a measure of violation severity. This information guides sensitivity analyses and helps researchers assess whether IV estimates are sufficiently reliable for policy conclusions.
Implementation remains straightforward with standard econometric software. The main computational burden comes from bootstrap inference when covariates are present, but this remains manageable even for moderate sample sizes. Cross-fitting prevents overfitting in first-stage estimation while maintaining valid inference.
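For instance, a generic recentered nonparametric bootstrap for a distance-type statistic might look like the sketch below (assuming the statistic's population value is zero under the null; covariates and cross-fitting are omitted, and the placeholder statistic is our own):

```python
import numpy as np

def bootstrap_pvalue(stat_fn, y, d, z, n_boot=200, seed=0):
    """Recentered nonparametric bootstrap p-value for a nonnegative
    distance-type statistic whose population value is zero under the null."""
    rng = np.random.default_rng(seed)
    observed = stat_fn(y, d, z)
    n = len(y)
    draws = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, n)  # resample rows with replacement
        draws[b] = stat_fn(y[idx], d[idx], z[idx])
    # Recentering the draws at the observed value approximates the sampling
    # fluctuation of the statistic around its population value.
    return float(np.mean(np.abs(draws - observed) >= observed))

# Usage with a simple placeholder statistic (absolute ITT mean difference):
rng = np.random.default_rng(2)
n = 2_000
z = rng.integers(0, 2, n)
d = np.where(rng.uniform(size=n) < 0.5, z, 0)
y = rng.normal(size=n)  # no effect of Z at all: the null holds
stat_fn = lambda y, d, z: abs(y[z == 1].mean() - y[z == 0].mean())
p = bootstrap_pvalue(stat_fn, y, d, z)  # p-value for the null DGP
```

The same wrapper applies to the KS or Cramér-von Mises statistics; the cost scales linearly in the number of bootstrap draws.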
The broader methodological point concerns the testable implications of identification assumptions. These assumptions often imply restrictions beyond their immediate targets. By developing appropriate tests for these implications, we strengthen our ability to assess when causal identification strategies succeed or fail. The distributional approach presented here represents one avenue for improving the credibility of instrumental variable analyses.