A discriminative approach for finding and characterizing positivity violations using decision trees

  • 2019-07-18 16:06:30
  • Ehud Karavani, Peter Bak, Yishai Shimoni
  • 1

Abstract

The assumption of positivity in causal inference (also known as commonsupport and co-variate overlap) is necessary to obtain valid causal estimates.Therefore, confirming it holds in a given dataset is an important first step ofany causal analysis. Most common methods to date are insufficient fordiscovering non-positivity, as they do not scale for modern high-dimensionalcovariate spaces, or they cannot pinpoint the subpopulation violatingpositivity. To overcome these issues, we suggest to harness decision trees fordetecting violations. By dividing the covariate space into mutually exclusiveregions, each with maximized homogeneity of treatment groups, decision treescan be used to automatically detect subspaces violating positivity. Byaugmenting the method with an additional random forest model, we can quantifythe robustness of the violation within each subspace. This solution is scalableand provides an interpretable characterization of the subspaces in whichviolations occur. We provide a visualization of the stratification rules thatdefine each subpopulation, combined with the severity of positivity violationwithin it. We also provide an interactive version of the visualization thatallows a deeper dive into the properties of each subspace.

 

Quick Read (beta)

loading the full paper ...