This package evaluates the quality of machine-learning estimates of conditional average treatment effects (CATEs). Quality is assessed by estimating three targets: the best linear predictor (BLP) of the true CATEs given the estimated CATEs, the sorted group average treatment effects (GATES), and the rank-weighted average treatment effects (RATEs) induced by the estimated CATEs.
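To make the first two targets concrete, here is a minimal base-R sketch on simulated data. It does not use this package's functions and is not its implementation. It assumes a randomized experiment with known assignment probability `p`, and `cates_hat` is a hypothetical stand-in for the output of any CATE estimator. The best linear predictor is fitted with the weighted regression described by Chernozhukov et al. (2017); the group effects use a plain difference in means within quartiles of the estimated CATEs.

```r
set.seed(1)

# Simulated randomized experiment (all names here are illustrative).
n   <- 2000
p   <- 0.5                       # known treatment probability
x   <- rnorm(n)                  # single covariate
d   <- rbinom(n, 1, p)           # treatment indicator
tau <- 1 + x                     # true CATEs, known only because we simulate
y   <- x + tau * d + rnorm(n)    # observed outcome

# Hypothetical CATE estimates that overstate heterogeneity by a factor of 2.
cates_hat <- 0.5 + 2 * x

# Best linear predictor: weighted regression of y on a control term,
# the centered treatment, and its interaction with the demeaned estimates.
w   <- rep(1 / (p * (1 - p)), n)
d_c <- d - p
het <- d_c * (cates_hat - mean(cates_hat))
blp_fit <- lm(y ~ cates_hat + d_c + het, weights = w)
blp <- coef(blp_fit)[c("d_c", "het")]   # average effect, heterogeneity loading

# Sorted group effects: difference in mean outcomes between treated and
# control units within quartiles of the estimated CATEs.
groups <- cut(cates_hat, quantile(cates_hat, 0:4 / 4),
              include.lowest = TRUE, labels = FALSE)
gates <- sapply(1:4, function(g) {
  mean(y[d == 1 & groups == g]) - mean(y[d == 0 & groups == g])
})

print(blp)
print(gates)
```

In this design the coefficient on `d_c` targets the average treatment effect (1 here), and the coefficient on `het` targets the slope of the true CATEs on the estimated ones (0.5 here, because the estimates exaggerate the spread). Group effects that increase across quartiles indicate that the estimated CATEs rank units in the right order.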

To get started, see the short online tutorial.

Installation

The current development version of the package can be installed using the devtools package:

devtools::install_github("riccardo-df/evaluCATE") # run install.packages("devtools") if needed.

References

  • Imbens, G. W., & Rubin, D. B. (2015). Causal inference for statistics, social, and biomedical sciences: An introduction. Cambridge University Press. [book]

  • Chernozhukov, V., Demirer, M., Duflo, E., & Fernandez-Val, I. (2017). Generic machine learning inference on heterogeneous treatment effects in randomized experiments. National Bureau of Economic Research. [paper]

  • Athey, S., Tibshirani, J., & Wager, S. (2019). Generalized random forests. Annals of Statistics. [paper]

  • Künzel, S. R., Sekhon, J. S., Bickel, P. J., & Yu, B. (2019). Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences. [paper]

  • Imai, K., & Li, M. (2021). Statistical inference for heterogeneous treatment effects discovered by generic machine learning in randomized experiments. arXiv preprint. [paper]

  • Yadlowsky, S., Fleming, S., Shah, N., Brunskill, E., & Wager, S. (2021). Evaluating treatment prioritization rules via rank-weighted average treatment effects. arXiv preprint. [paper]