论文标题
因果预测推断和目标试验仿真
Causal predictive inference and target trial emulation
论文作者
论文摘要
观察数据的因果推断可以看作是由与观察性研究相匹配的假设种群尺度随机试验引起的缺失数据问题。这将目标试验方案与推理相应的生成预测模型联系起来,为因果假设的透明交流和治疗效果的统计不确定性提供了完整的框架,而无需反事实。这项工作的直观基础是,整个人口随机试验将为任何可观察到的因果问题提供确定性的答案。因此,我们因果推断的基本问题是假设目标试验数据的缺失,我们通过从观察数据为条件的生成预测模型中重复插定来解决。因果假设将预测模型跨种群和条件的预测模型的可运输能力映射到直观条件。我们证明了使用贝叶斯添加剂回归树的扩展和逆概率加权研究孕产烟对出生体重的影响的实际数据应用方法。
Causal inference from observational data can be viewed as a missing data problem arising from a hypothetical population-scale randomized trial matched to the observational study. This links a target trial protocol with a corresponding generative predictive model for inference, providing a complete framework for transparent communication of causal assumptions and statistical uncertainty on treatment effects, without the need for counterfactuals. The intuitive foundation for the work is that a whole population randomized trial would provide answers to any observable causal question with certainty. Thus, our fundamental problem of causal inference is the missingness of the hypothetical target trial data, which we solve through repeated imputation from a generative predictive model conditioned on the observational data. Causal assumptions map to intuitive conditions on the transportability of predictive models across populations and conditions. We demonstrate our approach on a real data application to studying the effects of maternal smoking on birthweights using extensions of Bayesian additive regression trees and inverse probability weighting.