论文标题

选择用于多个插补的辅助变量的策略的比较

A comparison of strategies for selecting auxiliary variables for multiple imputation

论文作者

Mainzer, Rheanna M., Nguyen, Cattram D., Carlin, John B., Moreno-Betancur, Margarita, White, Ian R., Lee, Katherine J.

论文摘要

多重插补(MI)是处理缺失数据的流行方法。可以将辅助变量添加到插补模型中以改善MI估计值。但是,在插补模型中包含的辅助变量的选择并不总是直接的。包括太少的可能导致重要信息被丢弃,但是包括太多的信息可能会导致插补模型的估计程序收敛的问题。已经提出了几种数据驱动的辅助变量选择策略。本文使用仿真研究和案例研究,对八种辅助变量选择策略的性能进行全面比较,以便向MI的用户提供实用建议。还进行了完整的病例分析和MI分析,其中还进行了插图模型中包含的所有辅助变量(完整模型)进行比较。我们的仿真研究结果表明,完整模型的表现优于所有辅助变量选择策略,从而在可能的情况下提供了进一步的支持,以采用包容性辅助变量策略。使用最小绝对选择和收缩操作员(LASSO)的辅助变量选择是总体上表现最好的辅助变量选择策略,当完整模型失败时,是一个有希望的选择。我们能够适用于案例研究的所有MI分析策略都会得出相似的估计。

Multiple imputation (MI) is a popular method for handling missing data. Auxiliary variables can be added to the imputation model(s) to improve MI estimates. However, the choice of which auxiliary variables to include in the imputation model is not always straightforward. Including too few may lead to important information being discarded, but including too many can cause problems with convergence of the estimation procedures for imputation models. Several data-driven auxiliary variable selection strategies have been proposed. This paper uses a simulation study and a case study to provide a comprehensive comparison of the performance of eight auxiliary variable selection strategies, with the aim of providing practical advice to users of MI. A complete case analysis and an MI analysis with all auxiliary variables included in the imputation model (the full model) were also performed for comparison. Our simulation study results suggest that the full model outperforms all auxiliary variable selection strategies, providing further support for adopting an inclusive auxiliary variable strategy where possible. Auxiliary variable selection using the Least Absolute Selection and Shrinkage Operator (LASSO) was the best performing auxiliary variable selection strategy overall and is a promising alternative when the full model fails. All MI analysis strategies that we were able to apply to the case study led to similar estimates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源