Comparing Bayesian and non-Bayesian Strategies for Variable Selection in Regression Models with Missing Covariates
Presented by:
Thomas R. Belin
UCLA Department of Biostatistics
(Joint work with Xiaowei Yang, UC Davis Division of Biostatistics; W. John Boscardin, UC San Francisco; Gang Liu, Google, Inc.)
Start Date and Time:
June 16, 2009 3:15pm – 4:15pm
Center for Community Health, UCLA Wilshire Center
10920 Wilshire Blvd., Suite 350, Conference Room
Los Angeles, CA 90025
Refreshments:
*** LIGHT REFRESHMENTS WILL BE SERVED ***
Please arrive by 3:00 p.m. for refreshments
Viewing:
Seminar can also be viewed via video conference at
Drew University in Cobb Conference Room 189.
Abstract:
Bayesian methods such as "stochastic search variable selection" (George and McCulloch 1993 JASA) have been proposed as alternatives to traditional stepwise variable selection procedures in regression models. Instead of either fixing a regression coefficient at zero or allowing it to be estimated by least squares, as in stepwise procedures, stochastic search variable selection posits a mixture prior distribution for the given coefficient, both components being centered at zero but one with a small variance and the other with a large variance. Such a framework lends itself to summaries that average over model uncertainty, such as the posterior probability that a given predictor variable has a non-zero coefficient across the set of main-effect models. But even with complete data, Markov-chain Monte Carlo (MCMC) statistical computing procedures are indicated, and when there are missing covariates, there are choices in the development of a variable-selection procedure. Specifically, one could consider whether to incorporate draws of missing values into the MCMC variable-selection algorithm (an approach we call "simultaneously impute and select", or SIAS) or whether to perform multiple imputation for missing covariates separately and then to run a Bayesian variable-selection algorithm on the multiply imputed data sets (an approach we call "impute, then select", or ITS). In this presentation, we contrast SIAS and ITS with traditional stepwise selection in both simulated settings and in a mental-health services study. We also discuss extensions from models for continuous outcomes to models for binary outcomes.
Additional Information:
For further information or to view videos and slides of past seminars, please visit http://chipts.ucla.edu/events/methods_seminars/index.asp or contact Jae Lee at sjlee@mednet.ucla.edu
This seminar series is supported by the National Institute of Mental Health (P 30 MH-58107).
