News & Events

Subscribe to email list

Please select the email list(s) to which you wish to subscribe.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Image CAPTCHA

Enter the characters shown in the image.

User menu

You are here

Markov chain Monte Carlo and variational inference in the age of parallel computation

Thursday, February 6, 2025 - 10:30 to 11:30
Charles Margossian, Postdoc Research Fellow at the Flatiron Institute
Statistics Seminar
ESB 4192 / Zoom

To join this seminar virtually: Please request Zoom connection details from ea [at] stat.ubc.ca

Abstract: Probabilistic models describe complex data generating processes and have been applied to a broad range of fields, such as epidemiology, pharmacology, and astrophysics. Inference for probabilistic models poses significant computational challenges, particularly as models grow in complexity and datasets increase in size. Modern hardware, with its parallelization capabilities, offers new opportunities to accelerate statistical inference. However, many traditional methods are not inherently designed for parallel computation. Markov chain Monte Carlo (MCMC), for instance, typically relies on a few long-running chains. I propose an alternative approach: running hundreds or thousands of shorter chains in parallel. To support this paradigm, I introduce the nested “R-hat,” a novel convergence diagnostic tailored for the many-short-chains regime, paving the way for faster and more automated MCMC.

Next I examine variational inference (VI). VI already leverages the parallelization capacities of modern hardware, however it lacks the theoretical guarantees of MCMC and other statistical methods. I present two key theoretical results: (1) a positive result demonstrating that VI can effectively learn symmetries even under misspecified approximations, and (2) a negative result revealing that factorized (or mean-field) approximations lead to an impossibility theorem, preventing the simultaneous estimation of multiple measures of uncertainty . These findings provide practical guidance for selecting VI’s objective function and approximation family, offering a path toward robust and scalable inference.