Skip to main content

Couplings for Metropolized Non-Reversible MCMC

In a highly cited paper, Diaconis et al, (2000) lifted the symmetric random walk on the discrete circle by adding an auxiliary velocity variable that is flipped in each step with a specified probability, and proved that a well-tuned lifted walk on a circle of circumference n mixes in O(n) steps vs. O(n^2) steps for the unlifted walk. This work led to an avalanche of research on non-reversible MCMC including notably to the kinetic Langevin diffusion (and related processes) based on couplings Eberle et al. (2019) and hypocoercivity Cao et al. (2023). (Note, Eberle and Lörler (2024) recently proved the kinetic Langevin diffusion is a lifted Langevin diffusion.) By now, there are numerous non-asymptotic complexity upper bounds for unadjusted discretizations of kinetic Langevin diffusions including Cheng et al. 2018, Dalalyan and Riou-Durand (2020), Monmarché (2021) and Leimkuhler et al. (2024), just to name a few. However, quantitative mixing time guarantees for Metropolized discretizations are comparatively scarce.

In this context, I am happy to share a paper with Stefan Oberdörster (Bonn) which just came out in EJP: Mixing of Metropolis-Adjusted Markov Chains via Couplings: The High Acceptance Regime. In this work, we use coupling techniques to prove, for the first time, a log(1/\epsilon)-scaling of the \epsilon-mixing time of Metropolized Non-Reversible Markov chains without restrictive conditions on either the target or starting distribution. The key idea is to turn mixing time results for the unadjusted chain into mixing results for the Metropolized chain by controlling the rejection probability on the time scale it takes for a coupling of the unadjusted chain to meet. By iterating the Metropolized chain over \log(1/\epsilon) such epochs, an upper bound on the mixing time of the Metropolized chain is obtained. The work actually does much more than this: since it uses a localization argument it suffices to assume that the underlying unadjusted chain only locally mixes, which substantially relaxes the regularity and convexity conditions on the target distribution.