The eighty-first UQSay seminar on UQ, DACE and related topics will take place online on Thursday afternoon, January 23, 2025.
2–3 PM — Congye Wang (Newcastle University, UK)
Reinforcement Learning for Adaptive MCMC
An informal observation, made by several authors, is that the adaptive design of a Markov transition kernel has the flavour of a reinforcement learning task. Yet, to date it has remained unclear how to actually exploit modern reinforcement learning technologies for adaptive MCMC. The aim of this talk is to set out a general framework, called Reinforcement Learning Metropolis–Hastings, that we argue can be theoretically supported. Our principal focus is on learning fast-mixing Metropolis–Hastings transition kernels, which we cast as deterministic policies and optimise via a policy gradient. Preliminary results on the PosteriorDB benchmark will be presented.
References:
Joint work with W. Chen (University of Sydney) & H. Kanagawa (Newcastle University) & C. J. Oates (Newcastle University - Alan Turing Institute).
Organizing committee: Pierre Barbillon (MIA-Paris), Julien Bect (L2S), Nicolas Bousquet (EDF R&D), Vincent Chabridon (EDF R&D), Amélie Fau (LMPS), Filippo Gatti (LMPS), Clément Gauchy (CEA), Bertrand Iooss (EDF R&D), Alexandre Janon (LMO), Sidonie Lefebvre (ONERA), Didier Lucor (LISN), Sébastien Petit (LNE), Emmanuel Vazquez (L2S), Xujia Zhu (L2S).
Coordinators: Sidonie Lefebvre (ONERA) & Xujia Zhu (L2S)
Practical details: the seminar will be held online using Microsoft Teams.
If you want to attend this seminar (or any of the forthcoming online UQSay seminars), and if you do not already have access to the UQSay group on Teams, simply send an email and you will be invited. Please specify which email address the invitation must be sent to (this has to be the address associated with your Teams account).
You will find the link to the seminar on the "General" UQSay channel on Teams, approximately 15 minutes before the beginning.
The technical side of things: you can use Teams either directly from your web browser or using the "fat client", which is available for most platforms (Windows, Linux, Mac, Android & iOS). We strongly recommend the latter option whenever possible. Please give it a try before the seminar to anticipate potential problems.