Optimal experiment design in Markov chains

Date: March 28, 2024

Author: Mojmír Mutný



Abstract

Optimal Experiment Design is a classic field in statistics, closely related to Active Learning in Machine Learning. It assumes that we can estimate an unknown quantity through a series of interactions with a system, typically queries. The goal is to develop an algorithmic strategy that gathers information optimally under a budget constraint. Traditionally, it is assumed that any query can be selected at any interaction round. In this talk, however, I will discuss more complex scenarios where interactions change the state of the experimenter, thereby restricting which queries are possible next. These state transitions are modeled using a Markov chain, and the overall process can be described as a Markov Decision Process (MDP) with a non-linear reward function. The framework can be adapted to the experimenter's goal. I will examine two problems: classical exploration and best-arm identification in a reproducing kernel Hilbert space (RKHS), with applications in spatial surveillance and chemical reactor optimization. Additionally, I will link this exposition to the optimal control literature and address the computational hardness of the general problem, along with practical approximation methods based on convex relaxation techniques.
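To make the setting concrete, here is a minimal sketch of experiment design under transition constraints. Everything in it is an illustrative assumption rather than the speaker's method: the four-state chain, the 2-D feature vectors, the ridge term `REG`, and the helper names are all hypothetical. The objective is the standard D-optimality criterion (log-determinant of the information matrix), evaluated on the empirical state-visitation distribution of a trajectory, which is what makes the reward non-linear in the visitation. The planner is a plain greedy one-step heuristic, not the convex-relaxation approach mentioned in the abstract.

```python
import math

# Hypothetical toy problem: 4 states, each with a 2-D feature vector.
FEATURES = [(1.0, 0.0), (0.8, 0.6), (0.0, 1.0), (-0.6, 0.8)]
# Assumed transition structure: from each state, stay or move to a neighbor.
NEIGHBORS = {0: [0, 1], 1: [0, 1, 2], 2: [1, 2, 3], 3: [2, 3]}
REG = 1e-3  # small ridge term so the information matrix is always invertible


def visitation(trajectory, n_states):
    """Empirical state-visitation distribution of a trajectory."""
    counts = [0] * n_states
    for s in trajectory:
        counts[s] += 1
    return [c / len(trajectory) for c in counts]


def d_optimality(d):
    """log det of the 2x2 matrix sum_s d[s] * x_s x_s^T + REG * I."""
    a = b = c = 0.0
    for w, (x0, x1) in zip(d, FEATURES):
        a += w * x0 * x0
        b += w * x0 * x1
        c += w * x1 * x1
    return math.log((a + REG) * (c + REG) - b * b)


def is_feasible(trajectory):
    """Check that every consecutive pair of states is an allowed transition."""
    return all(t in NEIGHBORS[s] for s, t in zip(trajectory, trajectory[1:]))


def greedy_trajectory(start, horizon):
    """Greedy heuristic: at each step, move to the reachable state that
    most increases the D-optimality of the visitation so far."""
    traj = [start]
    n = len(FEATURES)
    for _ in range(horizon - 1):
        best = max(
            NEIGHBORS[traj[-1]],
            key=lambda s: d_optimality(visitation(traj + [s], n)),
        )
        traj.append(best)
    return traj


if __name__ == "__main__":
    traj = greedy_trajectory(start=0, horizon=6)
    print("trajectory:", traj)
    print("objective:", d_optimality(visitation(traj, len(FEATURES))))
```

In the classical setting any state could be queried at any round; here `NEIGHBORS` restricts the next query to states reachable from the current one, so the design is a feasible trajectory rather than a free allocation of queries.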


Notes


  • Personal website can be found here.

Related Seminars

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin - University of Cambridge

Mar 13, 2025

Return of the latent space cowboys: rethinking the use of VAEs in Bayesian optimisation over structured spaces

Henry Moss - University of Cambridge, Lancaster University

Jan 21, 2025

Advancing sequential decision-making: efficient querying in clustering and best of both worlds for contextual bandits

Yuko Kuroki - CENTAI Institute

Oct 10, 2024

AI in drug discovery - from model to process, from academic publication to decision-making

Andreas Bender - University of Cambridge

Sep 19, 2024
