Reinforcement learning in the real world: How to “cheat” and still feel good about it

日付:

2020年12月17日

著者:

Hrvoje Stojic



Abstract

Reinforcement learning has had many successes, but significant amounts of time and/or data can be required to reach acceptable performance. If agents or robots are to be deployed in real-world environments, it is critical that our algorithms take advantage of existing data and human knowledge. This talk will discuss a selection of recent work that improves reinforcement learning by leveraging demonstrations and feedback from imperfect users, with an emphasis on how interactive machine learning can be extended to best leverage the unique abilities of both computers and humans.


Notes


  • Dr Matthew E. Taylor is an Associate Professor of Computing Science at the University of Alberta and a Fellow and Fellow-in-Residence at the Alberta Machine Intelligence Institute (Amii). He is the Director of the Intelligent Robot Learning (IRL) Lab (irll.ca)and a Principal Investigator at the Reinforcement Learning & Artificial Intelligence (RLAI) Lab, both at the University of Alberta.

  • His publication record on Google Scholar can be found here, and personal website here

ソーシャルメディアで共有

ソーシャルメディアで共有

ソーシャルメディアで共有

関連するセミナー

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin - University of Cambridge

2025/03/13

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin - University of Cambridge

2025/03/13

Return of the latent space cowboys: rethinking the use of VAEs in Bayesian optimisation over structured spaces

Henry Moss - University of Cambridge, Lancaster University

2025/01/21

Return of the latent space cowboys: rethinking the use of VAEs in Bayesian optimisation over structured spaces

Henry Moss - University of Cambridge, Lancaster University

2025/01/21

Advancing sequential decision-making: efficient querying in clustering and best of both worlds for contextual bandits

Yuko Kuroki - CENTAI Institute

2024/10/10

Advancing sequential decision-making: efficient querying in clustering and best of both worlds for contextual bandits

Yuko Kuroki - CENTAI Institute

2024/10/10

AI in drug discovery - from model to process, from academic publication to decision-making

Andreas Bender - University of Cambridge

2024/09/19

AI in drug discovery - from model to process, from academic publication to decision-making

Andreas Bender - University of Cambridge

2024/09/19