Efficient robot skill learning

日付:

2020年5月13日

著者:

Hrvoje Stojic



Abstract

For autonomous robots to operate in the open, dynamically changing world, they will need to be able to learn a robust set of skills from relatively little experience.  This talk begins by introducing Grounded Simulation Learning as a way to bridge the so-called reality gap between simulators and the real worl in order to enable transfer learning from simulation to a real robot (sim-to-real).  It then introduces two new algorithms for imitation learning from observation that enable a robot to mimic demonstrated skills from state-only trajectories, without any knowledge of the actions selected by the Demonstrator. Grounded Simulation Learning has led to the fastest known stable walk on a widely used humanoid robot, and imitation learning from observation opens the possibility of robots learning from the vast trove of videos available online.


Notes

  • The talk covers material in the following published papers on Grounded Simulation Learning [1,2,3] and Imitation Learning from Observation [4,5,6].

  • Peter Stone is a Professor at the Department of Computer Science, University of Texas at Austin, and an Executive Director at Sony AI America. His personal website can be found here.

ソーシャルメディアで共有

ソーシャルメディアで共有

ソーシャルメディアで共有

関連するセミナー

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin - University of Cambridge

2025/03/13

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin - University of Cambridge

2025/03/13

Return of the latent space cowboys: rethinking the use of VAEs in Bayesian optimisation over structured spaces

Henry Moss - University of Cambridge, Lancaster University

2025/01/21

Return of the latent space cowboys: rethinking the use of VAEs in Bayesian optimisation over structured spaces

Henry Moss - University of Cambridge, Lancaster University

2025/01/21

Advancing sequential decision-making: efficient querying in clustering and best of both worlds for contextual bandits

Yuko Kuroki - CENTAI Institute

2024/10/10

Advancing sequential decision-making: efficient querying in clustering and best of both worlds for contextual bandits

Yuko Kuroki - CENTAI Institute

2024/10/10

AI in drug discovery - from model to process, from academic publication to decision-making

Andreas Bender - University of Cambridge

2024/09/19

AI in drug discovery - from model to process, from academic publication to decision-making

Andreas Bender - University of Cambridge

2024/09/19