Leveraging exploration in off-policy algorithms via normalizing flows

Reference

Leveraging exploration in off-policy algorithms via normalizing flows, Bogdan Mazoure, Thang Doan, Audrey Durand, Joelle Pineau, R. Devon Hjelm. Conference on Robot Learning(2020)

Publication

Abstract

The ability to discover approximately optimal policies in domains with sparse rewards is crucial to applying reinforcement learning (RL) in many real-world scenarios. Approaches such as neural dens...

Content citing this item

Seminar

Normalizing Flows for Policy Representation in Reinforcement Learning

In part 3 on Normalizing Flows, we will discuss how Reinforcement Learning could benefit from this class of methods for policy …

Frederik Heetmeyer

Generative Models

Aug 20, 2020

All works referenced in our site...