Skip to content

v1.0

Compare
Choose a tag to compare
@Kaixhin Kaixhin released this 18 May 21:00
· 28 commits to master since this release

Pretrained models for DeepMind Control Suite environments. Note that performance is roughly comparable to the original, except for ball_in_cup-catch (which has high variance).

Some models are trained for 2000 episodes as per previous versions of the paper, not 1000 episodes as per the final version (but all model checkpoints provided are from 1000 episodes). Does not include all preprocessing steps (5-bit quantisation and centering of observations).

cartpole-balance

newplot (7)

cartpole-swingup

newplot (8)

reacher-easy

newplot (12)

finger-spin

newplot (9)

cheetah-run

newplot (10)

ball_in_cup-catch

newplot

walker-walk

newplot (11)