MAML_Pytorch_RL

This repo contains code for the RL experiments of Model-Agnostic Meta-Learning (MAML). Install the requirements (requirements.txt) and obtain a MuJoCo license before running the experiments. The repo also includes an implementation that uses PPO as the meta-optimizer in place of the TRPO used by the MAML authors. This work was done as part of the RL Course Project (Monsoon 2020); see the Project Report.

Usage

Training for the Navigation task. Change the --env-name argument to switch experiments.

python main.py --env-name 2DNavigation-v0 --fast-lr 0.1 --maml
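The --fast-lr flag is assumed here to set the step size of MAML's inner-loop (per-task) update. The sketch below shows that update on a toy quadratic loss. It is not the repository's code, and every name in it (task_loss, task_grad, adapt) is hypothetical.

```python
# Toy illustration of MAML's inner-loop update:
#   theta' = theta - fast_lr * grad L_task(theta)
# All names are hypothetical; none come from the repository.

def task_loss(theta, target):
    """Squared distance between a scalar parameter and the task's target."""
    return (theta - target) ** 2


def task_grad(theta, target):
    """Analytic gradient of task_loss with respect to theta."""
    return 2.0 * (theta - target)


def adapt(theta, target, fast_lr=0.1, num_steps=1):
    """Take num_steps plain gradient steps on one task, starting from theta."""
    for _ in range(num_steps):
        theta = theta - fast_lr * task_grad(theta, target)
    return theta


theta = 0.0                                      # meta-learned initialisation
adapted = adapt(theta, target=1.0, fast_lr=0.1)  # one adaptation step
```

With a single step the adapted parameter moves from the shared initialisation towards the task's target. The meta-optimizer (TRPO or PPO in this repo) then updates the initialisation so that such steps work well across tasks.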

Training for the Locomotion task using PPO as the meta-optimizer.

python main_ppo2.py --env-name HalfCheetahVel-v1 --fast-lr 0.1 --maml --meta-lr 0.1 --critic_weight 0.005 --eps_clip 0.2
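For orientation, the following is a minimal sketch of the standard PPO clipped surrogate objective. It assumes that --eps_clip is the clipping range and that --critic_weight scales a value-function loss added to the policy loss. How main_ppo2.py actually combines these terms is not stated here, so treat the function names and the way the terms are summed as assumptions.

```python
# Sketch of the PPO clipped surrogate loss in plain Python.
# Assumption: eps_clip is the clipping range and critic_weight scales a
# squared-error value loss; the function names are hypothetical.

def clipped_surrogate(ratios, advantages, eps_clip=0.2):
    """Mean PPO clipped objective, negated so that it can be minimised."""
    terms = []
    for ratio, adv in zip(ratios, advantages):
        clipped_ratio = max(1.0 - eps_clip, min(ratio, 1.0 + eps_clip))
        # Pessimistic bound: take the smaller of the two surrogate terms.
        terms.append(min(ratio * adv, clipped_ratio * adv))
    return -sum(terms) / len(terms)


def value_loss(values, returns):
    """Mean squared error between predicted values and observed returns."""
    return sum((v - g) ** 2 for v, g in zip(values, returns)) / len(values)


def total_loss(ratios, advantages, values, returns,
               eps_clip=0.2, critic_weight=0.005):
    """Policy loss plus the weighted critic loss."""
    policy_term = clipped_surrogate(ratios, advantages, eps_clip)
    return policy_term + critic_weight * value_loss(values, returns)


policy_term = clipped_surrogate([1.5, 0.5], [1.0, -1.0])
loss = total_loss([1.5, 0.5], [1.0, -1.0], [1.0, 2.0], [2.0, 2.0])
```

Here ratios stands for the probability ratios between the new and old policies. Clipping them to [1 - eps_clip, 1 + eps_clip] limits how far a single meta-update can move the policy.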

Testing

This script tests the meta-trained policies and plots average return against the number of gradient steps taken for adaptation at test time.

python test_and_plot.py
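The quantity being plotted can be sketched as follows. This is an illustration only, with a hypothetical data layout (returns[task][step] holding the episode returns collected after that many gradient steps) and made-up placeholder numbers, not results from the repository.

```python
# Sketch: average return as a function of the number of adaptation steps.
# The nested-list layout and the numbers are placeholders for illustration.
from statistics import mean


def average_returns_per_step(returns):
    """returns[task][step] is a list of episode returns after `step` gradient steps."""
    num_steps = len(returns[0])
    return [
        mean(mean(task[step]) for task in returns)  # mean over episodes, then tasks
        for step in range(num_steps)
    ]


returns = [
    [[-40.0, -44.0], [-20.0, -24.0]],  # task 0: step 0, step 1
    [[-50.0, -54.0], [-30.0, -26.0]],  # task 1: step 0, step 1
]
curve = average_returns_per_step(returns)  # one value per gradient-step count
```

Each entry of curve is one point on the plot, with the x-axis being the number of gradient steps.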

Other Scripts

plot_eval_curves.py : Plots average return against the number of iterations. Run it after downloading the testing curves from TensorBoard in JSON format.
demo_cheetah.py : Visualizes (in MuJoCo) the performance of trained policies on the HalfCheetah environment and saves a video of the visualization.
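As a rough guide to the JSON input that plot_eval_curves.py expects, the sketch below parses a scalar curve. It assumes the download follows TensorBoard's usual scalar export layout, a list of [wall_time, step, value] rows. The helper name load_scalar_curve and the sample values are hypothetical, and the script's own parsing may differ.

```python
# Sketch: read a scalar curve exported from TensorBoard as JSON.
# Assumed layout: [[wall_time, step, value], ...]; sample values are placeholders.
import json


def load_scalar_curve(json_text):
    """Return (steps, values) from a TensorBoard-style scalar JSON export."""
    rows = json.loads(json_text)
    steps = [int(row[1]) for row in rows]
    values = [float(row[2]) for row in rows]
    return steps, values


example = "[[1600000000.0, 0, -45.0], [1600000060.0, 1, -30.5]]"
steps, values = load_scalar_curve(example)
```

For a file on disk, pass the result of reading the file as text to the same function.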

Acknowledgement

This code is an extension of the repo by Luisa M Zintgraf.


anishmadan23/MAML_Pytorch_RL
