A TensorFlow implementation of DeepMind's MuZero algorithm for learning to play games through self-play, without any prior knowledge of their rules. The algorithm is implemented as described in the original paper and pseudocode. It supports prioritized replay and is parallelized with Ray. The repo structure is based on muzero-pytorch.
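For reference, prioritized replay in the MuZero paper weights each stored position by the gap between the root search value and the observed n-step return, then corrects the sampling bias with importance weights. The sketch below illustrates that sampling rule with plain NumPy; the function name and signature are illustrative assumptions, not this repo's API.

```python
import numpy as np

# Minimal sketch of MuZero-style prioritized sampling: priority is the
# absolute difference between the root search value and the n-step return.
# All names here are illustrative, not taken from this repo.
def sample_batch(search_values, target_returns, batch_size, alpha=1.0, beta=1.0):
    priorities = np.abs(np.asarray(search_values) - np.asarray(target_returns)) + 1e-6
    probs = priorities ** alpha
    probs /= probs.sum()
    indices = np.random.choice(len(probs), size=batch_size, p=probs)
    # Importance-sampling weights correct for the non-uniform sampling;
    # they scale each sampled position's contribution to the loss.
    weights = (1.0 / (len(probs) * probs[indices])) ** beta
    weights /= weights.max()
    return indices, weights
```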
Train: python main.py --mode train --env CartPole-v1 --force
Test: python main.py --mode test --env CartPole-v1 --force
TensorBoard: tensorboard --logdir=result_dir
At the moment, the code has only been tested on simple OpenAI Gym environments such as CartPole. Results are fairly sensitive to the choice of hyperparameters.
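For a rough sense of what needs tuning, a CartPole-sized configuration might look like the sketch below. The keys mirror the standard MuZero hyperparameters (MCTS simulations, discount, unroll length, TD steps, learning rate), but the specific values are illustrative assumptions rather than this repo's defaults.

```python
# Illustrative CartPole-scale settings; these values are assumptions, not
# the repo's tuned defaults. Small changes (especially to num_simulations,
# lr, and td_steps) can noticeably change training results.
config = {
    "num_simulations": 50,      # MCTS simulations per move
    "discount": 0.997,          # reward discount used for value targets
    "num_unroll_steps": 5,      # steps the dynamics model is unrolled during training
    "td_steps": 10,             # n-step bootstrapping horizon for value targets
    "lr": 0.05,                 # initial learning rate
    "batch_size": 128,
    "replay_buffer_size": 500,  # number of self-play games kept in memory
}
```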