Reinforcement Learning Algorithms

For a Reinforcement Learning class, I worked on a few algorithms :

Policy Iteration
Value Iteration
SARSA
Q-Learning

to work on the OpenAI gym Cliff Walking problem (for SARSA and Q-Learning) and Sutton's Reinforcement Learning book Grid World exercice (for Policy Iteration and Value Iteration).

How to use ?

python main.py {RD, VI, PI, SARSA, QL} With {RD: Random, VI: Value Iteration, PI: Policy Iteration, SARSA: SARSA, QL: Q-Learning} Use python main.py -h to know more.

For Policy Iteration and Value Iteration, plots will appear, showing a map for each move (LEFT, RIGHT, UP, DOWN) colored when the given move is the best for the square. For SARSA and Q-Learning, plots will appear, showing the final reward after each episode. The parameters have been tuned so that the learning works (reward increase along the episodes).

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
agents.py		agents.py
environment.py		environment.py
main.py		main.py
new_agent.py		new_agent.py
runner.py		runner.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Algorithms

How to use ?

About

Releases

Packages

Languages

wesbz/RLAlgo

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Algorithms

How to use ?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages