GitHub - ergo-zyh/Reinforcement_Learning_Notes: A naive version.

Reinforcement Learning Notes

The (introductory) notes included Bandit Algorithms, MDP, Model-free Methods, Value Function Approximation, Policy Optimization. For the state-of-the-art advances, one can refer to paper directly and some excellent blog.

Hope you enjoy your learning.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
README.md		README.md
Reinforcement Learning Notes.pdf		Reinforcement Learning Notes.pdf
Section 1 Introduction.pdf		Section 1 Introduction.pdf
Section 2 Probability.pdf		Section 2 Probability.pdf
Section 3 Bandit Algorithms.pdf		Section 3 Bandit Algorithms.pdf
Section 4 Markov Chains.pdf		Section 4 Markov Chains.pdf
Section 5 Markov Decision Process.pdf		Section 5 Markov Decision Process.pdf
Section 6 Model-Free Prediction.pdf		Section 6 Model-Free Prediction.pdf
Section 7 Model-Free Control.pdf		Section 7 Model-Free Control.pdf
Section 8 Value Function Approximation.pdf		Section 8 Value Function Approximation.pdf
Section 9 Policy Gradient.pdf		Section 9 Policy Gradient.pdf
Table of Contents.pdf		Table of Contents.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Notes

About

Releases

Packages

ergo-zyh/Reinforcement_Learning_Notes

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages