Rich's Slogan

Approximate the solution, not the problem (no special cases)
Drive from the problem
Take the agent’s point of view
Don’t ask the agent to achieve what it can’t measure
Don't ask the agent to know what it can't verify
Set measurable goals for subparts of the agent
Discriminative models are usually better than generative models
Work by orthogonal dimensions. Work issue by issue
Work on ideas, not software
Experience is the data of AI
Don’t be impressed by what you don’t understand
Ideas matter, you should care of that

Interesting:

Model-free RL: TD, E-TD

Model-based RL: feature construction, state constraction

Option abstraction

Super Dyna, Dyna with function approximation

gitbook对数学公式不是很友好，但是代码高亮不错，决定以后在数学公式方面侧重于用印象笔记记录，部分代码方面有gitbook记录。

聊了很多今晚。

Dyna with FA

how to sync with github？

{% hint style="info" %} Drafts --> Merge {% endhint %}

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
.gitbook/assets		.gitbook/assets
experiment/openai-gym		experiment/openai-gym
keywords		keywords
python-and-ide		python-and-ide
README.md		README.md
SUMMARY.md		SUMMARY.md
existing-problems.md		existing-problems.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rich's Slogan

About

Releases

Packages

jkren6/Notes_for_RL

Folders and files

Latest commit

History

Repository files navigation

Rich's Slogan

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages