mnist-training-errors

  e_t<-
 / /
s_f

Introduction

Sequential DNN Supervised Learning with TF is based on: www.tensorflow.org/tutorials/quickstart/beginner

Used in 'TFConfigTrain' to test training sets+epochs, DNN Layer config on digit recognition accuracy
- "tf-test train" to train (+ log results in google sheets) a DNN for MNIST digit recognition
Used in 'TFConfigRL' to measure the efficacy of retraining at runtime a 50% accuracy start level

This was a "work in progress" for Reinforcement Learning patterns like the human brain. It created utilities like:
- "tf-test bitwise" to test the self review process
- "tf-test mnist" to get a good example of each digit for the self review
- "tf-test image" to train the DNN to label the original MNIST image with a digit graphic
- "tf-test cpd" to regularly retrain a low trained DNN (like people do with "continuing professional development")

Categorical DQN Reinforcement Learning with TF is based on: https://github.com/tensorflow/agents/tree/master/docs/tutorials

CartPole training with Categorical DQN

I changed the original tutorial video of multiple CartPole episodes at the end of training, to a one episode video every evaluation interval of 1000 training steps. This shows the increased time of a valid CartPole episode as the Categorical DQN is reinforcement trained.

The videos are stored in 'video/[Incremented Integer]/' directory along with the graph of “average return” (ie average step number of 10 episodes for the agent in an “evaluation gym”) for each interval.

"tf-test cartpole"

Install steps

$ brew install graphviz
$ brew install ffmpeg

$ pip install gspread==3.6.0
$ pip install oauth2client
$ pip install pydot
$ pip install 'imageio==2.4.0'
$ pip install pyvirtualdisplay
$ pip install pyglet
$ pip install --upgrade tensorflow
$ pip install --upgrade tf-agents
$ pip install pandas
$ pip install click

$ git clone https://github.com/doughazell/mnist-training-errors.git

Get 'client_secret.json' (originally got via https://www.twilio.com/blog/2017/02/an-easy-way-to-read-and-write-to-a-google-spreadsheet-in-python.html) from Google (and then adapt the spreadsheet code for your spreadsheet layout)

Run 'tf-test'

$ cd mnist-training-errors
$ ./tf-test --help

  Usage: tf-test [OPTIONS] COMMAND [ARGS]...

    The Sequential DNN is trained with MNIST digits:

      a) high level with Supervised Learning

      b) 50% level prior to Reinforcement Learning

    Categorical DQN uses OpenAI Gym for CartPole physics

  Options:
    --help  Show this message and exit.

  Commands:
    bitwise   Test correlation of 'y_test[index]' with bitwise-AND
    cartpole  Train the TF Categorical DQN with Reinforcement Learning
    cpd       Train the TF Sequential DNN with intermittent retraining
    image     Train the TF Sequential DNN for image manipulation
    mnist     Get example of an image for each digit
    rl        Train the TF Sequential DNN with Reinforcement Learning
    train     Train the TF Sequential DNN with Supervised Learning

Issues

Reverb only works on Linux (prob for commercial reasons) so this repo is platform independent (and done on my MacBook Pro) by using 'TFUniformReplayBuffer' (and not 'ReverbReplayBuffer' used in '1_dqn_tutorial.ipynb')

tensorflow/agents#532 (comment)

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
README.md		README.md
digitDictionaryINTEGER.pkl		digitDictionaryINTEGER.pkl
digitDictionarySELECTED.pkl		digitDictionarySELECTED.pkl
digitOutput0		digitOutput0
digitOutput1		digitOutput1
digitOutput2		digitOutput2
dqn_c51.py		dqn_c51.py
gspread_errors.py		gspread_errors.py
gspread_rl.py		gspread_rl.py
gspread_rl_parts.py		gspread_rl_parts.py
gym_config.py		gym_config.py
predicted-errors.txt		predicted-errors.txt
suite_gym_utils.py		suite_gym_utils.py
tf-test		tf-test
tf_config.py		tf_config.py
tf_config_image.py		tf_config_image.py
tf_config_misc.py		tf_config_misc.py
tf_config_rl.py		tf_config_rl.py
tf_config_train.py		tf_config_train.py
tf_model.py		tf_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mnist-training-errors

Introduction

CartPole training with Categorical DQN

Install steps

Run 'tf-test'

Issues

About

Releases

Packages

Languages

doughazell/mnist-training-errors

Folders and files

Latest commit

History

Repository files navigation

mnist-training-errors

Introduction

CartPole training with Categorical DQN

Install steps

Run 'tf-test'

Issues

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages