Linear regions of networks with piecewise linear activation functions play an important role in understanding such networks. Since networks with a larger number of more evenly spread linear regions are believed to be able to approximate a richer class of functions, it may be beneficial to maximize their number. We implement our own initialization strategy with this aim and run experiments comparing it with more standard strategies. We trained the networks (with 2 hidden layers of 10 units each and ReLU activation) for classification on the MNIST data set. The code for the experiments with isotropic scaling can be found on the isoscale branch.
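
The sketch below illustrates the experimental setup described above: a small MLP with 2 hidden layers of 10 ReLU units for MNIST classification, together with an initialization hook. It is a minimal sketch assuming PyTorch; the `custom_init` function is a hypothetical placeholder (shown here with a standard Kaiming baseline), not the initialization strategy implemented in this repository.

```python
# Minimal sketch (assumes PyTorch); `custom_init` is a hypothetical placeholder,
# not the initialization strategy from this repository.
import torch
import torch.nn as nn

class SmallReLUNet(nn.Module):
    """MLP with 2 hidden layers of 10 ReLU units, for 10-class MNIST classification."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),            # 28x28 images -> 784-dimensional vectors
            nn.Linear(784, 10),
            nn.ReLU(),
            nn.Linear(10, 10),
            nn.ReLU(),
            nn.Linear(10, 10),       # logits for the 10 digit classes
        )

    def forward(self, x):
        return self.net(x)

def custom_init(module):
    # Placeholder: a standard Kaiming/He baseline; the repository's own strategy
    # would replace this to influence the number and spread of linear regions.
    if isinstance(module, nn.Linear):
        nn.init.kaiming_normal_(module.weight, nonlinearity="relu")
        nn.init.zeros_(module.bias)

model = SmallReLUNet()
model.apply(custom_init)
```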
ZivaUrbancic/Maxout_Initializations
About
Code for our project on initialization strategies for DNN with maxout activation functions.
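
For reference, a maxout unit computes the maximum over k affine transformations of its input, which makes the resulting network piecewise linear. The following is a minimal sketch of such a layer, assuming PyTorch; the layer sizes and the rank k are illustrative and not taken from this repository's code.

```python
# Minimal sketch of a maxout layer, assuming PyTorch; sizes and k are illustrative.
import torch
import torch.nn as nn

class Maxout(nn.Module):
    """Maxout unit: the activation is the maximum over k affine transformations."""
    def __init__(self, in_features, out_features, k=2):
        super().__init__()
        self.out_features = out_features
        self.k = k
        # One linear map producing k candidate pre-activations per output unit.
        self.linear = nn.Linear(in_features, out_features * k)

    def forward(self, x):
        z = self.linear(x)                                   # (batch, out_features * k)
        z = z.view(*z.shape[:-1], self.out_features, self.k)
        return z.max(dim=-1).values                          # elementwise max over the k pieces

# Example: a maxout layer of rank k = 2 acting on flattened MNIST inputs.
layer = Maxout(784, 10, k=2)
out = layer(torch.randn(32, 784))   # -> shape (32, 10)
```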