Weight configurations where two or more weight vectors are identical have been studied as overlap singularities. If such points are also critical points of the loss function being optimized, they are called permutation points and have been proven to be saddle points. My Master's thesis is an attempt to study empirically if and when such pathological configurations have a practical impact on neural network optimization.
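As a point of reference, a minimal formalisation for a one-hidden-layer network is sketched below; the notation is my own shorthand rather than taken from a particular paper.

```latex
% One-hidden-layer network with H hidden units and activation \sigma:
%   f(x) = \sum_{i=1}^{H} v_i \, \sigma(w_i^{\top} x)
% Overlap singularity: a configuration with w_i = w_j for some i \neq j.
% Permutation point: such a configuration that is additionally a critical
% point of the loss, i.e. \nabla_\theta L(\theta) = 0.
f(x) = \sum_{i=1}^{H} v_i \, \sigma(w_i^{\top} x),
\qquad w_i = w_j \ (i \neq j),
\qquad \nabla_\theta L(\theta) = 0 .
```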
Requirements: PyTorch, NumPy, Matplotlib
To generate the simulations described in the thesis, edit the `experiment_settings.yaml` file appropriately and run `main.py`. The Jupyter notebooks provided also show examples of how to use the plotting modules.
A direct way to detect whether two or more weight vectors are becoming aligned during training is to monitor the histogram of the cosine similarities between all pairs of weight vectors in a given hidden layer; a minimal sketch of this computation follows the list below. All data presented here are from networks trained on the MNIST digit classification task, using two architectures:
- A typical network with a single hidden layer (40 hidden units)
- A network with two hidden layers (20, 500), i.e., an overparametrized layer following a bottleneck
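As a rough illustration of the monitoring described above, the sketch below computes the pairwise cosine similarities of a hidden layer's incoming weight vectors and plots their histogram. The `pairwise_cosines` helper and the `torch.nn.Linear(784, 40)` stand-in layer are only illustrative and are not part of the repository's code.

```python
import torch
import torch.nn.functional as F
import matplotlib.pyplot as plt

def pairwise_cosines(weight: torch.Tensor) -> torch.Tensor:
    """Cosine similarity between all pairs of incoming weight vectors.

    `weight` is the (out_features, in_features) matrix of a hidden layer,
    so each row is one unit's incoming weight vector.
    """
    normed = F.normalize(weight, dim=1)            # unit-norm rows
    cos = normed @ normed.t()                      # (n_units, n_units) similarity matrix
    i, j = torch.triu_indices(cos.shape[0], cos.shape[1], offset=1)
    return cos[i, j]                               # upper triangle, diagonal excluded

# Illustrative stand-in for a trained hidden layer (40 units, 784 MNIST inputs).
layer = torch.nn.Linear(784, 40)

with torch.no_grad():
    cos = pairwise_cosines(layer.weight)

plt.hist(cos.numpy(), bins=50)
plt.xlabel("pairwise cosine similarity")
plt.ylabel("count")
plt.show()
```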
We see that overlaps very close to 1 are not typically encountered, even with a large number of units. However, the overparametrized layer in the second case shows some interesting behaviour. We can plot the pairwise cosines at different stages of training as a function of their initial values to see whether units with partially aligned weight vectors are able to decorrelate over the course of training.
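A rough sketch of that comparison, reusing the `pairwise_cosines` helper from the previous snippet: record the pairwise cosines at initialization, train, record them again, and scatter one against the other. The training loop is elided here, and the layer shape is only an example for the overparametrized layer.

```python
import torch
import torch.nn.functional as F
import matplotlib.pyplot as plt

def pairwise_cosines(weight: torch.Tensor) -> torch.Tensor:
    # Upper-triangular pairwise cosine similarities of the rows of `weight`
    # (same helper as in the previous snippet).
    normed = F.normalize(weight, dim=1)
    cos = normed @ normed.t()
    i, j = torch.triu_indices(cos.shape[0], cos.shape[1], offset=1)
    return cos[i, j]

# Illustrative stand-in for the overparametrized layer (500 units, 20 inputs).
layer = torch.nn.Linear(20, 500)

with torch.no_grad():
    initial = pairwise_cosines(layer.weight).clone()

# ... train the network here; the training loop itself is omitted from this sketch ...

with torch.no_grad():
    final = pairwise_cosines(layer.weight)

plt.scatter(initial.numpy(), final.numpy(), s=2, alpha=0.5)
plt.plot([-1, 1], [-1, 1], linestyle="--")   # identity line: pairs that did not change
plt.xlabel("initial pairwise cosine")
plt.ylabel("pairwise cosine after training")
plt.show()
```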