This is a Keras layer that acts as a multiplexer for `Dense` layers (or any other layer with 1D output).
The layer splits the output of the previous layer into N groups of size `output_dim`, and uses a discrete control signal to choose which group to activate as output. During training, only the weights of the active group are updated, while the others remain unchanged.
The layer takes two tensors as input, namely the output of the previous layer and a column tensor of type `int32` or `int64` for the control signal. The input to `Multiplexer` (i.e. the output of the previous layer) must be of shape `(None, N * output_dim)`, and the values in the control tensor must be between 0 (inclusive) and N (exclusive).
No runtime checks ensure that the input to the layer has the correct shape or that the control signal contains legal values, so double-check both on your end.
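Conceptually, the selection boils down to reshaping the flat input into N groups and gathering one group per sample. Below is a minimal sketch of that idea in raw TensorFlow; it illustrates the mechanism only and is not the actual code in `multiplexer.py` (the function name `select_group` is hypothetical):

```python
import tensorflow as tf

def select_group(x, control, output_dim, nb_ctrl_sig):
    # Hypothetical sketch of the selection logic, not the actual layer code.
    # x: (batch, nb_ctrl_sig * output_dim), control: (batch, 1) int tensor
    batch = tf.shape(x)[0]
    groups = tf.reshape(x, (batch, nb_ctrl_sig, output_dim))  # (batch, N, output_dim)
    idx = tf.cast(tf.reshape(control, (-1,)), tf.int32)       # (batch,)
    pairs = tf.stack([tf.range(batch), idx], axis=1)          # (row, group) index pairs
    return tf.gather_nd(groups, pairs)                        # (batch, output_dim)
```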
While essentially implementing a controlled version of `Dropout`, this layer is especially useful when learning a multidimensional function whose output is conditioned on a discrete variable.
An example of this is deep Q-learning, where the Q-function depends on a discrete action. In the DQN paper by DeepMind, the Q-network is trained by setting the target equal to the network's output on all actions except the one being updated, as follows:
```python
# Not using any particular notation
for sample in batch:
    # ...
    target = q_network.predict(sample.state)
    target[sample.action] = sample.reward + \
        df * max(q_network.predict(sample.state_))
    # ...
q_network.fit(states, targets)
```
This requires an extra forward pass of the Q-network to compute the target, which is not really necessary. With the `Multiplexer` layer, the same result can be achieved by simply feeding the action to the network as a separate input and updating only the associated weights (see the example below for implementation details).
It seemed overkill to package this as a library, so just copy `multiplexer.py` into your project to use it. Note that the layer only works with Keras>=2.0.0 and the TensorFlow backend.
This example implements the NN represented in the images above.
```python
from numpy import array
from numpy.random import randn
from keras.models import Model
from keras.layers import Input, Dense
from multiplexer import Multiplexer

# Model definition
input = Input(shape=(3,))
control = Input(shape=(1,), dtype='int32')
hidden = Dense(6)(input)  # output_dim == 2, nb_ctrl_sig == 3
output = Multiplexer(2, 3)([hidden, control])

# Build and compile model
model = Model(inputs=[input, control], outputs=output)
model.compile('sgd', 'mse')

# Data
x = randn(1, 3)  # Batch with one sample of size 3

# Outputs the first two neurons of the Dense layer
model.predict([x, array([0])])

# Outputs the middle two neurons of the Dense layer
model.predict([x, array([1])])

# Outputs the last two neurons of the Dense layer
model.predict([x, array([2])])
```
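To see the selective updates in action, the model above can be fit on random data with a random control signal. This is a minimal sketch; the batch size of 32 and the extra `randint` import are arbitrary choices, not part of the original example:

```python
from numpy.random import randint

# Random training batch: inputs, control signals, and one target per group
x_batch = randn(32, 3)
ctrl_batch = randint(0, 3, size=(32, 1))  # control values in [0, 3)
y_batch = randn(32, 2)

# Only the weights feeding each sample's active group receive updates
model.fit([x_batch, ctrl_batch], y_batch, epochs=1)
```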
To adapt this example to the DQN case, we would use two different models (`q_net_train` for training and `q_net_test` for testing), respectively with output layers `output` and `hidden`, and the `Multiplexer` layer configured with `output_dim == 1` and `nb_ctrl_sig == nb_actions`.
We could then use `sample.reward + df * max(q_net_test.predict(sample.state_))` as the single target, and pass `sample.state` and `sample.action` as input to `q_net_train`.
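For concreteness, here is a minimal sketch of how the two models could be wired; `state_dim` and `nb_actions` are hypothetical values, and the single `Dense` layer stands in for an arbitrary Q-network body:

```python
from keras.models import Model
from keras.layers import Input, Dense
from multiplexer import Multiplexer

state_dim = 4   # hypothetical state size
nb_actions = 3  # hypothetical number of discrete actions

state = Input(shape=(state_dim,))
action = Input(shape=(1,), dtype='int32')
hidden = Dense(nb_actions)(state)  # one Q-value per action
output = Multiplexer(1, nb_actions)([hidden, action])

# Training model: outputs the scalar Q(s, a) for the given action
q_net_train = Model(inputs=[state, action], outputs=output)
q_net_train.compile('sgd', 'mse')

# Testing model: outputs the Q-values of all actions for a state
q_net_test = Model(inputs=state, outputs=hidden)
```

Since the two models share their layers, fitting `q_net_train` also updates the weights that `q_net_test` reads at prediction time.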
Thanks to @carloderamo for porting the previous implementation to Keras 2.