Examples of regression? #166
Replies: 29 comments
-
I would second this question, as it is not obvious to me how to decode spikes to the real line. If you decode rate-based, then the prediction for large numbers would take more spikes (= more expensive?). A simple example would be great!
-
Have you got any example of linear regression using snnTorch? I'm trying to test a simple f(x) = x linear regression problem with snnTorch, following Tutorial 5, but I could not improve the training loss. I'm using MSELoss as the loss function and stochastic gradient descent (SGD) as the optimizer.
-
I actually used MSELoss and Adam in one of my projects, although I'm sure it could still use some work (see my repo on event-based velocity prediction). I've also been searching the literature for appropriate loss and optimization functions for SNNs in general.
-
I also used Adam but could not get the model to improve. I think my code has a problem. I have posted about it in the Discussions section in case you or someone else could help me find where I made a mistake.
-
The way I see it, there are a few ways to perform regression:

1. Train the membrane potential of the output layer to take on the target value (no spiking at the output).
2. Train the total spike count of the output neurons to represent the target value (rate code).
3. Train the timing of the output spikes to represent the target value (temporal code).

The most effective way to implement '1' would be to ensure the output layer has the reset mechanism disabled (e.g., reset_mechanism="none") and to apply the loss directly to the membrane potential. The approach for '2' is quite straightforward, as you would simply sum together all of the spikes and set that sum to be the desired target value. The limiting factors here are i) the quantised nature of spike counts (i.e., the count can only take on discrete values / natural numbers), and ii) the maximum permissible spike count is the total number of time steps. The second issue can be lessened by using multiple neurons to emit multiple spikes. The approach for '3' is less explored in the context of a PyTorch backend, and in general, I find it is far less stable than rate-based loss functions. But I've put together a loss function with a few examples:

Let me know what types of problems you're trying to tackle, and I can account for them in future tutorials.
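For instance, the core of a rate-coded loss along the lines of '2' might look something like this (untested sketch with dummy tensors; `spk_rec` stands for the output spikes stacked over time, and the target is assumed to be normalised to [0, 1]):

```python
import torch

num_steps, batch, num_outputs = 100, 1, 1
spk_rec = (torch.rand(num_steps, batch, num_outputs) < 0.3).float()  # dummy output spike train
target = torch.tensor([[0.3]])                                       # desired value, scaled to [0, 1]

spike_count = spk_rec.sum(dim=0)             # spikes emitted by each output neuron
prediction = spike_count / num_steps         # normalise the count back to [0, 1]
loss = torch.nn.functional.mse_loss(prediction, target)
print(loss)
```

The quantisation issue mentioned above shows up here directly: `prediction` can only take `num_steps + 1` distinct values per neuron.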
-
Thank you very much for the extensive answer! To begin with, a simple affine transformation of a temporal (linear) sequence would be helpful. That is, x = [0, 1, 2, ..., t] / t (i.e., a linspace normalized between 0 and 1) and y = ax + b for some real numbers a, b (possibly chosen randomly for every sample). I experimented with an unrolled spiking LSTM in combination with classical dense layers as a wrapper (also using different libraries), where I was able to overfit on small values of t, but by no means to the precision of classical LSTMs. So basically, I am stuck at trying to overfit on a single sample with, say, 25-100 time steps for fixed values of a and b.
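For concreteness, the data I have in mind is something like this (the values of a and b here are only for illustration):

```python
import torch

t = 100                            # number of time steps
a, b = 2.0, 0.5                    # illustrative coefficients
x = torch.linspace(0.0, 1.0, t)    # normalised "time" input
y = a * x + b                      # affine target sequence
```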
-
This should definitely be possible! Are you able to apply MSELoss() to the output membrane potential? I have a notebook in another repo where I train the membrane potential to linearly increase over time given random inputs. I expect learning a linear mapping to be even easier. Check out the ipynb file in this repo for inspo:
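In case it helps, the core of that idea is tiny (schematic with dummy tensors; `mem_rec` stands in for the output membrane potential recorded over time):

```python
import torch

num_steps = 10
mem_rec = torch.rand(num_steps, 1, 1, requires_grad=True)            # recorded output membrane
target = torch.linspace(0.0, 1.0, num_steps).view(num_steps, 1, 1)   # trace it should follow

loss = torch.nn.MSELoss()(mem_rec, target)   # penalise the deviation at every time step
loss.backward()
```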
-
Thank you for the feedback! I just used the normal MSE in the optimizer when I had a standard 'Linear' layer as the output. I'll check your file and report back!
-
It seems to work in principle (your approach 1 using membrane potentials), but I have some (explainable) artifacts in my results. I try to overfit on a single vector [TIME=10, BATCH=1, FEATURE=1], which is a simple linspace (True):

```
True            Prediction
[('4.9626e-01', '0.0000e+00'),
 ('5.2647e-01', '0.0000e+00'),
 ('5.5669e-01', '0.0000e+00'),
 ('5.8691e-01', '0.0000e+00'),
 ('6.1713e-01', '6.1499e-01'),
 ('6.4735e-01', '6.4795e-01'),
 ('6.7757e-01', '6.7761e-01'),
 ('7.0779e-01', '7.0775e-01'),
 ('7.3800e-01', '7.3803e-01'),
 ('7.6822e-01', '7.6820e-01')]
```

I use the following network: real input -> leaky integrator with linear weights -> leaky integrate-and-fire with linear weights -> leaky integrator with linear weights -> real output.

Now, how do I get the "correct" results for the prediction also for the first iterations? I think in the beginning the membrane is not saturated enough...
-
When you say iterations, do you mean time steps? I think it might be challenging to achieve that, as the neuron needs a bit of time to get excited / hit steady state. Could you share the code for your network architecture?
-
I used a different lib for the experiment; I will make a small script in snntorch with the whole code for everyone to check out and post it here. And yes, I mean "time steps". Maybe one could use dummy entries for the first few time steps?
-
Yeah, I know there's a Frontiers in Neuroscience paper out there by Emre Neftci which does exactly that on a classification task: the first 50 time steps do not have an objective function applied, as the input data is thought to be uninformative. In your case, this could go either way, as the input data at the start is still useful. I would also expect that expanding the width of the hidden layer might help here!
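If you do end up discounting the early time steps, the loss-side change is small (sketch; `warmup` is a made-up hyperparameter, and `mem_rec`/`target` are the recorded membrane potentials and targets with time as the leading dimension):

```python
import torch

num_steps, warmup = 10, 3
mem_rec = torch.rand(num_steps, 1, 1, requires_grad=True)             # dummy recorded membrane
target = torch.linspace(0.0, 1.0, num_steps).view(num_steps, 1, 1)

# Apply the objective only after the membrane has had time to charge up.
loss = torch.nn.MSELoss()(mem_rec[warmup:], target[warmup:])
loss.backward()
```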
-
Thanks for pointing out the paper! I already used a width of 1024, which seems to be quite overkill for such a simple task, so I thought there would be a more fundamental error :D
-
Well, I implemented my architecture from the other lib into snntorch and... it worked?!? Here is my code:

```python
"""A simple regression task using snntorch."""
import snntorch
import torch
import torch.utils.data


class Regression_dataset(torch.utils.data.Dataset):
    """Simple regression dataset."""

    def __init__(self, timesteps):
        """Linear relation between input and output."""
        lin_vec = torch.linspace(start=0.0, end=0.1, steps=timesteps)
        self.feature = lin_vec.view(timesteps, 1, 1)
        self.label = self.feature * 10

    def __len__(self):
        """Only one sample."""
        return 1

    def __getitem__(self, idx):
        """General implementation, but we only have one sample."""
        return self.feature[:, idx, :], self.label[:, idx, :]


class SNN(torch.nn.Module):
    """Simple spiking neural network in snntorch."""

    def __init__(self, timesteps, hidden):
        super().__init__()
        self.timesteps = timesteps
        self.hidden = hidden
        self.fc1 = torch.nn.Linear(in_features=1, out_features=self.hidden)
        self.lif = snntorch.Leaky(beta=0.5)
        self.fc2 = torch.nn.Linear(in_features=self.hidden, out_features=1)
        # Output layer: leaky integrator (no reset) whose membrane is the prediction.
        self.li = snntorch.Leaky(beta=0.5, reset_mechanism="none")

    def forward(self, x):
        """Forward pass over all time steps."""
        mem1 = self.lif.init_leaky()
        mem2 = self.li.init_leaky()
        cur3_rec = []
        mem2_rec = []
        for step in range(self.timesteps):
            cur1 = self.fc1(x[step, :, :])
            spk1, mem1 = self.lif(cur1, mem1)
            cur2 = self.fc2(spk1)
            cur3, mem2 = self.li(cur2, mem2)
            cur3_rec.append(cur3)
            mem2_rec.append(mem2)
        return torch.stack(cur3_rec, dim=0), torch.stack(mem2_rec, dim=0)


def main():
    """Training loop and prediction."""
    DEVICE = "cuda"  # assumes a CUDA device is available
    TIMESTEPS = 11
    ITER = 2000
    HIDDEN = 1024

    dataloader = torch.utils.data.DataLoader(
        dataset=Regression_dataset(timesteps=TIMESTEPS)
    )
    model = SNN(timesteps=TIMESTEPS, hidden=HIDDEN).to(DEVICE)
    model.train()
    optimizer = torch.optim.Adam(params=model.parameters(), lr=3e-4)
    loss_function = torch.nn.MSELoss()

    feature = None
    label = None
    loss_val = None
    for i in range(ITER):
        train_batch = iter(dataloader)
        for feature, label in train_batch:
            # The DataLoader returns [batch, time, feature]; the network expects time first.
            feature = torch.swapaxes(input=feature, axis0=0, axis1=1)
            label = torch.swapaxes(input=label, axis0=0, axis1=1)
            feature = feature.to(DEVICE)
            label = label.to(DEVICE)
            cur, mem = model(feature)
            loss_val = loss_function(mem, label)
            optimizer.zero_grad()
            loss_val.backward()
            optimizer.step()
        print(f"Iter: {i}, Loss: {loss_val.detach().cpu().numpy()}")

    with torch.no_grad():
        model.eval()
        _, prediction = model(feature)
        label = torch.squeeze(label).cpu().numpy().tolist()
        prediction = torch.squeeze(prediction).cpu().numpy().tolist()
        result = list(zip(label, prediction))
        for i in result:
            print(i)
    return None


if __name__ == "__main__":
    main()
```

This is of course great news, but now I have to figure out what exactly happened and why the other code is not working. By the way, for large TIMESTEPS I am not able to overfit, I think due to vanishing gradients. I will try your LSTM implementation on that. Best
-
Nice! Keep us updated if you find out why. I've run into some brick walls trying to explain the difference in performance between packages in the past. Hope you have more luck.
-
I did some experiments with regard to the LSTM. I observed the following points:

1. If the threshold for the LSTM is not set to trainable, the loss will converge to a bad level.
2. If the threshold is trainable, I can reach losses on the order of O(1e-5), which is much worse than the architecture consisting only of LIF neurons.
3. reset_mechanism does nothing with respect to the achievable loss.
4. This is all for 10 time steps. My expectation was, as with standard LSTMs, which work just fine, that the spiking LSTM would be able to overfit (a) the 10 time steps and (b) even larger numbers of time steps.

Do you have an idea why this happens or how to improve the LSTM?

P.S.: Concerning the other libs, I will dig into the implementation, write down the formulae and try to come up with a reasonable interpretation. Here is the code to reproduce; you can switch between LSTM and LIF in the first layer by setting LSTM=True/False:

```python
"""A simple regression task using snntorch."""
import random

import numpy
import snntorch
import snntorch.surrogate
import torch
import torch.utils.data

# Seed
torch.manual_seed(0)
random.seed(0)
numpy.random.seed(0)


class Regression_dataset(torch.utils.data.Dataset):
    """Simple regression dataset."""

    def __init__(self, timesteps):
        """Linear relation between input and output."""
        lin_vec = torch.linspace(start=0.0, end=1.0, steps=timesteps)
        self.feature = lin_vec.view(timesteps, 1, 1)
        self.label = self.feature * 1

    def __len__(self):
        """Only one sample."""
        return 1

    def __getitem__(self, idx):
        """General implementation, but we only have one sample."""
        return self.feature[:, idx, :], self.label[:, idx, :]


class SNN(torch.nn.Module):
    """Simple spiking neural network in snntorch."""

    def __init__(self, timesteps, hidden, lstm=False):
        super().__init__()
        self.timesteps = timesteps
        self.hidden = hidden
        self.lstm = lstm
        spike_grad = snntorch.surrogate.atan()

        if not self.lstm:
            beta_in = torch.rand(self.hidden)
            thr_in = torch.rand(self.hidden)
            self.fc1 = torch.nn.Linear(in_features=1, out_features=self.hidden)
            self.lif = snntorch.Leaky(
                beta=beta_in,
                threshold=thr_in,
                learn_beta=True,
                learn_threshold=True,
                spike_grad=spike_grad,
                reset_mechanism="subtract",
            )
        elif self.lstm:
            thr_lstm = torch.rand(self.hidden)
            self.slstm = snntorch.SLSTM(
                input_size=1,
                hidden_size=self.hidden,
                spike_grad=spike_grad,
                learn_threshold=True,
                threshold=thr_lstm,
                reset_mechanism="none",
            )
        else:
            raise SystemExit()

        # Output layer: leaky integrator (no reset) whose membrane is the prediction.
        beta_out = torch.rand(1)
        thr_out = torch.rand(1)
        self.fc2 = torch.nn.Linear(in_features=self.hidden, out_features=1)
        self.li = snntorch.Leaky(
            beta=beta_out,
            threshold=thr_out,
            learn_beta=True,
            learn_threshold=True,
            spike_grad=spike_grad,
            reset_mechanism="none",
        )

    def forward(self, x):
        """Forward pass for several time steps."""
        syn_in = None
        if not self.lstm:
            mem_in = self.lif.init_leaky()
        elif self.lstm:
            syn_in, mem_in = self.slstm.init_slstm()
        else:
            raise SystemExit()
        mem_out = self.li.init_leaky()

        cur_out_rec = []
        mem_out_rec = []
        for step in range(self.timesteps):
            x_timestep = x[step, :, :]
            if not self.lstm:
                cur_in = self.fc1(x_timestep)
                spk_in, mem_in = self.lif(cur_in, mem_in)
            elif self.lstm:
                spk_in, syn_in, mem_in = self.slstm(x_timestep, syn_in, mem_in)
            else:
                raise SystemExit()
            cur_out = self.fc2(spk_in)
            cur_out, mem_out = self.li(cur_out, mem_out)
            cur_out_rec.append(cur_out)
            mem_out_rec.append(mem_out)
        return torch.stack(cur_out_rec, dim=0), torch.stack(mem_out_rec, dim=0)


def main():
    """Training loop and prediction."""
    DEVICE = (
        torch.device("cuda")
        if torch.cuda.is_available()
        else torch.device("cpu")
    )
    TIMESTEPS = 10
    ITER = 2000
    HIDDEN = 1024
    LSTM = True

    dataloader = torch.utils.data.DataLoader(
        dataset=Regression_dataset(timesteps=TIMESTEPS)
    )
    model = SNN(timesteps=TIMESTEPS, hidden=HIDDEN, lstm=LSTM).to(DEVICE)
    model.train()
    optimizer = torch.optim.Adam(params=model.parameters(), lr=1e-3)
    loss_function = torch.nn.MSELoss()

    feature = None
    label = None
    loss_val = None
    for i in range(ITER):
        train_batch = iter(dataloader)
        for feature, label in train_batch:
            # The DataLoader returns [batch, time, feature]; the network expects time first.
            feature = torch.swapaxes(input=feature, axis0=0, axis1=1)
            label = torch.swapaxes(input=label, axis0=0, axis1=1)
            feature = feature.to(DEVICE)
            label = label.to(DEVICE)
            cur, mem = model(feature)
            loss_val = loss_function(mem, label)
            optimizer.zero_grad()
            loss_val.backward()
            optimizer.step()
        print(f"Iter: {i}, Loss: {loss_val.detach().cpu().numpy()}")

    with torch.no_grad():
        model.eval()
        _, prediction = model(feature)
        label = torch.squeeze(label).cpu().numpy().tolist()
        prediction = torch.squeeze(prediction).cpu().numpy().tolist()
        result = list(zip(label, prediction))
        for i in result:
            print(i)
    return None


if __name__ == "__main__":
    main()
```
-
Interesting. I feel LSTMs should be better than LIFs too. My only guess at the moment is that perhaps the firing threshold of the LSTM is too high, and there is insufficient spiking activity propagating through the network. Lowering the threshold, rather than setting it randomly, might help with this.

In my experience, the LSTM modules performed better than LIF on more complex, time-varying tasks (e.g., seizure detection/prediction on EEG data). But I had to drop the threshold quite low to get there (e.g., 0.1, 0.01...).
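As a concrete illustration of what I mean, relative to the script above (0.01 is only an example value, and whether it helps here is an open question):

```python
# Sketch: swap the randomly initialised, learnable threshold for a small fixed one.
self.slstm = snntorch.SLSTM(
    input_size=1,
    hidden_size=self.hidden,
    spike_grad=spike_grad,
    threshold=0.01,          # low fixed threshold -> more spikes propagate forward
    learn_threshold=False,
    reset_mechanism="none",
)
```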
-
I tried different widths, thresholds, learnable/non-learnable thresholds, reset mechanisms and surrogate gradients. Nothing seems to work for timesteps > 10...
-
However, I am able to overfit if I pass the membrane potential of the SLSTM to the LI layer instead of the spikes, which makes me wonder whether the problem really lies in the LSTM or maybe in the LI layer? Does this count as cheating and going around the idea of spikes / spiking neural networks? :D
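Concretely, the only change in the forward pass of the script above is this (sketch):

```python
spk_in, syn_in, mem_in = self.slstm(x_timestep, syn_in, mem_in)
cur_out = self.fc2(mem_in)    # was: self.fc2(spk_in) -- feed the membrane instead of the spikes
cur_out, mem_out = self.li(cur_out, mem_out)
```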
-
> I tried different widths, thresholds, learnable/not learnable thresholds, reset mechanisms and surrogate gradients. Nothing seems to work for timesteps > 10...

Have you tried plotting the output of the SLSTM layer? This is usually a good way to help isolate the problem. E.g., see if there is too much spiking, or not enough spiking.

> Does this count as cheating and going around the idea of spikes / spiking neural networks?

Haha, in my view there's no precise definition of what an SNN is, tbh. My personal, engineering-centric definition is one that uses 1/0 activations: 1 * W = W, where if W is stored in memory, the weight-multiply step is just a read-out of W from memory; and 0 * W = 0, where if your hardware sees a '0' activation, a smart accelerator will just skip the memory access step. My goal is to use SNNs to improve efficiency.

Some people have tighter definitions of SNNs; e.g., not only 1/0 activations, but also 1/0s during the backward pass. If you go by this rule, then snntorch isn't actually using SNNs at training time, only at inference. But there are plenty of analog phenomena in the brain that are not constrained to 1s and 0s... So cheating only depends on the rules you impose on yourself / your goals!
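For the plotting suggestion above, something along these lines is usually enough (assuming the forward pass is modified to also record the SLSTM spikes in a tensor `spk_in_rec` of shape [time, batch, hidden]; the names are illustrative):

```python
import matplotlib.pyplot as plt

# spk_in_rec: SLSTM spikes stacked over time, shape [time, batch, hidden]
rate_per_step = spk_in_rec.float().mean(dim=(1, 2))   # fraction of neurons firing at each step
print(rate_per_step)

plt.plot(rate_per_step.detach().cpu().numpy())
plt.xlabel("time step")
plt.ylabel("mean SLSTM firing rate")
plt.show()   # a curve stuck near zero would suggest the threshold is too high
```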
-
Thank you for the explanation, the engineering point of view suits me well ;) What do you think, do you want my simple example to be added to your tutorial section? We could also try something more difficult, like f(x) = x^2, I don't know... We could close this "issue" and open a thread in the discussion area, maybe to prepare a PR?
-
Oh yeah, absolutely! More than happy for you to take the lead on this initiative, or we can co-write a tutorial together. Go ahead and convert this to a discussion and we can brainstorm an outline.
-
Just read through the discussion. I'd love to contribute to the tutorial or join in on the discussion as it's written - I've been using snnTorch quite frequently for my work, and this would be a great opportunity for me too, I think!
-
I think another interesting tutorial that could be done is using event data (timestamp, x, y, polarity) with regression in order to predict something about what is being captured by an event camera. Perhaps this is a more advanced regression problem, but I did my Master's thesis on this recently and have open-sourced my dataset, or there are a ton of existing datasets in Tonic that could maybe be used, although I'm not sure they have regression applications. Interested in hearing your thoughts or other potential ideas for a more real-world example of regression with SNNs.
-
I am currently working on a baseline code. Unfortunately, I have a memory leak which needs to be fixed, and I have little experience with that in PyTorch. Once everything is working, I'll report back!
-
That's awesome, let's do it. Do you have Discord? We're currently brainstorming in the tutorial-dev channel there. I haven't used input spike times directly (without expanding them out to tensors), but I'm definitely intrigued!
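By "expanding them out to tensors" I mean something along these lines (rough, library-agnostic sketch; all names and sizes are illustrative, and Tonic has proper transforms for this):

```python
import numpy as np

def events_to_frames(events, sensor_size=(34, 34), n_time_bins=25):
    """Bin raw events into frames of shape [n_time_bins, 2, H, W].

    `events` is assumed to be a structured array with fields t, x, y, p.
    """
    frames = np.zeros((n_time_bins, 2, sensor_size[1], sensor_size[0]), dtype=np.float32)
    t = events["t"].astype(np.float64)
    span = max(float(t.max() - t.min()), 1e-9)
    bins = ((t - t.min()) / span * (n_time_bins - 1)).astype(int)
    for b, x, y, p in zip(bins, events["x"], events["y"], events["p"]):
        frames[b, int(p), int(y), int(x)] += 1.0   # accumulate event counts per polarity
    return frames
```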
-
I do have Discord ~ I'm currently away for work, but I will contact you as soon as I get back. I'd love to be part of that channel.
-
What's the name of the Discord server?
-
Oops, the teaching quarter just finished and I'm catching up on life now.
-
I was wondering if anyone had used snnTorch for regression, and perhaps how you set up your networks. Just looking for simple, general examples! MSELoss would likely be the type of loss used, as I see it.