Skip to content

Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training"

Notifications You must be signed in to change notification settings

BICLab/Spike-Driven-Transformer-V3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 

Repository files navigation

spikezip_logo

Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training

Man Yao*, Xuerui Qiu*, Tianxiang Hu, Jiakui Hu, Yuhong Chou, Keyu Tian, Jianxing Liao, Luziwei Leng, Bo Xu, Guoqi Li

*Equal contribution.

BICLab, Institute of Automation, Chinese Academy of Sciences

This repo is the official implementation of Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training . It currently concludes codes and models for the following tasks:

Base Model ImageNet From Scratch: See Train_Base.md.
Large Model ImageNet Pretrain and Finetune: See Train_Large.md.
Object Detection: See Detection.md.
Semantic Segmentation: See Segementation.md.
DVS: See DVS.md.

🚀 🚀 🚀 News:

  • Dec. 19, 2023: Release the code for training and testing.

Abstract

The ambition of brain-inspired Spiking Neural Networks (SNNs) is to become a low-power alternative to traditional Artificial Neural Networks (ANNs). This work addresses two major challenges in realizing this vision: the performance gap between SNNs and ANNs, and the high training costs of SNNs. We identify intrinsic flaws in spiking neurons caused by binary firing mechanisms and propose a Spike Firing Approximation (SFA) method using integer training and spike-driven inference. This optimizes the spike firing pattern of spiking neurons, enhancing efficient training, reducing power consumption, improving performance, enabling easier scaling, and better utilizing neuromorphic chips. We also develop an efficient spike-driven Transformer architecture and a spike-masked autoencoder to prevent performance degradation during SNN scaling. On ImageNet-1k, we achieve state-of-the-art top-1 accuracy of 78.5%, 79.8%, 84.0%, and 86.2% with models containing 10M, 19M, 83M, and 173M parameters, respectively. For instance, the 10M model outperforms the best existing SNN by 7.2% on ImageNet, with training time acceleration and inference energy efficiency improved by 4.5x and 3.9x, respectively. We validate the effectiveness and efficiency of the proposed method across various tasks, including object detection, semantic segmentation, and neuromorphic vision tasks. This work enables SNNs to match ANN performance while maintaining the low-power advantage, marking a significant step towards SNNs as a general visual backbone.

avatar

Results

We address the performance and training consumption gap between SNNs and ANNs. A key contribution is identifying the mechanistic flaw of binary spike firing in spiking neurons. To overcome these limitations, we propose a Spike Firing Approximation (SFA) method. This method is based on integer training and spike-driven inference, aiming to optimize the spike firing pattern of spiking neurons. Our results demonstrate that optimization the spike firing pattern leads to comprehensive improvements in SNNs, including enhanced training efficiency, reduced power consumption, improved performance, easier scalability, and better utilization of neuromorphic chips. Additionally, we develop an efficient spike-driven Transformer architecture and a spike masked autoencoder to prevent performance degradation during SNN scaling. By addressing the training and performance challenges of large-scale SNNs, we pave the way for a new era in neuromorphic computing.

avatar

Contact Information

@article{yao2024scaling,
  title={Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training},
  author={Yao, Man and Qiu, Xuerui and Hu, Tianxiang and Hu, Jiakui and Chou, Yuhong and Tian, Keyu and Liao, Jianxing and Leng, Luziwei and Xu, Bo and Li, Guoqi},
  journal={arXiv preprint arXiv:2411.16061},
  year={2024}
}

For help or issues using this git, please submit a GitHub issue.

For other communications related to this git, please contact [email protected] and [email protected].

Acknowledgement

The pretraining and finetuning of our project are based on DeiT, MCMAE, Spark. and MAE. The object detection and semantic segmentation parts are based on MMDetection and MMSegmentation respectively. Thanks for their wonderful work.

About

Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training"

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages