Treffer: ASFNOformer—A Superior Frequency Domain Token Mixer in Spiking Transformer.

Title:
ASFNOformer—A Superior Frequency Domain Token Mixer in Spiking Transformer.
Source:
Electronics (2079-9292); Dec2025, Vol. 14 Issue 24, p4860, 19p
Database:
Complementary Index

Weitere Informationen

As the third generation of neural networks, Spiking Neural Networks (SNNs) simulate the event-driven processing mode of the brain, offering superior energy efficiency and biological interpretability compared to traditional deep learning. Combining the architectural strengths of Transformers with SNNs has recently demonstrated high accuracy and significant potential. SNNs process binary spikes and rich temporal information, resulting in lower computational complexity and making them particularly suitable for neuromorphic datasets. However, neuromorphic data typically involve dynamic edges and high-frequency pixel intensity changes. Capturing this frequency information is challenging for traditional spatial methods but is critical for event-driven vision. To address this, we investigate the integration of the Fast Fourier Transform (FFT) into SNNs and propose the Adaptive Spiking Fourier Neural Operator Transformer (ASFNOformer). This architecture adapts the Adaptive Fourier Neural Operator (AFNO)—originally validated in Artificial Neural Networks (ANNs)—specifically for the spiking domain. Unlike standard AFNOs, our module applies FFT across both spatial (H, W) and temporal (T) dimensions, followed by a Multi-Layer Perceptron structure (MLP) mechanism with a block-diagonal weight matrix. This design effectively captures both spatial features and temporal dynamics inherent in event streams. Furthermore, we incorporate Leaky Integrate-and-Fire (LIF) neurons optimized with Learnable Weight Parameters (LWP-LIF) to enhance temporal feature extraction and adaptivity. Experimental results on standard benchmarks indicate that our method reduces the parameter count by approximately 25%. In terms of recognition accuracy, ASFNOformer is comparable to mainstream models on static datasets and demonstrates superior performance on neuromorphic datasets by efficiently capturing frequency features. Notably, ablation studies confirm the model's generalizability, and when using QKformer as a baseline, our method achieves state-of-the-art (SOTA) performance on the CIFAR10-DVS dataset. This work advances frequency-domain analysis in SNNs, paving the way for efficient deployment on neuromorphic hardware. [ABSTRACT FROM AUTHOR]

Copyright of Electronics (2079-9292) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)