Wrong implementation of Spiking_Self_Attention #14

@frostylight

Description

It seems that the last LIF (proj_lif) in Spiking_Self_Attention receives input with shape $[T\times B, C, N]$ rather than $[T, B, C, N]$.
The reshape should be executed after proj_bn and before proj_lif, so that the multi-step neuron sees the time dimension first.

The accuracy of the trained HST-10-384 is 78.732% when batch_size is set to 32 (the default in imagenet/test.py).
When the problem is avoided by setting batch_size to 1 (with $B = 1$, the flattened $[T\times B, \dots]$ layout coincides with the time-major $[T, \dots]$ layout), the accuracy drops to 74.202% (-4.530%).
When the problem is fixed by replacing the line with the following code, the accuracy drops to 74.196% (-4.536%).

x = self.proj_lif(self.proj_bn(self.proj_conv(x)).reshape(T, B, C, W, H))

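A minimal shape trace (NumPy stand-ins, with illustrative sizes and the transpose step omitted) shows why the reshape must come before the multi-step neuron: a multi-step LIF interprets its leading axis as time, so feeding it the fused $[T\times B, \dots]$ tensor mixes time and batch in the membrane-potential update.

```python
import numpy as np

# Illustrative sizes; the real model uses larger values.
T, B, C, W, H = 4, 2, 8, 3, 3
N = W * H

# conv/BN run on the time-flattened tensor, as in the repo.
x = np.random.rand(T * B, C, W, H)

# Buggy order: proj_lif sees [T*B, C, W, H] and treats the fused
# time-batch axis as the time axis.
buggy_in = x
assert buggy_in.shape == (T * B, C, W, H)

# Fixed order: restore the time-major [T, B, ...] layout first.
fixed_in = x.reshape(T, B, C, W, H)
assert fixed_in.shape == (T, B, C, W, H)

# After the neuron, spatial dims can be flattened back to N tokens.
out = fixed_in.reshape(T, B, C, N)
assert out.shape == (T, B, C, N)
```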

QKFormer/cifar10/model.py

Lines 115 to 118 in 43f0adf

x = x.transpose(3, 4).reshape(T, B, C, N).contiguous()
x = self.attn_lif(x)
x = x.flatten(0,1)
x = self.proj_lif(self.proj_bn(self.proj_conv(x))).reshape(T,B,C,W,H)

QKFormer/cifar100/model.py

Lines 115 to 118 in 43f0adf

x = x.transpose(3, 4).reshape(T, B, C, N).contiguous()
x = self.attn_lif(x)
x = x.flatten(0,1)
x = self.proj_lif(self.proj_bn(self.proj_conv(x))).reshape(T,B,C,W,H)
