CSE463 Project

This repository contains my personal experiments on the "Lung & Colon Cancer Dataset-unstable dataset" and the "Brain Tumor MRI Dataset" using different hybrid architectures, including CNN, KAN, MLP, ConvNeXt-V2, SE blocks, etc., for comparative analysis.

Note: These are personal experiments, so I cannot guarantee their correctness.
If you find any issues, have suggestions, or want to contribute, feel free to open an issue or submit a pull request.

Short Architecture Description

The architecture and implementation details for the MRI experiments are in this file: 20-10 ConvNeXt-V2 KAN Notebook
Description: The sections below summarize the architecture and key points from that notebook.

Summary Results

Model	Test Loss	Test Accuracy	Test Precision	Test Recall	Test F1 Score
convnextv2_se_kan	0.082185	0.974085	0.973709	0.973070	0.973245
convnextv2_kan	0.053409	0.984756	0.984538	0.983366	0.983717
convnextv2_baseline	0.070767	0.981707	0.981618	0.980033	0.980486
convnextv2_se	0.082423	0.978659	0.978121	0.977167	0.977484
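To make the table easier to compare at a glance, the rows can be ranked programmatically. The snippet below is only a convenience sketch: the numbers are copied from the table above, while the dictionary layout and variable names are illustrative and not taken from the notebook.

```python
# Test-set metrics copied verbatim from the summary table above.
# Tuple order: (loss, accuracy, precision, recall, f1)
results = {
    "convnextv2_se_kan":   (0.082185, 0.974085, 0.973709, 0.973070, 0.973245),
    "convnextv2_kan":      (0.053409, 0.984756, 0.984538, 0.983366, 0.983717),
    "convnextv2_baseline": (0.070767, 0.981707, 0.981618, 0.980033, 0.980486),
    "convnextv2_se":       (0.082423, 0.978659, 0.978121, 0.977167, 0.977484),
}

# Rank the models by test F1 score, highest first.
ranking = sorted(results, key=lambda name: results[name][4], reverse=True)

baseline_acc = results["convnextv2_baseline"][1]
for name in ranking:
    loss, acc, prec, rec, f1 = results[name]
    # Accuracy gap to the baseline, expressed in percentage points.
    delta_pp = (acc - baseline_acc) * 100
    print(f"{name:<22} f1={f1:.4f} acc={acc:.4f} ({delta_pp:+.2f} pp vs baseline)")
```

On these numbers, only the KAN variant scores above the baseline (by roughly 0.3 percentage points of accuracy); both SE variants score slightly below it.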

Architecture Overview

1. ConvNeXt-V2 Baseline Block

1. Input: (N, C, H, W); keep a copy as the shortcut for the residual connection.
2. DWConv 7×7 (depthwise conv): mixes spatial information within each channel and keeps the channel count at C.
3. Permute → channels-last (N, H, W, C): needed so that LayerNorm and the position-wise linear layers act on the channel dimension.
4. LayerNorm: normalizes across the C channels at each spatial position to stabilize training.
5. PW1: expand (C → 4C): position-wise linear layer, equivalent to a 1×1 conv, that widens the feature dimension.
6. GELU: non-linear activation.
7. GRN (Global Response Normalization): rescales each channel by its L2 norm over H and W relative to the mean norm across channels, which increases contrast between channels.
8. PW2: project (4C → C): position-wise linear layer that reduces the dimension back to the original channel count.
9. Permute → channels-first (N, C, H, W).
10. DropPath (optional): stochastic depth; randomly drops the block's branch for individual samples during training, as regularization.
11. Residual Add: adds the shortcut to the branch output.
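The steps above can be sketched in PyTorch as follows. This is a minimal illustration written for this README, not the notebook's code: the class and function names, the `eps` values, and the example sizes are assumptions.

```python
import torch
import torch.nn as nn


class GRN(nn.Module):
    """Global Response Normalization for channels-last input (N, H, W, C)."""

    def __init__(self, dim, eps=1e-6):
        super().__init__()
        # Zero-initialized, so the layer starts out as an identity mapping.
        self.gamma = nn.Parameter(torch.zeros(1, 1, 1, dim))
        self.beta = nn.Parameter(torch.zeros(1, 1, 1, dim))
        self.eps = eps

    def forward(self, x):
        gx = torch.norm(x, p=2, dim=(1, 2), keepdim=True)     # per-channel L2 norm over H, W
        nx = gx / (gx.mean(dim=-1, keepdim=True) + self.eps)  # relative to the mean over channels
        return self.gamma * (x * nx) + self.beta + x


def drop_path(x, drop_prob, training):
    """Stochastic depth: zero the whole branch for randomly chosen samples."""
    if drop_prob == 0.0 or not training:
        return x
    keep = 1.0 - drop_prob
    mask = torch.rand(x.shape[0], 1, 1, 1, device=x.device) < keep
    return x * mask.to(x.dtype) / keep  # rescale so the expected value is unchanged


class ConvNeXtV2Block(nn.Module):
    def __init__(self, dim, drop_prob=0.0):
        super().__init__()
        self.drop_prob = drop_prob
        # groups=dim makes the 7x7 convolution depthwise.
        self.dwconv = nn.Conv2d(dim, dim, kernel_size=7, padding=3, groups=dim)
        self.norm = nn.LayerNorm(dim, eps=1e-6)
        self.pw1 = nn.Linear(dim, 4 * dim)
        self.act = nn.GELU()
        self.grn = GRN(4 * dim)
        self.pw2 = nn.Linear(4 * dim, dim)

    def forward(self, x):
        shortcut = x                    # keep the input for the residual add
        x = self.dwconv(x)              # depthwise 7x7, shape stays (N, C, H, W)
        x = x.permute(0, 2, 3, 1)       # channels-last (N, H, W, C)
        x = self.norm(x)                # LayerNorm over C at each position
        x = self.pw1(x)                 # expand C -> 4C
        x = self.act(x)                 # GELU
        x = self.grn(x)                 # GRN on 4C channels
        x = self.pw2(x)                 # project 4C -> C
        x = x.permute(0, 3, 1, 2)       # channels-first (N, C, H, W)
        x = drop_path(x, self.drop_prob, self.training)
        return shortcut + x             # residual add


block = ConvNeXtV2Block(dim=32).eval()
out = block(torch.randn(2, 32, 16, 16))
print(out.shape)  # torch.Size([2, 32, 16, 16])
```

The block preserves the input shape, which is what allows it to be stacked and lets the shortcut be added without a projection.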

2. ConvNeXt-V2 + SE Block (Squeeze-and-Excitation)

Steps 1–8: same as baseline.
9. Global Average Pool over H, W: squeezes spatial information into (N, C).
10. SE MLP: Linear(C → C/reduction) → GELU → Linear(C/reduction → C) → Sigmoid; computes channel attention weights in (0, 1).
11. Scale features: x = x * se, broadcast over H and W; applies channel-wise gating.
Steps 12–13: permute back to channels-first → DropPath → Residual Add.

SE is applied after channel projection, before residual addition.
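A minimal sketch of the SE gating on channels-last features, as it would sit between the 4C → C projection and the permute back. The module name `SEGate` and the value `reduction=4` are assumptions for illustration; the README does not state the reduction ratio used in the notebook.

```python
import torch
import torch.nn as nn


class SEGate(nn.Module):
    """Squeeze-and-Excitation gate for channels-last features (N, H, W, C)."""

    def __init__(self, dim, reduction=4):  # reduction=4 is an assumed value
        super().__init__()
        hidden = max(dim // reduction, 1)
        self.mlp = nn.Sequential(
            nn.Linear(dim, hidden),
            nn.GELU(),
            nn.Linear(hidden, dim),
            nn.Sigmoid(),
        )

    def forward(self, x):
        squeezed = x.mean(dim=(1, 2))        # global average pool over H, W -> (N, C)
        se = self.mlp(squeezed)              # per-channel weights in (0, 1), shape (N, C)
        return x * se[:, None, None, :]      # broadcast the gate over H and W


gate = SEGate(dim=32, reduction=4)
feats = torch.randn(2, 16, 16, 32)           # channels-last features after the projection
gated = gate(feats)
print(gated.shape)  # torch.Size([2, 16, 16, 32])
```

Because the sigmoid output lies between 0 and 1, the gate can only attenuate channels, never amplify them.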

3. ConvNeXt-V2 + KAN Block

1. Input: (N, C, H, W); keep the shortcut.
2. DWConv 7×7: spatial mixing.
3. Permute → channels-last and flatten positions to (Npos, C), where Npos = N·H·W, for KAN.
4. KAN1: expand (C → 4C): replaces the PW1 linear layer with a spline-based KANLinear for richer channel mixing.
5. GELU.
6. GRN on 4C: global response normalization.
7. KAN2: project (4C → C): replaces the PW2 linear layer and restores the original channel count.
8. Reshape and permute → channels-first (N, C, H, W).
9. DropPath → Residual Add.

KAN replaces MLPs for more expressive non-linear channel transformations.
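To illustrate the flatten → KAN → reshape round trip, here is a simplified sketch. `ToyKANLinear` is a hypothetical stand-in written for this README: it uses Gaussian bumps as the per-edge basis instead of the B-splines of a real KANLinear, and the DWConv, GRN, DropPath and residual steps are left out so that only the shape handling is shown.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ToyKANLinear(nn.Module):
    """Simplified KAN-style layer (NOT the notebook's KANLinear).

    Every input-output edge gets its own learnable 1-D function, built here
    from Gaussian bumps (a stand-in for B-splines) plus a SiLU base term.
    """

    def __init__(self, in_features, out_features, num_basis=8, grid_range=(-2.0, 2.0)):
        super().__init__()
        lo, hi = grid_range
        self.register_buffer("centers", torch.linspace(lo, hi, num_basis))
        self.width = (hi - lo) / (num_basis - 1)
        self.base = nn.Linear(in_features, out_features)
        # One coefficient per (output, input, basis function) triple.
        self.spline = nn.Linear(in_features * num_basis, out_features, bias=False)

    def forward(self, x):  # x: (Npos, in_features)
        # Evaluate each basis function at each input value: (Npos, in, num_basis)
        phi = torch.exp(-(((x.unsqueeze(-1) - self.centers) / self.width) ** 2))
        return self.base(F.silu(x)) + self.spline(phi.flatten(1))


N, C, H, W = 2, 16, 8, 8
x = torch.randn(N, C, H, W)

x = x.permute(0, 2, 3, 1)               # channels-last (N, H, W, C)
tokens = x.reshape(-1, C)               # flatten positions: (Npos, C) with Npos = N*H*W

kan1 = ToyKANLinear(C, 4 * C)           # expand C -> 4C
kan2 = ToyKANLinear(4 * C, C)           # project 4C -> C

hidden = F.gelu(kan1(tokens))           # (Npos, 4C); GRN would be applied here
out = kan2(hidden).reshape(N, H, W, C).permute(0, 3, 1, 2)  # back to (N, C, H, W)
print(tokens.shape, hidden.shape, out.shape)
```

Compared with a plain linear layer, each edge here carries `num_basis` coefficients instead of one weight, so the parameter count of the two mixing layers grows by roughly that factor.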

4. ConvNeXt-V2 + KAN + SE Block (SEKAN)

Steps 1–7: same as KAN block.
8. Global Average Pool over H, W: squeezes spatial information into (N, C) for SE gating.
9. SE MLP → Sigmoid: computes one attention weight per channel.
10. Scale features: x = x * se.
Steps 11–12: permute back to channels-first → DropPath → Residual Add.

Combines KAN non-linear mixing with SE attention gating before residual addition.
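One detail worth spelling out is the shape bookkeeping: the KAN layers operate on flattened positions (Npos, C), but the SE pooling must average over H and W separately for each sample, so the features have to be un-flattened before the gate. The sketch below shows only this tail of the block. The `nn.Linear` used for `kan2`, the random `hidden` tensor, and `reduction = 4` are all stand-ins chosen for illustration, not the notebook's layers or settings.

```python
import torch
import torch.nn as nn

N, C, H, W = 2, 16, 8, 8
reduction = 4  # assumed value, not stated in this README

kan2 = nn.Linear(4 * C, C)  # stand-in for the KAN2 projection (4C -> C)
se_mlp = nn.Sequential(
    nn.Linear(C, C // reduction),
    nn.GELU(),
    nn.Linear(C // reduction, C),
    nn.Sigmoid(),
)

shortcut = torch.randn(N, C, H, W)        # block input kept for the residual add
hidden = torch.randn(N * H * W, 4 * C)    # stand-in for the features after KAN1, GELU and GRN

x = kan2(hidden)                          # (Npos, C)
x = x.reshape(N, H, W, C)                 # un-flatten so pooling is done per sample
se = se_mlp(x.mean(dim=(1, 2)))           # squeeze over H, W, then excite: (N, C)
x = x * se[:, None, None, :]              # channel-wise gating, broadcast over H and W
x = x.permute(0, 3, 1, 2)                 # channels-first (N, C, H, W)
out = shortcut + x                        # residual add (DropPath omitted)
print(se.shape, out.shape)
```

Pooling directly on the flattened (Npos, C) tensor would instead average over the whole batch and produce a single gate shared by all samples, which is not what SE is meant to do.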

