Skip to content

Check eval performance when extracting center token for classification tasks#497

Open
favyen2 wants to merge 2 commits intomainfrom
favyen/20260219-try-using-center-token
Open

Check eval performance when extracting center token for classification tasks#497
favyen2 wants to merge 2 commits intomainfrom
favyen/20260219-try-using-center-token

Conversation

@favyen2
Copy link
Collaborator

@favyen2 favyen2 commented Feb 19, 2026

This adds an option to pool over the center tokens only (across timesteps/modalities) instead of all tokens for classification tasks. The motivation is previous observations that this helps for some tasks like Nandi.

I also added an option to use bilinear interpolation followed by linear layer for segmentation tasks.

It doesn't seem to make a big difference for most tasks: https://wandb.ai/eai-ai2/2026_02_14_eval_changes

This means pooling over the center tokens only (across timesteps/modalities)
instead of all tokens. Since previously we observed this helps for some
tasks. Also add option to use bilinear interpolation followed by linear
layer for segmentation tasks.
Copy link
Collaborator

@yawenzzzz yawenzzzz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants