Segment 3 Notebook Submission #16

Open
Ayesha-Imr wants to merge 2 commits into Cohere-Labs-Community:main from Ayesha-Imr:main

@Ayesha-Imr
Contributor

Segment 03: Dataset Exemplars with ImageNet Validation Set

Summary

Adds the Segment 03 notebook for finding real-world dataset exemplars: the top-10 ImageNet images that most strongly activate each channel (neuron) in InceptionV1's mixed4a layer.

Uses the validation set (50K images) instead of the full training set (1.28M images).

Why validation set?

The initial approach used streaming plus checkpointing over the full training set, but it hit a limitation: streaming datasets have no random access. On resume, the pipeline must re-iterate through every previously processed image before it reaches new ones, which makes checkpoint recovery impractical at 1.28M images.
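
To make the resume cost concrete, here is a minimal sketch using only the standard library. It is not the notebook's code: `resume_stream` and `counting_stream` are hypothetical names, and a plain generator stands in for a streaming dataset with no random access.

```python
from itertools import islice
from typing import Dict, Iterable, Iterator


def counting_stream(n: int, counter: Dict[str, int]) -> Iterator[int]:
    """Stand-in for a forward-only image stream; counts every item it produces."""
    for i in range(n):
        counter["produced"] += 1
        yield i


def resume_stream(stream: Iterable[int], n_done: int) -> Iterator[int]:
    """Skip the first n_done items of a forward-only stream.

    There is no way to jump ahead, so the skipped items are still
    produced one by one before the first new item comes out.
    """
    return islice(stream, n_done, None)


counter = {"produced": 0}
stream = resume_stream(counting_stream(1000, counter), n_done=900)
first_new = next(stream)
print(first_new)            # 900
print(counter["produced"])  # 901: 900 skipped items plus the first new one
```

Resuming after 900 items still produces all 900 of them again, so the cost of recovery grows with how far the previous run got.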

What it does

  • Streams ImageNet validation split
  • Passes each image through InceptionV1 and captures mixed4a activations
  • Tracks the top-10 images per channel using min-heaps (O(log K) per update, with K = 10)
  • Visualizes results with clear channel/rank labels and activation values
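
The top-K tracking step can be sketched roughly as follows. This is an illustration under stated assumptions, not the notebook's implementation: `update_top_k` and `ranked` are hypothetical names, and each image is assumed to have already been reduced to one scalar score per channel (how mixed4a's spatial activations are pooled is not specified here).

```python
import heapq
from typing import List, Sequence, Tuple

Entry = Tuple[float, int]  # (activation score, image id)


def update_top_k(
    heaps: List[List[Entry]],
    image_id: int,
    channel_scores: Sequence[float],
    k: int = 10,
) -> None:
    """Offer one image's per-channel scores to the per-channel min-heaps."""
    for channel, score in enumerate(channel_scores):
        heap = heaps[channel]
        entry = (score, image_id)
        if len(heap) < k:
            # Heap not full yet: always keep the image.
            heapq.heappush(heap, entry)
        elif entry > heap[0]:
            # heap[0] is the weakest kept exemplar; swap it out in O(log k).
            heapq.heapreplace(heap, entry)


def ranked(heap: List[Entry]) -> List[Entry]:
    """Return the kept exemplars from strongest to weakest."""
    return sorted(heap, reverse=True)


# Toy run: 2 channels, 3 images, keep the top 2 per channel.
heaps: List[List[Entry]] = [[] for _ in range(2)]
for image_id, scores in enumerate([[0.1, 0.9], [0.5, 0.2], [0.3, 0.7]]):
    update_top_k(heaps, image_id, scores, k=2)

print(ranked(heaps[0]))  # [(0.5, 1), (0.3, 2)]
print(ranked(heaps[1]))  # [(0.9, 0), (0.7, 2)]
```

A min-heap keeps the weakest of the current top K at the root, so most images are rejected with a single comparison and memory stays at K entries per channel regardless of dataset size.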

Results

Grid shows 10 channels × 10 top images.

Copilot AI review requested due to automatic review settings February 6, 2026 10:11

Copilot AI left a comment

Copilot wasn't able to review any files in this pull request.
