Skip to content

Conversation

@matsen
Copy link
Contributor

@matsen matsen commented Jun 19, 2025

Summary

  • Added ParentIndependentBinarySelectionModel class that learns position-specific log selection factors
  • Model is parent-independent, serving as a neural network analog to phylogenetic methods like Bloom & Neher (2023)
  • Uses weight decay for regularization instead of explicit regularization code

Details

The model stores a learnable parameter tensor with shape (max_seq_length, output_dim) or (max_seq_length,) depending on output dimension. During forward pass, it returns the appropriate slice expanded to match batch size, completely ignoring the parent sequence content.

This provides a meaningful baseline between SingleValueBinarySelectionModel (uniform selection) and transformer-based models (context-dependent selection).

Fixes #145

🤖 Generated with Claude Code

matsen and others added 3 commits June 19, 2025 14:28
This model learns position-specific log selection factors that are
parent-independent, serving as a neural network analog to phylogenetic
methods like Bloom & Neher (2023).

Fixes #145

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>
@jgallowa07 jgallowa07 self-requested a review July 14, 2025 17:17
Copy link
Member

@jgallowa07 jgallowa07 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The model has been tested, and the implementation looks good to me.

@jgallowa07 jgallowa07 merged commit 0008954 into main Jul 14, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add ParentIndependentBinarySelectionModel as a baseline model

3 participants