Fix several latent bugs in MSA, ranking, and structure-context code by mooreneural · Pull Request #420 · chaidiscovery/chai-lab

mooreneural · 2026-05-18T22:21:30Z

Fixes 5 small but real bugs found while reviewing the data pipeline, ranking, and feature-generator code. Each is a one- to four-line change.

Fixes

colabfold.py — error_count = 0 was inside the while True retry loop in submit() and status(), so the counter reset every iteration and if error_count > 5: raise was unreachable. The submit/status helpers would retry forever on persistent server errors. Moved the initialization above the loop. (download() was already correct.)
all_atom_structure_context.py — In AllAtomStructureContext.merge, token_backbone_frame_index (which stores atom indices, see backbone_atoms_indices in data/dataset/structure/utils.py) was being offset by token_offsets rather than atom_offsets when merging multiple chains. The sibling fields token_centre_atom_index and token_ref_atom_index in the same function correctly use atom_offsets. Currently latent because the returned frame_idces from get_frames_and_mask is discarded by chai1.py, but it's still a real semantic bug.
token_dist_restraint.py — Stray extra [ ... ] around the key_entity_types tensor literal gave it shape (1, n) instead of (n,), asymmetric with query_entity_types defined right above it. Currently unused at the call site, but trivially wrong.
ranking/clashes.py — In has_inter_chain_clashes, the two ratio checks used inconsistent einops patterns: "... c -> ... c 1" (ellipsis-form) on one line and "b c -> b 1 c" (hardcoded b) on the next. Functionally equivalent for the 2D batch input used today, but the latter will fail if batch dims ever change. Made both ellipsis-form.
features/generators/msa.py — MSADataSourceGenerator._generate mutated the input msa_sequence_source tensor in place (msa_sequence_source[msa_sequence_source.eq(query)] = none) before reassigning it via the non-inplace masked_fill on the next line. Idempotent so it doesn't cause incorrect output today, but fragile — replaced with masked_fill for consistency.

Test plan

python -m py_compile on all modified files (passes locally)
Existing tests/test_inference_dataset.py::test_protein_with_smiles exercises the multi-chain merge() code path and still passes
Manual verification of einops pattern equivalence on 2D input + correctness on 3D input
Manual verification that key_entity_types now has shape (n,) matching query_entity_types

- colabfold.py: move error_count out of retry loop in submit/status so the retry cap actually triggers (was reset to 0 every iteration, making `if error_count > 5: raise` unreachable) - all_atom_structure_context.py: offset token_backbone_frame_index by atom_offsets (it stores atom indices), not token_offsets, when merging chains - token_dist_restraint.py: remove stray extra brackets around key_entity_types tensor that gave it shape (1, n) instead of (n,) - clashes.py: use ellipsis-form rearrange pattern for atoms_per_chain consistently across both ratio checks - msa.py: replace in-place mutation of input msa_sequence_source with masked_fill

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix several latent bugs in MSA, ranking, and structure-context code#420

Fix several latent bugs in MSA, ranking, and structure-context code#420
mooreneural wants to merge 1 commit into
chaidiscovery:mainfrom
mooreneural:chai-lab2

mooreneural commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mooreneural commented May 18, 2026

Fixes

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant