IS-DD (Information-Set Double-Dummy)

Code: colver-core/src/search/is_dd.rs

Realistic player based on the DD solver. Samples N "determinized worlds" consistent with current beliefs about hidden cards, solves each with DD, and aggregates per-card scores.

Naming: IsDdSearch (the struct) is the unified search; "Smart IS-DD" in the arena/code refers to the same struct with a BeliefNet loaded.

Algorithm

for each determinization (default 20):
    sample hidden cards weighted by belief weights
    solve resulting full-information state with DD
    for each legal card → record exact NS points
aggregate per-card → pick best (max for NS, min for EW)

DD returns exact NS points per legal card per world, so far fewer samples are needed than with IS-MCTS, whose MCTS rollouts are noisy. 20 determinizations are usually enough.
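The aggregation step of the loop above can be sketched as follows. This is a minimal illustration, not the code in is_dd.rs: `pick_card`, the `u8` card index, and the `solve` closure standing in for the DD solver are all hypothetical, and averaging is assumed as the aggregation rule (the source only says "aggregate per-card").

```rust
/// Index of a card in a 32-card deck (hypothetical representation).
type Card = u8;

/// Average the solver's NS points per legal card across the sampled worlds,
/// then pick the best card for the side to move (max for NS, min for EW).
///
/// `solve` stands in for the DD solver: given a fully known world and a card,
/// it returns the exact NS points reached when that card is played.
/// Averaging is an assumption made for this sketch.
fn pick_card<W>(
    worlds: &[W],
    legal: &[Card],
    ns_to_move: bool,
    solve: impl Fn(&W, Card) -> i32,
) -> Option<(Card, f64)> {
    if worlds.is_empty() {
        return None;
    }
    let mut best: Option<(Card, f64)> = None;
    for &card in legal {
        let total: i64 = worlds.iter().map(|w| solve(w, card) as i64).sum();
        let mean = total as f64 / worlds.len() as f64;
        let better = match best {
            None => true,
            Some((_, current)) if ns_to_move => mean > current,
            Some((_, current)) => mean < current,
        };
        if better {
            best = Some((card, mean));
        }
    }
    best
}
```

Because each per-world score is exact, the only noise left in the mean comes from the sampling of worlds, which is why a small sample count can suffice.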

Hard constraints vs soft beliefs

These are two completely different things in IS-DD:

Hard constraints (facts — always on)

Things we know to be true from the public game state. They are not configurable and are applied unconditionally in every code path:

Voids: a player who couldn't follow suit no longer has any card of that suit
Trump ceiling: a player who undertrumped (or discarded under "ne pisse pas") cannot have a higher trump
Played cards: any card already played is no longer in any hand
Observer's hand: the cards in our own hand are known

These constraints zero out probabilities for impossible (player, card) combinations. There is no use_hard_constraints flag — it would be like a flag for "use facts".

Hard constraints are computed inside CardBeliefs::raw_weights() and applied automatically:

Heuristic path (use_nn_beliefs=false): CardBeliefs::normalized_weights() already excludes impossible cards
NN path (use_nn_beliefs=true): NN soft predictions are masked by the same hard constraints (any card with raw_weight==0.0 in CardBeliefs gets nn_weight=0.0)
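As an illustration of how such a mask might work, the sketch below zeroes out impossible (player, card) weights from public facts. It is not the `CardBeliefs` implementation: the `HardFacts` struct, the `suit * 8 + rank` card indexing, and the function name are assumptions, and the trump ceiling is left out for brevity.

```rust
const PLAYERS: usize = 4;
const CARDS: usize = 32;
const SUITS: usize = 4;

/// Public facts about the deal (hypothetical layout: card index = suit * 8 + rank).
struct HardFacts {
    /// Bit i set = card i has already been played.
    played: u32,
    /// Seat of the observing player.
    observer: usize,
    /// Bit i set = the observer holds card i.
    observer_hand: u32,
    /// void_in[p][s] = player p has shown a void in suit s.
    void_in: [[bool; SUITS]; PLAYERS],
}

/// Zero out every (player, card) weight that the public facts rule out.
/// The same mask works for heuristic weights and for NN predictions.
fn apply_hard_constraints(weights: &mut [[f32; CARDS]; PLAYERS], facts: &HardFacts) {
    for player in 0..PLAYERS {
        for card in 0..CARDS {
            let bit = 1u32 << card;
            let played = (facts.played & bit) != 0;
            let in_observer_hand = (facts.observer_hand & bit) != 0;
            let impossible = played
                // A card in the observer's hand cannot be anywhere else.
                || (in_observer_hand && player != facts.observer)
                // The observer holds nothing beyond their known hand.
                || (!in_observer_hand && player == facts.observer)
                // A player void in a suit holds no card of that suit.
                || facts.void_in[player][card / 8];
            if impossible {
                weights[player][card] = 0.0;
            }
        }
    }
}
```

Applying the mask after any soft source keeps the two layers separate: soft beliefs can only redistribute weight among combinations the facts still allow.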

Soft beliefs (probabilistic guesses — all OFF by default)

Adjustments to probabilities based on inferences (which may be wrong). All disabled by default:

Source	Flag	What it does
Heuristic soft inference	`use_soft_inference`	Applies dominance reasoning ("player X followed without playing the highest → downweight their higher unknowns") and optional bid signal interpretation in `CardBeliefs`
NN soft beliefs	`use_nn_beliefs`	Loads a trained `BeliefNet` and uses its soft predictions for card locations. Hard constraints are still applied on top.
Elephant memory	`use_elephant_memory`	Particle filter that accumulates determinizations as particles, filters them by observed plays, and blends with base beliefs. See section below.

Multiple soft sources can be enabled simultaneously — e.g. use_nn_beliefs=true + use_elephant_memory=true gives NN soft + hard constraints + particle blend.

If all soft beliefs are off and no hard constraints zero out any unknown (early game), the determinizer falls back to uniform determinize_greedy over the remaining cards.
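The dominance reasoning used by the heuristic soft inference can be pictured with a small sketch. The function name, the per-player `[f32; 32]` weight array, and the `suit * 8 + rank` indexing (rank 0 weakest in the current context) are assumptions made for illustration; treating the factor as a multiplier is also an assumption, not a statement about `CardBeliefs`.

```rust
/// Downweight the cards of `suit` that outrank `played_rank` for one player.
///
/// `weights` holds that player's per-card weights, indexed `suit * 8 + rank`
/// with rank 0 the weakest (hypothetical layout). A `factor` of 1.0 leaves
/// the weights untouched; smaller values make the inference more aggressive.
fn apply_dominance(weights: &mut [f32; 32], suit: usize, played_rank: usize, factor: f32) {
    for rank in (played_rank + 1)..8 {
        weights[suit * 8 + rank] *= factor;
    }
}
```

For example, if a player follows with rank 5 of suit 1 and the factor is 0.3, only their weights for ranks 6 and 7 of that suit shrink. The cards stay possible, which is what distinguishes this from a hard constraint.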

Configuration (`IsDdConfig`)

Field	Default	Effect
`determinizations`	20	Number of worlds sampled (overridden by `time_limit_ms` if set)
`time_limit_ms`	None	Time budget per move, scaled by cards remaining: `effective_ms = ms × cards_left / 8` (8 = a full hand). Early tricks get more time and the endgame finishes quickly.
`use_soft_inference`	false	Soft heuristic from play (dominance, "ne pisse pas" weight adjustments).
`use_nn_beliefs`	false	Use a loaded `BeliefNet` for soft predictions.
`use_elephant_memory`	false	Particle filter from past determinizations.
`early_termination`	true	Skip search when forced (1 legal move) or when beliefs uniquely determine all hidden cards (single DD solve = exact answer).
`dominance_factor`	1.0	Only used when `use_soft_inference` is enabled. When a player follows suit without playing the highest card, their higher unknown cards are downweighted by this factor: 0.3 = aggressive, 1.0 = off.
`bid_function`	`ImprovedV2`	Bid function used during the bidding phase (IS-DD itself only acts during play)

Hard constraints (voids, trump ceiling, played cards) are always on and have no flag, since they are correct by construction. early_termination is exact for the same reason and is on by default.
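The interaction between determinizations and time_limit_ms can be sketched as below. The `Budget` enum and `search_budget` function are hypothetical names, and integer division is an assumption; only the scaling rule `ms × cards_left / 8` comes from the table above.

```rust
/// What bounds the determinization loop (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
enum Budget {
    /// Sample a fixed number of worlds.
    Worlds(u32),
    /// Sample until this many milliseconds have elapsed.
    Millis(u64),
}

/// A time limit, when set, overrides the world count and is scaled by the
/// number of cards left in hand (8 = full hand).
fn search_budget(determinizations: u32, time_limit_ms: Option<u64>, cards_left: u64) -> Budget {
    match time_limit_ms {
        Some(ms) => Budget::Millis(ms * cards_left / 8),
        None => Budget::Worlds(determinizations),
    }
}
```

With a 50 ms limit this gives 50 ms on a full hand, 25 ms with four cards left, and 6 ms on the last trick.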

Note on bid-derived beliefs

An earlier experiment exposed soft bid inference (partner bid 100 → likely strong trump) in BeliefState for BisDd. It was dropped: against NN bidders, the heuristic interpretation contradicted the actual hands 72% of the time. See BIS_DD.md. The bid belief NN v4 (bid_belief_v4.bin) replaced it. The dominance-based play heuristic in CardBeliefs::use_soft_inference is independent of bid interpretation.

Early termination

Two cases skip the determinization loop entirely:

Forced move — only 1 legal action. Return immediately with score=81 (neutral midpoint).
Resolved position — try_resolve_position() checks if beliefs uniquely determine every hidden card's owner (via raw_weights() > 0 test). If so, build the fully-known state and call solve_with_scores once. This is exact, no determinization needed.

These trigger more often than expected: late in a deal, voids accumulate and 4-5 cards become uniquely owned, so endgame becomes a single DD solve.
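The resolved-position test can be illustrated with a short sketch. It is not the body of try_resolve_position(): the signature, the `[[f32; 32]; 4]` weight layout, and the `hidden` bitmask are assumptions. The idea is that a position is resolved when every hidden card has exactly one player with a positive weight.

```rust
/// If every hidden card has exactly one possible owner, return the four
/// hands as bitmasks; otherwise return None.
///
/// `weights[p][c]` is the raw weight of player p holding card c and
/// `hidden` has bit c set for each card whose owner is unknown
/// (hypothetical layout).
fn try_resolve(weights: &[[f32; 32]; 4], hidden: u32) -> Option<[u32; 4]> {
    let mut hands = [0u32; 4];
    for card in 0..32usize {
        let bit = 1u32 << card;
        if (hidden & bit) == 0 {
            continue; // owner already known, nothing to resolve
        }
        let mut owners = (0..4usize).filter(|&p| weights[p][card] > 0.0);
        match (owners.next(), owners.next()) {
            // Exactly one candidate: the card is uniquely owned.
            (Some(p), None) => hands[p] |= bit,
            // Zero or several candidates: the position is not resolved.
            _ => return None,
        }
    }
    Some(hands)
}
```

When this returns hands, a single DD solve on the reconstructed state gives the exact answer, so sampling would add nothing.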

Elephant memory

Particle filter for online belief refinement. Stores up to N "particles" (each = a [u32; 4] hand assignment) accumulated from past determinizations. After observing each card play, particles inconsistent with the play are filtered out.

Field	Default	Effect
`use_elephant_memory`	false	Enable the particle filter
`elephant_smoothing`	0.05	Blending factor when combining base beliefs with particle evidence. Lower = stronger particle influence
`elephant_dominance_penalty`	0.5	Soft penalty per dominant card not played (a player who didn't play their best is unlikely to have it)
`elephant_use_dominance`	true	Apply the dominance penalty to particle scoring
`elephant_decay`	0.8	Decay factor for old particles (0.8 = particles lose 20% influence per turn). 1.0 = no decay
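One way to read elephant_smoothing is as the share of the base belief kept in a linear blend. The sketch below assumes that interpretation; the actual formula in blend_with_evidence() is not given here, and the function name and signature are hypothetical.

```rust
/// Linear blend of a base belief with particle evidence.
///
/// `smoothing` is the share kept from the base belief, so lower values give
/// the particles more influence. The linear form is an assumption made for
/// this sketch.
fn blend(base: f32, particle_evidence: f32, smoothing: f32) -> f32 {
    smoothing * base + (1.0 - smoothing) * particle_evidence
}
```

Under this reading, the default of 0.05 with a base weight of 0.2 and particle evidence of 1.0 yields roughly 0.96, so the result is dominated by the particles.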

Lifecycle

init_deal_with_config() — reset particle pool for new deal
Each search_with_stats() call — generated determinizations are added as new particles via elephant.add_particles()
record_action() after each play — elephant.observe_play() filters surviving particles consistent with the observed card. If dominance penalty is on, particles where the player held a dominant card they didn't play get downweighted
Next search — compute_evidence() aggregates surviving particle distributions, then blend_with_evidence() combines with base beliefs (NN or heuristic) using elephant_smoothing
Resolved position case — when a single DD solve is enough, the resolved hands are fed back as a particle (re-seeds the pool with ground truth)

Stats: elephant_stats() → (surviving_particles, total_particles) and elephant_evidence(state) → Option<weights>.
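The filter-then-aggregate part of the lifecycle can be sketched as follows. The `Particle` struct, the per-particle weight, and both function signatures are assumptions for illustration; the real pool stores `[u32; 4]` hand assignments, but when decay is applied and how the dominance penalty enters are not modelled here.

```rust
/// One remembered world: a bitmask of held cards per player, plus a weight
/// (the weight field is an assumption of this sketch).
#[derive(Clone, Debug, PartialEq)]
struct Particle {
    hands: [u32; 4],
    weight: f32,
}

/// Keep only particles in which `player` actually held `card`, remove the
/// played card from the survivors, and age their weights by `decay`.
fn observe_play(particles: &mut Vec<Particle>, player: usize, card: u8, decay: f32) {
    let bit = 1u32 << card;
    particles.retain(|p| (p.hands[player] & bit) != 0);
    for p in particles.iter_mut() {
        p.hands[player] &= !bit;
        p.weight *= decay;
    }
}

/// Weighted share of surviving particles in which `player` holds `card`.
/// Returns None when no particle survives.
fn evidence(particles: &[Particle], player: usize, card: u8) -> Option<f32> {
    let total: f32 = particles.iter().map(|p| p.weight).sum();
    if total <= 0.0 {
        return None;
    }
    let bit = 1u32 << card;
    let held: f32 = particles
        .iter()
        .filter(|p| (p.hands[player] & bit) != 0)
        .map(|p| p.weight)
        .sum();
    Some(held / total)
}
```

Decay leaves the relative weights of the survivors unchanged. It only matters once fresh particles are added at full weight by a later search, at which point older particles count for less.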

When to use

Elephant memory is most useful mid-to-late game when belief uncertainty has narrowed and particles can converge. In early tricks the particle pool is sparse and noisy. It's off by default in production bots — primarily an experimental feature for belief studies.

Performance

Config	Time per move	Notes
20 dets, no belief, full hand	~50 ms	Default setup
20 dets, with NN belief	~70 ms	+20 ms for the NN forward pass and hybrid blend
20 dets, mid-game (4 tricks left)	~20 ms	Smaller search trees
With `time_limit_ms=50`, full hand	up to 50 ms × 8/8 = 50 ms	Auto-scaled
With `time_limit_ms=50`, last trick	up to 50 ms × 1/8 ≈ 6 ms	Endgame finishes fast
Resolved position (early term)	~5-10 ms	Single DD solve
Forced move (early term)	<1 µs	Constant return

Variants in the arena

Set in arena/bots/ TOML files:

[play]
method = "smart_is_dd"     # alternatives: "is_dd", "smart_ismcts"
time_ms = 50               # → time_limit_ms
determinizations = 20

[belief]
model = "models/belief_v3.bin"     # optional, loads BeliefNet (soft predictions)

Hard constraints are applied automatically — there is no flag for them.

Reference bots:

nn_v2_isdd_no_belief — heuristic CardBeliefs only, no NN
nn_v2_isdd — NN belief net + hard constraints (current production strongest)

API

use colver_core::is_dd::{IsDdSearch, IsDdConfig};

let mut search = IsDdSearch::new();
// optional: search.load_belief_net("models/belief_v3.bin")?;

let config = IsDdConfig {
    determinizations: 20,
    time_limit_ms: Some(50),
    use_nn_beliefs: true,    // optional soft beliefs; hard constraints always on
    ..Default::default()
};

// Initialize beliefs at start of deal
search.init_deal_with_config(&state, observer, &config);

// Each turn (any player):
search.record_action(&state_before, player, action);  // update beliefs

// When it's our turn:
let action = search.search(&state, &config, &mut rng);
// Or with stats:
let result = search.search_with_stats(&state, &config, &mut rng);
// result.best_action, result.card_scores, result.determinizations

Parallel variant: search_parallel() (requires parallel feature, uses rayon).

Comparison with DMC

DMC and IS-DD have very similar mean MAE vs DD (~19) but make different errors; see bid/reward_studies/v3_reward_study.md. On the same deal, the Hamming distance between their plays is 29 of 32 cards, so the two are stylistically opposed: DMC plays Aces immediately, while IS-DD systematically pulls trumps with the J.

Both are realistic players. IS-DD is slightly stronger on extreme hands (capots, very weak); DMC is slightly stronger on standard mid-range hands. Their max(DMC, ISDD) is the basis for the bid_v3_max_20M champion.
