You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix fuzzy dedup producing zero duplicates under Ray
When _load_fuzzy_dupe_map_shard and _compute_fuzzy_dedup_stats created a
ZephyrContext with client=LocalClient(), the coordinator job called
current_client() which auto-detected Ray and spawned workers as separate
processes. The add_to_dup_map closure then modified serialized copies of
shard_dup_map, leaving the original dict empty. Wrap both call sites with
set_current_client so the LocalClient propagates via context variables.
0 commit comments