Skip to content

Latest commit

 

History

History
25 lines (19 loc) · 923 Bytes

File metadata and controls

25 lines (19 loc) · 923 Bytes

Experiment 13: Scatter Access Pattern

1. Focus

  • Measure the cost of indirect writes and the slowdown introduced by target collisions.

2. Question

  • How much do write-target distribution and contention change scatter throughput?

3. Variants

  • unique_targets
  • low_collision_random
  • high_collision_clustered

4. Method

  • Keep the number of logical writes and arithmetic work fixed while changing only the target-index distribution.
  • Validate the final output against deterministic CPU reference behavior for each collision regime.

5. Outputs

  • Median GPU time by scatter distribution.
  • Relative slowdown vs the unique-target baseline.
  • Contention-sensitive throughput comparison.

6. Interpretation

  • Scatter is not just the write-side mirror of gather because collisions can serialize progress.
  • The result should guide whether later pipelines need privatization, staging, or compaction.