`RedistributeCPU` should be a no-op when run with 1 MPI rank and 1 OpenMP thread. In a benchmark of Warp vs. WarpX shown by @dpgrote today, it does in fact still take ~20% of the time of the 2D simulation loop in WarpX. Am I overlooking something? :)