Also related: #536 In general I think our hierarchical parallelism documentation needs to be improved.