Once the kokkos threading is done, adding MPI support on top could potentially turn this into a real HPC program.