As per mdame, we can't ensure that the cluster is actually balanced if other tests are adding or deleting pods in parallel.