Inference Speed with 11-33-40 vit-small

I found that the inference speed for ViT-small is only about 10% faster than that of ViT-large on a Jetson AGX Orin, using the same set of parameters and both without using TensorRT for inference. Could you let me know if this seems right, and whether this speed improvement is similar to what you observe?