I run docling-serve on a server with Nvidia GPU support. I use the docling-serve-cu128 image (but have tried docling-serve-cu130 too). Versions below v1.13.0 work fine. The v1.13.0/v1.13.1 versions of docling-serve throw:
import flash_attn_2_cuda as flash_attn_gpu
ModuleNotFoundError: No module named 'flash_attn_2_cuda'
when using the surya_ocr plugin. I see others have had this issue in the past: #272. It was supposedly fixed in #292.
Anything I can change to solve this?