docling-serve v1.13.0 and upwards give `No module named 'flash_attn_2_cuda'` error #517

@TomNes94

Description

I run docling-serve on a server with Nvidia GPU support. I use the docling-serve-cu128 image (but have tried docling-serve-cu130 too). Versions below v1.13.0 work fine. The v1.13.0/v1.13.1 versions of docling-serve throw:

import flash_attn_2_cuda as flash_attn_gpu
ModuleNotFoundError: No module named 'flash_attn_2_cuda'

when using the surya_ocr plugin. I see others have had this issue in the past: #272. It was supposedly fixed in #292.

Is there anything I can change to solve this?
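For anyone trying to reproduce this, here is a minimal sketch of a check that could be run with the Python interpreter inside the container to see which of the relevant modules can be located at all. The helper names `module_available` and `report` are hypothetical. `flash_attn_2_cuda` comes from the traceback above, while `torch` and `flash_attn` are my assumptions about which packages are involved. It only tests whether each module can be found; it does not load the CUDA extension.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the top-level module `name` can be located (it is not imported)."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec can raise for broken or half-initialised modules; treat as missing.
        return False


def report(names):
    """Map each module name to whether it can be located."""
    return {name: module_available(name) for name in names}


if __name__ == "__main__":
    # Assumed module list: only flash_attn_2_cuda is taken from the traceback.
    for name, ok in report(["torch", "flash_attn", "flash_attn_2_cuda"]).items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If `flash_attn` is reported as found but `flash_attn_2_cuda` as missing, that would suggest the Python package is installed without its compiled extension, though this check alone cannot say why.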
