Fix Docker deployment #30
Changes from 19 commits
It's a bit weird not to use the same base image regardless of the target platform. Additionally, `ubuntu-latest` does support linux/arm64 as per the documentation.
As a future improvement, we could have a workflow that reuses `ubuntu-latest` with several target platforms.
This means you build both the GPU and CPU images while only actually using one of the two in the end.
On my computer the build takes more than 10 minutes; we could optimize it.
There should also be a CPU-only image (there will soon be support for API-based LLMs that do not require GPUs).
I have created `Dockerfile.cpu` using `python:3.11-slim`, but `sglang` is a hard dependency in `pyproject.toml`, so the build fails because it needs CUDA headers that don't exist in the CPU image.
One thing we can do is move sglang and related dependencies to `project.optional-dependencies` in `pyproject.toml`, or we can wrap the SGLang import in `llm_processor.py` with `try/except`.
How would you like me to proceed?
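For reference, the `try/except` option could look roughly like the sketch below. It is only an illustration: the helper names (`optional_import`, `require_sglang`, `SGLANG_AVAILABLE`) are hypothetical and not taken from `llm_processor.py`. Note that a guard like this only helps at import time; if the failure happens while installing `sglang`, the dependency would still need to move to an optional group.

```python
import importlib
from types import ModuleType
from typing import Optional


def optional_import(module_name: str) -> Optional[ModuleType]:
    """Return the imported module, or None if it is not installed."""
    try:
        return importlib.import_module(module_name)
    except ImportError:
        return None


# "sglang" is the real package name; the names below are hypothetical.
sglang = optional_import("sglang")
SGLANG_AVAILABLE = sglang is not None


def require_sglang() -> ModuleType:
    """Fail with a clear message when a GPU-only code path is reached."""
    if sglang is None:
        raise RuntimeError(
            "sglang is not installed; install the GPU dependencies "
            "or use a backend that does not need a local GPU."
        )
    return sglang
```

Code paths that need a local model would call `require_sglang()` first, so the CPU image can still import the module and only fails if a GPU-only feature is actually used.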
You can take inspiration from mmore's dependencies, which have separate rules for CPU and GPU.
After testing on RCP, `lmsysorg/sglang:latest` is based on CUDA 13, but the dependency `sgl_kernel` requires a CUDA 12 library, so we'll have to pin the base image to an older version built on CUDA 12.
Updated the description.