|
1 | 1 | # Fork of KServe for huggingfaceserver CVE fixes |
2 | 2 |
|
3 | | -This is a fork of kserve that serves to document how we built the image: |
| 3 | +This is a fork of kserve that serves to document how we built the images: |
4 | 4 |
|
5 | 5 | ``` |
6 | | -*******782.dkr.ecr.us-east-1.amazonaws.com/library/kserve-huggingfaceserver:v0.16.0 |
| 6 | +*******782.dkr.ecr.us-east-1.amazonaws.com/library/kserve-huggingfaceserver:v0.16.0* |
7 | 7 | ``` |
8 | 8 |
|
9 | 9 | The official image released by kserve had several high and critical CVEs. To build our version, use the `python/huggingface_server.Dockerfile` dockerfile. |
@@ -37,22 +37,18 @@ curl -v http://0.0.0.0:8080/openai/v1/chat/completions -H "Content-Type: applica |
37 | 37 |
|
38 | 38 | The `reasoning_effort` is not available for all models. |
39 | 39 |
|
40 | | -## SHA256 fix |
| 40 | +# Updating vLLM version |
41 | 41 |
|
42 | | -The image: |
| 42 | +To update the vLLM version, edit the following files: |
43 | 43 |
|
44 | 44 | ``` |
45 | | -**********782.dkr.ecr.us-east-1.amazonaws.com/library/kserve-huggingfaceserver:v0.16.0.sha256.1 |
| 45 | +python/huggingface_server.Dockerfile # (VLLM_VERSION arg) |
| 46 | +python/huggingfaceserver/pyproject.toml |
| 47 | +python/kserve/pyproject.toml |
46 | 48 | ``` |
47 | 49 |
|
48 | | -is a temporary workaround to allow vLLM to work in FIPS constrained environments, where `hashlib.md5` is disabled. This image was made by first building the one above, and then exec-ing into it and running the following commands: |
| 50 | +Make sure you test your builds before deploying them after updating vLLM's version. The vLLM project is known to sometimes shuffle stuff internally and that can break kserve's vllm usage patterns. |
49 | 51 |
|
50 | | -```bash |
51 | | -$ cd /kserve-workspace/prod_venv/lib64/python3.12/site-packages/vllm/ |
52 | | -$ find . -type f -exec sed -i 's/hashlib\.md5/hashlib.sha256/g' {} + |
53 | | -``` |
54 | | - |
55 | | -This replaces all `hashlib.md5` calls with `hashlib.sha256`. Once that change is made inside the container, that running image is committed so the changes persist. |
56 | 52 |
|
57 | 53 | # KServe |
58 | 54 | [](https://pkg.go.dev/github.com/kserve/kserve) |
|
0 commit comments