Commit a0ff41d
[VLLM][ARM64] Currency Release (#5154)
* build arm 64 vllm image
* modify change log to add arm64
* make arm64 true
* build 0.10.1
* build 0.10.1 add platform
* build 0.10.1 add upstream commands
* build 0.10.1 add upstream commands
* build 0.10.1 build target fix
* build 0.10.1 build target fix
* build 0.10.1
* add pip setuptools
* add pip setuptools
* build without oss compliance
* build without oss compliance
* remove --mount
* build base, wheel and final
* build base, wheel and final
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* build arm64
* add max jobs
* remove pytorch installation with pip
* reduce layers
* reduce layers
* reduce layers
* increase max jobs
* try precompiled wheels
* try precompiled wheels
* add additional dep
* add python installation in vllm-base
* fix fun name
* fix instance type
* fix instance type
* fix ec2 launch function as arm64 is non-EFA
* fix ec2 launch function as arm64 is non-EFA
* fix ec2 launch function as arm64 is non-EFA
* fix ec2 launch function as arm64 is non-EFA
* add sleep for manual testing
* add sleep for manual testing
* use precompiled
* rebuild arm64
* rebuild arm64
* rebuild arm64
* test
* test
* try offline inference
* try offline inference
* try offline inference
* try offline inference
* try offline inference with new built image
* remove commands
* add cd command
* add cd command
* add cuda target
* modify docker image
* modify docker image
* modify file from github
* add final target
* add final target
* add final target
* remove xformers
* build arm64
* build arm64
* build arm64
* add max jobs
* add requirements
* max job 20
* max job 20
* add agent testing
* test
* rebuild
* rebuild
* rebuild
* rebuild
* rebuild
* add pytorch wheels
* rebuild
* test
* test
* test and build
* test
* test
* test
* test
* test
* test
* test
* test
* test
* rebuild
* remove strands
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* test x86 vllm
* test x86
* test x86
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* rebuild and test agents
* change triton to 3.4.0
* test with main
* test with main
* test with main
* use vllm v0
* test with vllm serve
* increase attempts
* add more logging
* flashinfer wheels:
* flashinfer wheels:
* flashinfer wheels:
* downgrade flashinfer
* downgrade flashinfer and triton
* downgrade flashinfer and triton
* fix flashinfer
* fix flashinfer
* fix flashinfer
* install flashinfer separately
* use float32
* try new vllm serve command with gpu memory utilization
* test agents with new docker setup
* test agents with new docker setup
* test vllm with autogen
* test vllm with autogen
* test autogen vllm
* format logs
* format logs
* format logs
* format logs
* test open ai example
* final testing
* Final build
* revert toml
* final testing
* perform openai script test
* perform openai script test
* perform openai script test with reasoning
* perform openai script test with reasoning
* perform openai script test with reasoning
* Try Qwen model
* Try Qwen model
* revert changes
* add vllm in toml
* remove test_agents.py
* change version
* remove changes in changelog
* remove changes in changelog

1 parent 502da71 commit a0ff41d
File tree
13 files changed (+564, -156 lines)
- scripts
- test
- dlc_tests/sanity
- vllm
- ec2
- infra
- test_artifacts
- utils
- vllm
- arm64/gpu