Commit efc1616
authored
Maas api add gpu and restructure (#95)
* Working version with qwen
* Initial restructured commit
* Semi woking token based limiting, for enterprise for example it stops on 120 tokens but it should happen on 1000
* Restructuring as per PR comments + update token rate limiting, however there are still issues with tiers that needs to be debugged and fixed
* Fixes as per coderabbit comments
* fix: Remove duplicate HTTPRoute definition after rebase
- Removed duplicate gateway-routes.yaml from base
- HTTPRoute for maas-api is now only in base/networking/httproute.yaml
- Fixed kustomization build error after rebasing on main
* fix: Restore maas-api/deploy files that exist on main
- Restored kuadrant.yaml, gateway.yaml, httproute.yaml files
- Restored gateway-auth-policy.yaml for model policies
- Fixed duplicate ServiceAccount in deployment/samples/models
- All validations now pass except 2 ODH-related that also fail on main
(external GitHub dependency issue with kustomize)
These files were incorrectly removed during rebase thinking they were
duplicates, but they serve different purposes than the ones in deployment/
* Fixes to deployment, added updated opendatahub/maas-api image, added a script for fast openshift deployment
* Referenced PR comments, tested deployment
* Improve scrit, removed dynamic AWS ELB address retrieving and added openshif route retrieving
* Removed kustomization config as it is redundant
* Updating policies
* Updating rbac and some minor changes around the install script and instructions
* Trying to move VLLMInferenceService to deployment/ not everything works, but simulator should work
* Updating instructions and cluster role issues
* Fixed qwen for LLMInferenceService, not finished facebook cpu model
* Removed conflicting parts from maas-api
* Revert maas-api/ directory to latest main branch state
---------
Co-authored-by: Jamie Land <hokie10@gmail.com> & Bartos1 parent d744fa3 commit efc1616
File tree
105 files changed
+2806
-3046
lines changed- deployment
- base
- maas-api
- networking
- gateway-class
- policies
- token-rate-limiting
- components
- kserve
- observability
- grafana
- observability
- prometheus
- odh
- examples
- basic-deployment
- gpu-deployment-openshift
- gpu-deployment
- kustomize-templates
- auth/api-keys
- models
- qwen3
- simulator
- observability
- simulator-deployment-openshift
- simulator-deployment
- infrastructure
- kustomize-templates
- istio
- kserve
- kuadrant
- kuadrant-configure
- maas-api
- namespaces
- overlays/openshift
- overlays
- kubernetes
- openshift
- samples/models
- facebook-opt-125m-cpu
- qwen3
- rbac
- simulator
- scripts
- installers
- scripts
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
105 files changed
+2806
-3046
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
| 40 | + | |
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
11 | | - | |
12 | | - | |
| 11 | + | |
| 12 | + | |
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
| |||
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
29 | | - | |
| 29 | + | |
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
| |||
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
49 | | - | |
| 49 | + | |
50 | 50 | | |
51 | | - | |
52 | | - | |
| 51 | + | |
53 | 52 | | |
54 | | - | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
55 | 66 | | |
56 | | - | |
| 67 | + | |
57 | 68 | | |
58 | 69 | | |
59 | 70 | | |
| |||
107 | 118 | | |
108 | 119 | | |
109 | 120 | | |
110 | | - | |
111 | | - | |
112 | | - | |
113 | | - | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | 121 | | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
136 | 129 | | |
137 | | - | |
138 | | - | |
139 | | - | |
140 | | - | |
141 | | - | |
142 | | - | |
| 130 | + | |
143 | 131 | | |
144 | | - | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
145 | 138 | | |
146 | | - | |
147 | | - | |
148 | | - | |
| 139 | + | |
149 | 140 | | |
150 | | - | |
151 | | - | |
152 | | - | |
153 | | - | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
154 | 147 | | |
155 | | - | |
| 148 | + | |
156 | 149 | | |
157 | | - | |
158 | | - | |
159 | | - | |
160 | | - | |
161 | | - | |
162 | | - | |
163 | | - | |
164 | | - | |
165 | | - | |
166 | | - | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
167 | 155 | | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
| 156 | + | |
172 | 157 | | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | | - | |
177 | | - | |
| 158 | + | |
178 | 159 | | |
179 | | - | |
180 | | - | |
181 | | - | |
182 | | - | |
| 160 | + | |
| 161 | + | |
183 | 162 | | |
184 | | - | |
185 | | - | |
186 | | - | |
187 | | - | |
188 | | - | |
| 163 | + | |
| 164 | + | |
189 | 165 | | |
190 | 166 | | |
191 | | - | |
| 167 | + | |
192 | 168 | | |
193 | | - | |
194 | | - | |
195 | | - | |
| 169 | + | |
| 170 | + | |
196 | 171 | | |
197 | 172 | | |
198 | | - | |
| 173 | + | |
199 | 174 | | |
200 | | - | |
201 | | - | |
202 | | - | |
| 175 | + | |
| 176 | + | |
203 | 177 | | |
204 | 178 | | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
205 | 186 | | |
206 | 187 | | |
| 188 | + | |
207 | 189 | | |
208 | | - | |
209 | | - | |
210 | | - | |
211 | | - | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
212 | 195 | | |
213 | | - | |
| 196 | + | |
214 | 197 | | |
215 | | - | |
| 198 | + | |
216 | 199 | | |
217 | | - | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
218 | 205 | | |
219 | | - | |
| 206 | + | |
220 | 207 | | |
221 | | - | |
222 | | - | |
223 | | - | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
0 commit comments