Commit ee4dc5d
Emit X-SMG-Routing-Key from the session server for sticky agentic routing (#30)
The session server proxies each agent turn to the externally-launched SMG
gateway. Tag every proxied chat-completion with X-SMG-Routing-Key=session_id so
a routing-key gateway policy (manual / consistent_hashing) pins the session to
one worker, reusing its KV cache across turns.
Emitted unconditionally: the gateway is launched by the cluster job (RL360), not
by miles, so miles cannot know its policy. The header is ignored by policies
that do not route on it (e.g. cache_aware); only manual / consistent_hashing
read it. Selecting manual + min_load is a gateway-launch (RL360) change, not a
miles change.
Co-authored-by: Rupesh K Srivastava <rupspace@gmail.com>1 parent a73cff0 commit ee4dc5d
1 file changed
Lines changed: 12 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
253 | 253 | | |
254 | 254 | | |
255 | 255 | | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
256 | 264 | | |
257 | 265 | | |
258 | | - | |
| 266 | + | |
259 | 267 | | |
260 | 268 | | |
261 | 269 | | |
| |||
279 | 287 | | |
280 | 288 | | |
281 | 289 | | |
282 | | - | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
283 | 293 | | |
284 | 294 | | |
285 | 295 | | |
| |||
0 commit comments