Skip to content

Releases: mostlygeek/llama-swap

v77

17 Dec 22:39
891f6a5
Compare
Choose a tag to compare
v77

Changelog

v76

17 Dec 00:26
7183f6b
Compare
Choose a tag to compare
v76

Changelog

  • 7183f6b fix bad logging due to wrong []byte used #28
  • d89bfeb add .DS_Store to .gitignore
  • 9a0c6be Improve stop exceptions (#28) (#29)
  • d6ca535 tweak release tagging so it is not based on number of commits

v75

14 Dec 18:31
27302c0
Compare
Choose a tag to compare
v75

This is the first release that changes change the Release versioning from semver to just the number of commits in the main branch.

Changelog

  • 27302c0 change llama-swap to use goreleaser default ldflag values

v0.1.5

10 Dec 03:12
5fbd53c
Compare
Choose a tag to compare

Changelog

  • 5fbd53c delay TTL check until after all requests are complete (#25)

v0.1.4

09 Dec 05:39
97dae50
Compare
Choose a tag to compare

Changelog

  • 97dae50 update readme
  • cb978f7 add web interface to /logs
  • 387f0ef use new timings data in server response in run-benchmark.sh

v0.1.3

04 Dec 00:01
18c1346
Compare
Choose a tag to compare

Changelog

  • 18c1346 Add Access-Control-Allow-Origin CORS header to /v1/models endpoint
  • da2326b add example: optimizing code generation
  • da46545 fix profile example in README

v0.1.2

01 Dec 17:12
04b4760
Compare
Choose a tag to compare

!! Note breaking change in this commit if you're using profiles !!

  • model names with a profile has changed from profile/model to profile:model. The / was swapped to a :
  • Example, coding/qwen-2.5-coder-32B is now coding:qwen-2.5-coder-32B.

Changelog

v0.1.1

30 Nov 23:26
cf82b3c
Compare
Choose a tag to compare

Changelog

v0.1.0

24 Nov 03:47
73ad85e
Compare
Choose a tag to compare

Changelog

  • 73ad85e Implement Multi-Process Handling (#7)
  • 533162c add support for automatically unloading a model (#10) (#14)
  • ba39ed4 Add support for legacy v1/completions API (#12)
  • 21f54f9 Merge pull request #13 from mostlygeek/set-content-length
  • 7eec51f Dechunk HTTP requests by default (#11)
  • 5021e0f remove the process handler override
  • c9233d2 use gin instead of standard http lib in main
  • a33ac6f update README
  • 401aa88 move log handlers to separate file
  • e9e88fd rename proxy.go to proxymanager.go
  • c3b4bb1 use gin for http server
  • e5c909d add tests for proxy.Process
  • 36a31f4 add proxy.Process to manage upstream proxy logic
  • a8e5ee1 Add logging with pipes example to README

v0.0.10

10 Nov 04:22
5944a86
Compare
Choose a tag to compare

Changelog