Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

mostlygeek / llama-swap Public

Notifications You must be signed in to change notification settings
Fork 34
Star 674

Code
Issues 11
Pull requests 3
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Releases: mostlygeek/llama-swap

Releases Tags

Releases · mostlygeek/llama-swap

v77

17 Dec 22:39

github-actions

v77

891f6a5

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v77

Changelog

891f6a5 Add /upstream endpoint (#30)

Assets 7

All reactions

v76

17 Dec 00:26

github-actions

v76

7183f6b

This commit was signed with the committer’s verified signature.

mostlygeek Benson Wong

GPG key ID: 8C992B23151E99AF

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v76

Changelog

7183f6b fix bad logging due to wrong []byte used #28
d89bfeb add .DS_Store to .gitignore
9a0c6be Improve stop exceptions (#28) (#29)
d6ca535 tweak release tagging so it is not based on number of commits

Assets 7

All reactions

v75

14 Dec 18:31

github-actions

v75

27302c0

This commit was signed with the committer’s verified signature.

mostlygeek Benson Wong

GPG key ID: 8C992B23151E99AF

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v75

This is the first release that changes change the Release versioning from semver to just the number of commits in the main branch.

Changelog

27302c0 change llama-swap to use goreleaser default ldflag values

Assets 7

All reactions

v0.1.5

10 Dec 03:12

github-actions

v0.1.5

5fbd53c

This commit was signed with the committer’s verified signature.

mostlygeek Benson Wong

GPG key ID: 8C992B23151E99AF

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v0.1.5

Changelog

5fbd53c delay TTL check until after all requests are complete (#25)

Assets 7

All reactions

v0.1.4

09 Dec 05:39

github-actions

v0.1.4

97dae50

This commit was signed with the committer’s verified signature.

mostlygeek Benson Wong

GPG key ID: 8C992B23151E99AF

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v0.1.4

Changelog

97dae50 update readme
cb978f7 add web interface to /logs
387f0ef use new timings data in server response in run-benchmark.sh

Assets 7

All reactions

v0.1.3

04 Dec 00:01

github-actions

v0.1.3

18c1346

This commit was signed with the committer’s verified signature.

mostlygeek Benson Wong

GPG key ID: 8C992B23151E99AF

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v0.1.3

Changelog

18c1346 Add Access-Control-Allow-Origin CORS header to /v1/models endpoint
da2326b add example: optimizing code generation
da46545 fix profile example in README

Assets 7

All reactions

v0.1.2

01 Dec 17:12

github-actions

v0.1.2

04b4760

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v0.1.2

!! Note breaking change in this commit if you're using profiles !!

model names with a profile has changed from profile/model to profile:model. The / was swapped to a :
Example, coding/qwen-2.5-coder-32B is now coding:qwen-2.5-coder-32B.