Skip to content

Commit e9d8b85

Browse files
committed
release: v0.5.10
1 parent 31c0334 commit e9d8b85

3 files changed

Lines changed: 15 additions & 2 deletions

File tree

CHANGELOG.md

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,19 @@ All notable changes to this project will be documented in this file.
44

55
The format is based on [Keep a Changelog](https://keepachangelog.com/), and this project adheres to [Semantic Versioning](https://semver.org/).
66

7+
## [0.5.10] - 2026-06-11
8+
9+
### Fixed
10+
11+
- Strong partial-offload candidates are no longer buried below weaker full-GPU
12+
models because the final ranking sort no longer counts full-GPU fit a second
13+
time after runtime-fit and speed penalties have already been applied. Light
14+
partial offload is penalized less aggressively, while heavy dense offload
15+
remains strongly discounted. (#105, #108)
16+
- MoE partial-offload scoring now uses the active parameter working set when it
17+
can plausibly stay on GPU, so active-small MoE models are not penalized like
18+
dense models with the same total parameter count. (#105, #108)
19+
720
## [0.5.9] - 2026-06-10
821

922
### Added

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
[project]
22
name = "whichllm"
3-
version = "0.5.9"
3+
version = "0.5.10"
44
description = "Find the best LLM that runs on your hardware"
55
authors = [{name = "Andyyyy64"}]
66
readme = "README.md"

uv.lock

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)