@@ -4,10 +4,28 @@ All notable changes to this project will be documented in this file.
44
55The format is based on [ Keep a Changelog] ( https://keepachangelog.com/ ) , and this project adheres to [ Semantic Versioning] ( https://semver.org/ ) .
66
7- ## [ Unreleased]
7+ ## [ 0.5.9] - 2026-06-10
8+
9+ ### Added
10+
11+ - MXFP4 and NVFP4 4-bit quantization support across ID/filename parsing, VRAM
12+ estimation, quality penalties, speed efficiency, and family grouping.
13+ Repos shipping these formats were previously labeled FP16 and their VRAM
14+ requirement overestimated about 3.5x. (#99 )
15+ - Apple M5-family entries for ` --gpu ` simulation. (#92 )
16+ - Kepler-era Quadro bandwidth and compute capability entries. (#75 )
817
918### Fixed
1019
20+ - AMD discrete GPU detection on Linux: rocm-smi names are read from the
21+ correct ` Card Series ` key, compound lspci names such as
22+ ` Navi 22 [Radeon RX 6700/6700 XT/6750 XT ...] ` resolve bandwidth, sysfs VRAM
23+ enriches the fallback path, and discrete cards are no longer mislabeled
24+ ` shared memory ` . Adds RX 6750 XT / RX 6700 / RX 6650 XT / RX 6600 series and
25+ Radeon AI PRO R9700 to the bandwidth catalog. (#61 , #68 )
26+ - Community GGUF repos without ` base_model ` metadata (for example
27+ ` unsloth/...-GGUF ` ) now inherit the official model's benchmark score by
28+ name matching instead of falling through to no evidence. (#94 )
1129- GPU bandwidth detection no longer depends solely on the hand-curated
1230 catalog. When a detected card is missing from ` GPU_BANDWIDTH ` , bandwidth is
1331 now resolved from the bundled TechPowerUp database (dbgpu, 2824 GPUs) using
0 commit comments