
Add llama.acceleration.fit #560

Open
will-lms wants to merge 22 commits into main from will/llama-fit

Conversation

@will-lms (Contributor)

Add llama.acceleration.fit to the KV Config schema. This is a new boolean that will enable / disable llama.cpp's -fit model loading logic.

When enabled, the other GPU parameters will be ignored.

To maintain compatibility with existing SDK usage, I set the schema default to false. This allows a call such as load({gpu: {ratio: 0.5}}) to keep working as originally intended. I plan, though, to default the standard GUI and CLI paths to true via explicit overrides in those locations.
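A minimal sketch of this default/override interplay (hypothetical types and a stand-in resolveFit helper, loosely modeled on the description above, not the real lmstudio SDK schema):

```typescript
// Sketch only: hypothetical shapes, not the real KV Config schema.
interface GpuConfig {
  ratio?: number;
}

interface LoadConfig {
  gpu?: GpuConfig;
  // llama.acceleration.fit; schema default is false
  fit?: boolean;
}

// GUI/CLI paths would pass an explicit override; plain SDK calls fall back to
// the schema default (false), so their GPU settings keep their original meaning.
function resolveFit(config: LoadConfig, explicitOverride?: boolean): boolean {
  return explicitOverride ?? config.fit ?? false;
}

// A call like load({ gpu: { ratio: 0.5 } }) still behaves as before:
const sdkFit = resolveFit({ gpu: { ratio: 0.5 } }); // false: gpu.ratio is honored

// A GUI/CLI path opting in explicitly:
const guiFit = resolveFit({ gpu: { ratio: 0.5 } }, true); // true: fit takes over
```

The key point is that absence of the key and an explicit false are treated the same for SDK callers, which is what keeps existing calls backwards compatible.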

@github-actions bot added the "CLA signed" label (indicates all contributors have signed the CLA) Mar 12, 2026
@will-lms (Contributor, Author)

Review

Architecture / Unintended Consequences

At a high level this PR is well-scoped. The public type, schema, conversion logic, and regression tests all move together.

The only subtle area is the inferred fit=false behavior, because it is intentionally encoding precedence rules from the config stack into the converter. In this case that is justified: the converter is exactly where callers expect “preserve my intent when serializing config” logic to live.
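The inferred fit=false rule described above can be sketched roughly as follows (hypothetical function and field names; the real converter lives in the SDK's KV Config layer and may differ):

```typescript
// Hypothetical sketch of the precedence rule the converter encodes: an explicit
// GPU field means "preserve my intent", so serialization infers fit = false.
interface GpuFields {
  ratio?: number;
  mainGpu?: number;
  splitStrategy?: string;
}

function inferFit(gpu: GpuFields | undefined, defaultFit: boolean): boolean {
  const hasExplicitGpuField =
    gpu !== undefined &&
    (gpu.ratio !== undefined ||
      gpu.mainGpu !== undefined ||
      gpu.splitStrategy !== undefined);
  // Explicit GPU tuning wins over whatever default the surrounding stack uses.
  return hasExplicitGpuField ? false : defaultFit;
}
```

This is exactly the "preserve my intent when serializing config" behavior: the caller's GPU fields only keep effect if fit is forced off during serialization.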

I do not see broader architectural problems or unintended cross-package consequences in the actual origin/main...HEAD diff.

Findings

No blocking findings in the corrected PR diff.

Residual Risk

The only minor gap I see is that the tests are centered on partial-config behavior. There is no targeted test for the useDefaultsForMissingKeys: true path in kvConfigToLLMLoadModelConfig, but that is a testing gap rather than a clear defect in this PR.
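A targeted test for that path might look roughly like this, using a stand-in converter (the real kvConfigToLLMLoadModelConfig signature and defaults may differ):

```typescript
// Stand-in converter to illustrate the untested path only; not the real
// kvConfigToLLMLoadModelConfig implementation.
type KvConfig = Record<string, unknown>;

interface ConvertOptions {
  useDefaultsForMissingKeys?: boolean;
}

function kvToLoadConfigSketch(kv: KvConfig, opts: ConvertOptions = {}) {
  const raw = kv["llama.acceleration.fit"];
  if (raw === undefined && opts.useDefaultsForMissingKeys) {
    // With defaults enabled, a missing key resolves to the schema default.
    return { fit: false };
  }
  // Otherwise the key stays absent so partial configs round-trip faithfully.
  return { fit: raw as boolean | undefined };
}
```

The test to add would pin down both branches: defaults enabled yields the schema default, defaults disabled leaves the field undefined.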

@will-lms will-lms marked this pull request as ready for review March 12, 2026 04:52

@chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5e639b1c19

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

* where 0 means no work is distributed and 1 means all work is distributed. Can also specify the
* string "off" to mean 0 and the string "max" to mean 1.
*
* When `fit` is true, this field is ignored.
Member


Debate: should this be ignored, or should we throw?

Member


I do share the concern about this kind of implicit ignoring of parameters (and of the relationship between parameters) here, and wonder if there's a better way to represent the relationship (maybe not)

Member


Potential language-enforced mechanism for this general type of thing: a discriminated union where you can't set the other params if you set fit. I haven't thought through the backwards-compat impact of doing this here, but the relationship itself does seem like a discriminated one?
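The discriminated-union idea sketched concretely (illustrative field names, not the actual schema):

```typescript
// Once fit is chosen, the manual GPU knobs become unrepresentable in the type
// instead of silently ignored at runtime.
type GpuAcceleration =
  | { fit: true } // fit mode: no manual tuning allowed
  | {
      fit?: false; // manual mode (today's shape, for backwards compat)
      ratio?: number;
      mainGpu?: number;
      splitStrategy?: string;
    };

const manual: GpuAcceleration = { ratio: 0.5 }; // OK: manual tuning, no fit
const fitted: GpuAcceleration = { fit: true }; // OK: fit with nothing else
// const invalid: GpuAcceleration = { fit: true, ratio: 0.5 }; // rejected by tsc
```

The trade-off, as noted below, is backwards compatibility: existing wire configs can carry both fit and GPU fields, so the union would only help at the type level for new callers.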

Collaborator


In order to be backwards compatible, I think we have to ignore.


@chatgpt-codex-connector bot left a comment


💡 Codex Review

Reviewed commit: 94c807417c


@chatgpt-codex-connector bot left a comment


💡 Codex Review

Reviewed commit: 1222ccd6ff

}
`;

exports[`LLM with default model fixture Can tokenize correctly 1`] = `


P2: Remove stale snapshot keys after renaming heavy tests

The snapshot file now contains both the old LLM ... keys and the new LLM with default model fixture ... keys, but LLM.heavy.test.ts only defines the renamed tests (see the describe("with default model fixture") block), so running this heavy test file will report obsolete snapshots and fail until they are cleaned up. Keeping both sets of keys makes the snapshot suite unstable for anyone running heavy tests.


"identifier": Any<String>,
"indexedModelIdentifier": "lmstudio-community/Qwen2.5-0.5B-Instruct-GGUF/Qwen2.5-0.5B-Instruct-Q4_K_M.gguf",
"instanceReference": Any<String>,
"lastUsedTime": 1773374807159,


P2: Stop snapshotting volatile last-used timestamps

This snapshot pins lastUsedTime to a concrete Unix timestamp, but lastUsedTime is inherently time-dependent model metadata, so future heavy-test runs will drift and fail even when behavior is correct. Add a property matcher (e.g. expect.any(Number)/null) for this field instead of hardcoding a specific value.
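Outside of Jest's property matchers, the same idea can be sketched as a sanitizer that neutralizes the volatile field before comparison (hypothetical helper, not the repo's actual test utility):

```typescript
// Replace volatile, time-dependent fields with stable placeholders before
// snapshotting, so repeated heavy-test runs compare equal. Inside Jest, a
// property matcher like expect.any(Number) achieves the same effect.
function sanitizeModelInfo<T extends { lastUsedTime?: number | null }>(info: T) {
  return {
    ...info,
    lastUsedTime: info.lastUsedTime == null ? null : "<any number>",
  };
}

const first = sanitizeModelInfo({ identifier: "x", lastUsedTime: 1773374807159 });
const second = sanitizeModelInfo({ identifier: "x", lastUsedTime: Date.now() });
// first and second now serialize identically despite different timestamps
```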



@chatgpt-codex-connector bot left a comment


💡 Codex Review

Reviewed commit: 45725f5624

config.gpu?.ratio !== undefined
|| config.gpu?.numCpuExpertLayersRatio !== undefined
|| config.gpu?.mainGpu !== undefined
|| config.gpu?.splitStrategy !== undefined;


P2: Exclude default splitStrategy from fit inference

Inferring fit=false from any defined gpu.splitStrategy causes a regression when the strategy was auto-populated as the default ("evenly") rather than explicitly chosen by the caller. Fresh evidence: starting from { gpu: { disabledGpus: [1] } }, llmLoadModelConfigToKVConfig writes a split config, kvConfigToLLMLoadModelConfig reconstructs gpu.splitStrategy: "evenly", and this condition then forces llama.fit=false on the next serialization, which unexpectedly overrides higher-priority defaults where fit should remain enabled (disabled GPUs are documented as fit-compatible).
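One possible shape of the fix, sketched (field names follow the quoted condition; the "evenly" default comes from this finding, and the helper name is hypothetical):

```typescript
// Treat a splitStrategy equal to the auto-populated default as "not explicitly
// chosen", so a round-tripped config does not force llama.fit = false.
interface GpuLike {
  ratio?: number;
  numCpuExpertLayersRatio?: number;
  mainGpu?: number;
  splitStrategy?: string;
}

const DEFAULT_SPLIT_STRATEGY = "evenly";

function hasExplicitGpuChoice(gpu: GpuLike | undefined): boolean {
  if (gpu === undefined) return false;
  const splitIsExplicit =
    gpu.splitStrategy !== undefined &&
    gpu.splitStrategy !== DEFAULT_SPLIT_STRATEGY;
  return (
    gpu.ratio !== undefined ||
    gpu.numCpuExpertLayersRatio !== undefined ||
    gpu.mainGpu !== undefined ||
    splitIsExplicit
  );
}
```

Caveat: this cannot distinguish a caller who deliberately chose "evenly" from the auto-populated default; tracking explicitness separately (e.g. only inferring from fields present in the caller's original input) would be more robust.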


