Move more normalization into the model-runner backend #335

ericcurtin · 2025-10-28T15:42:54Z

model-runner daemon is more well equipped to identify between
model name and model id.

gemini-code-assist · 2025-10-28T15:43:11Z

Summary of Changes

Hello @ericcurtin, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request resolves an issue within the model-cli rm command where model identifiers, particularly SHA256 hashes, were being incorrectly normalized. The implemented changes introduce logic to intelligently differentiate between full model names and potential model IDs based on the presence of a slash. This ensures that model IDs are either passed through without modification or are properly expanded by the desktop client, thereby preventing incorrect processing and improving the reliability of model removal operations.

Highlights

Preventing SHA256 Normalization: The model-cli rm command now correctly handles SHA256 model identifiers by preventing their unintended normalization, ensuring they are treated as unique IDs.
Refined Model ID Handling: The Remove function in the desktop client has been updated to prioritize checking for model IDs (strings without slashes) before applying general model name normalization, expanding them if possible.
Improved CLI Robustness: This change enhances the robustness of the model-cli rm command when dealing with various forms of model identifiers, especially SHA256 hashes, preventing erroneous removal attempts.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

sourcery-ai

Hey there - I've reviewed your changes and they look great!

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

gemini-code-assist

Code Review

This pull request fixes an issue where model IDs (like sha256 hashes) were being incorrectly normalized when using model-cli rm. The changes in cmd/cli/commands/rm.go and cmd/cli/desktop/desktop.go correctly differentiate between model names and model IDs, ensuring that IDs are not normalized. The logic is sound. I've added one suggestion to refactor a part of the code in desktop.go to improve readability and reduce duplication.

cmd/cli/desktop/desktop.go

ilopezluna

I was testing this PR and I faced a similar issue when packaging a model:

MODEL_RUNNER_HOST=http://localhost:13434 docker model ls
MODEL NAME  PARAMETERS  QUANTIZATION  ARCHITECTURE  MODEL ID  CREATED  CONTEXT  SIZE 

MODEL_RUNNER_HOST=http://localhost:13434 docker model pull ai/smollm2:latest
Downloaded 270.60MB of 270.60MB
Model pulled successfully

MODEL_RUNNER_HOST=http://localhost:13434 docker model ls
MODEL NAME  PARAMETERS  QUANTIZATION    ARCHITECTURE  MODEL ID      CREATED       CONTEXT  SIZE       
smollm2     361.82 M    IQ2_XXS/Q4_K_M  llama         354bf30d0aa3  7 months ago        -  256.35 MiB  

MODEL_RUNNER_HOST=http://localhost:13434 docker model package --from ai/smollm2 --context-size 2048 smollm2:2k
INFO[0000] Successfully initialized store               
Reading model from store: "ai/smollm2"
INFO[0000] Getting model by reference: ai/smollm2       
Creating builder from existing model
Setting context size 2048
Creating lightweight model variant...
INFO[0000] Writing lightweight model variant            
Model variant created successfully
Loading model to Model Runner...
Transferred: 0.01 MB
Failed to package model
package model: failed to load packaged model: tag model: tagging failed with status 404 Not Found: tagging model: model not found

MODEL_RUNNER_HOST=http://localhost:13434 docker model ls
MODEL NAME  PARAMETERS  QUANTIZATION    ARCHITECTURE  MODEL ID      CREATED       CONTEXT  SIZE       
smollm2     361.82 M    IQ2_XXS/Q4_K_M  llama         354bf30d0aa3  7 months ago        -  256.35 MiB  
smollm2:2k  361.82 M    IQ2_XXS/Q4_K_M  llama         7522a58ecdbd  7 months ago     2048  256.35 MiB

At this point the store is corrupted and I can not remove the model:

MODEL_RUNNER_HOST=http://localhost:13434 docker model rm smollm2:2k
no such model: ai/smollm2:2k
Failed to remove model

MODEL_RUNNER_HOST=http://localhost:13434 docker model ls
MODEL NAME  PARAMETERS  QUANTIZATION    ARCHITECTURE  MODEL ID      CREATED       CONTEXT  SIZE       
smollm2:2k  361.82 M    IQ2_XXS/Q4_K_M  llama         7522a58ecdbd  7 months ago     2048  256.35 MiB  

MODEL_RUNNER_HOST=http://localhost:13434 docker model rm 7522a58ecdbd
no such model: sha256:7522a58ecdbda6cdeea5480bebd456329e65fc4bec7cd9484193a2813d4b3413
Failed to remove model

ericcurtin · 2025-10-29T17:41:43Z

@ilopezluna ready for re-review

doringeman · 2025-10-30T09:40:16Z

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model tag smollm2 my-smollm2
Model "smollm2" tagged successfully with "my-smollm2"

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model inspect my-smollm2
invalid model name: my-smollm2
Failed to get model my-smollm2

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model version
Docker Model Runner version cmd/cli/v0.1.46-67-g8e782432

doringeman · 2025-10-30T15:18:30Z

The inspect still doesn't work.

docker model inspect smollm2

And something's odd with package's tag.

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model package --from ai/smollm2:latest --context-size 1024 my-smollm2:1024
INFO[0000] Successfully initialized store
Reading model from store: "ai/smollm2:latest"
INFO[0000] Getting model by reference: ai/smollm2:latest
Creating builder from existing model
Setting context size 1024
Creating lightweight model variant...
INFO[0000] Writing lightweight model variant
Model variant created successfully
Loading model to Model Runner...
Transferred: 0.01 MB
Failed to package model
package model: failed to load packaged model: tag model: tagging failed with status 404 Not Found: tagging model: model not found

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model ls my-smollm2:1024
MODEL NAME  PARAMETERS  QUANTIZATION  ARCHITECTURE  MODEL ID  CREATED  CONTEXT  SIZE

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model ls | grep my-smollm2:1024
my-smollm2:1024                   361.82 M    IQ2_XXS/Q4_K_M  llama            c41fa645278e  7 months ago     1024  256.35 MiB

We lack tests for these.

ericcurtin · 2025-10-30T15:52:11Z

inspect seems to work with ai/ , it's a case of not enough normalisation there, the other needs to be investigated

ericcurtin · 2025-10-30T15:52:38Z

Tomorrows problem at this point

sourcery-ai

New security issues found

pkg/inference/models/manager.go

sourcery-ai

New security issues found

pkg/inference/models/manager.go

ericcurtin · 2025-11-01T19:02:14Z

@sourcery-ai please re-review

sourcery-ai · 2025-11-01T19:02:21Z

Sure! I'm generating a new review now.

sourcery-ai

Hey there - I've reviewed your changes - here's some feedback:

Blocking issues:

Detected directly writing or similar in 'http.ResponseWriter.write()'. This bypasses HTML escaping that prevents cross-site scripting vulnerabilities. Instead, use the 'html/template' package and render data using 'template.Execute()'. (link)
Detected 'printf' or similar in 'http.ResponseWriter.write()'. This bypasses HTML escaping that prevents cross-site scripting vulnerabilities. Instead, use the 'html/template' package to render data to users. (link)

General comments:

The partial name matching logic is duplicated in multiple handlers—extract it into a shared helper to avoid inconsistencies and reduce code churn.
handlePackageModel only supports lightweight, config‐only repackaging; consider adding support for full rebuilds or explicitly documenting why non-config changes aren’t allowed.
Now that the CLI stops auto-adding a ":latest" tag, verify that the backend still applies default tags for non-HF models and adjust tests or documentation to reflect this behavior.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The partial name matching logic is duplicated in multiple handlers—extract it into a shared helper to avoid inconsistencies and reduce code churn.
- handlePackageModel only supports lightweight, config‐only repackaging; consider adding support for full rebuilds or explicitly documenting why non-config changes aren’t allowed.
- Now that the CLI stops auto-adding a ":latest" tag, verify that the backend still applies default tags for non-HF models and adjust tests or documentation to reflect this behavior.

## Individual Comments

### Comment 1
<location> `pkg/inference/models/manager.go:727` </location>
<code_context>
	w.Write([]byte(fmt.Sprintf("Model tagged successfully with %q", html.EscapeString(target))))
</code_context>

<issue_to_address>
**security (go.lang.security.audit.xss.no-direct-write-to-responsewriter):** Detected directly writing or similar in 'http.ResponseWriter.write()'. This bypasses HTML escaping that prevents cross-site scripting vulnerabilities. Instead, use the 'html/template' package and render data using 'template.Execute()'.

*Source: opengrep*
</issue_to_address>

### Comment 2
<location> `pkg/inference/models/manager.go:727` </location>
<code_context>
	w.Write([]byte(fmt.Sprintf("Model tagged successfully with %q", html.EscapeString(target))))
</code_context>

<issue_to_address>
**security (go.lang.security.audit.xss.no-printf-in-responsewriter):** Detected 'printf' or similar in 'http.ResponseWriter.write()'. This bypasses HTML escaping that prevents cross-site scripting vulnerabilities. Instead, use the 'html/template' package to render data to users.

*Source: opengrep*
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

pkg/inference/models/manager.go

sourcery-ai · 2025-11-01T19:03:04Z

Hey @ericcurtin, I've posted a new review for you!

model-runner daemon is more well equipped to identify between model name and model id. Signed-off-by: Eric Curtin <[email protected]>

ericcurtin · 2025-11-01T19:10:50Z

@doringeman @ilopezluna PTAL

ericcurtin · 2025-11-03T08:35:35Z

@sourcery-ai please re-review

sourcery-ai · 2025-11-03T08:35:42Z

Sure! I'm generating a new review now.

sourcery-ai

Hey there - I've reviewed your changes - here's some feedback:

Consolidate the partial name matching and ID fallback logic into a shared resolver to reduce duplication between handlers and simplify maintenance.
Listing all models on every request for name/ID resolution can be expensive—consider caching the model list or delegating partial-match filtering into the distribution client.
handlePackageModel only supports config-only (lightweight) repackaging—either extend support for full rebuilds or explicitly error out/document unsupported layer changes.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- Consolidate the partial name matching and ID fallback logic into a shared resolver to reduce duplication between handlers and simplify maintenance.
- Listing all models on every request for name/ID resolution can be expensive—consider caching the model list or delegating partial-match filtering into the distribution client.
- handlePackageModel only supports config-only (lightweight) repackaging—either extend support for full rebuilds or explicitly error out/document unsupported layer changes.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2025-11-03T08:36:19Z

Hey @ericcurtin, I've posted a new review for you!

It now passes

ilopezluna · 2025-11-03T08:40:45Z

cmd/cli/desktop/desktop.go

+			// Extract the model name without tag part (e.g., from "ai/smollm2:latest" get "ai/smollm2")
+			tagWithoutVersion := tag
+			if idx := strings.LastIndex(tag, ":"); idx != -1 {
+				tagWithoutVersion = tag[:idx]
+			}
+
+			// Get just the name part without organization (e.g., from "ai/smollm2" get "smollm2")
+			namePart := tagWithoutVersion
+			if idx := strings.LastIndex(tagWithoutVersion, "/"); idx != -1 {
+				namePart = tagWithoutVersion[idx+1:]
+			}
+
+			// Check if the ID matches the name part
+			if namePart == id {
+				return m.ID, nil
+			}


I think you can use name.ParseReference(tag) to get same results:

Suggested change

// Extract the model name without tag part (e.g., from "ai/smollm2:latest" get "ai/smollm2")

tagWithoutVersion := tag

if idx := strings.LastIndex(tag, ":"); idx != -1 {

tagWithoutVersion = tag[:idx]

}

// Get just the name part without organization (e.g., from "ai/smollm2" get "smollm2")

namePart := tagWithoutVersion

if idx := strings.LastIndex(tagWithoutVersion, "/"); idx != -1 {

namePart = tagWithoutVersion[idx+1:]

}

// Check if the ID matches the name part

if namePart == id {

return m.ID, nil

}

// Try to parse both the stored tag and the input ID

tagRef, err1 := name.ParseReference(tag)

idRef, err2 := name.ParseReference(id)

if err1 != nil || err2 != nil {

// fallback to string comparison if parsing fails

if tag == id {

return m.ID, nil

}

continue

}

// Compare normalized names (without tag or digest)

if tagRef.Context().Name() == idRef.Context().Name() {

return m.ID, nil

}

Actually I think this change also fixes the case were name and tag are specified but not the repository:

MODEL_RUNNER_HOST=http://localhost:13434 docker model inspect ai/smollm2:latest { "id": "sha256:354bf30d0aa3af413d2aa5ae4f23c66d78980072d1e07a5b0d776e9606a2f0b9", "tags": [ "ai/smollm2:latest" ], "created": 1742816981, "config": { "format": "gguf", "quantization": "IQ2_XXS/Q4_K_M", "parameters": "361.82 M", "architecture": "llama", "size": "256.35 MiB" } } MODEL_RUNNER_HOST=http://localhost:13434 docker model inspect smollm2:latest Failed to get model smollm2:latest: invalid model name: smollm2:latest ```

ilopezluna · 2025-11-03T08:49:12Z

pkg/inference/models/manager.go

 	} else {
-		apiModel, err = getLocalModel(m, modelName)
+		// For local lookups, first try without normalization (as ID), then with normalization
+		apiModel, err = getLocalModel(m, modelRef)


Adding these extra steps to handle different model retrieval cases introduces additional complexity and some performance impact (even if minimal).

In my opinion, the benefit of not having to specify the namespace does not justify these extra steps across all commands.

I understand that the feature is already implemented, so I’m fine with keeping it as it is for now. However, we should aim to improve this feature eventually.

ilopezluna · 2025-11-03T08:59:51Z

FYI, I’ve opened an issue in go-containerregistry to request support for customizing the default repository. I don’t have high expectations for a quick response, I’ve had a PR open there for quite a while, but it would be nice to see this implemented upstream.

sourcery-ai bot reviewed Oct 28, 2025

View reviewed changes

gemini-code-assist bot reviewed Oct 28, 2025

View reviewed changes

cmd/cli/desktop/desktop.go Show resolved Hide resolved

ericcurtin force-pushed the model-cli-rm-sha branch from c2d5b36 to 90163b4 Compare October 28, 2025 15:54

ilopezluna reviewed Oct 29, 2025

View reviewed changes

ericcurtin changed the title ~~Fix model-cli rm sha256~~ Move more normalization into the model-runner backend Oct 29, 2025

ericcurtin force-pushed the model-cli-rm-sha branch from 90163b4 to 61e5f4e Compare October 29, 2025 17:41

ericcurtin force-pushed the model-cli-rm-sha branch from 61e5f4e to 8e78243 Compare October 29, 2025 17:56

ericcurtin force-pushed the model-cli-rm-sha branch 3 times, most recently from 9832fe2 to e4338c3 Compare October 30, 2025 12:30

ericcurtin force-pushed the model-cli-rm-sha branch from e4338c3 to c958a3f Compare November 1, 2025 16:39

sourcery-ai bot requested changes Nov 1, 2025

View reviewed changes

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

github-advanced-security bot found potential problems Nov 1, 2025

View reviewed changes

pkg/inference/models/manager.go Fixed Show fixed Hide fixed

ericcurtin force-pushed the model-cli-rm-sha branch from c958a3f to ea07b4a Compare November 1, 2025 17:02

sourcery-ai bot requested changes Nov 1, 2025

View reviewed changes

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

ericcurtin force-pushed the model-cli-rm-sha branch from ea07b4a to aaf52a6 Compare November 1, 2025 17:04

sourcery-ai bot previously requested changes Nov 1, 2025

View reviewed changes

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

pkg/inference/models/manager.go Outdated Show resolved Hide resolved

Move more normalization into the model-runner backend

dbc3cbb

model-runner daemon is more well equipped to identify between model name and model id. Signed-off-by: Eric Curtin <[email protected]>

ericcurtin force-pushed the model-cli-rm-sha branch from aaf52a6 to dbc3cbb Compare November 1, 2025 19:07

sourcery-ai bot reviewed Nov 3, 2025

View reviewed changes

ilopezluna approved these changes Nov 3, 2025

View reviewed changes

ericcurtin merged commit 61dec0e into main Nov 3, 2025
9 checks passed

ericcurtin deleted the model-cli-rm-sha branch November 3, 2025 09:04

Move more normalization into the model-runner backend #335

Move more normalization into the model-runner backend #335

Uh oh!

Conversation

ericcurtin commented Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot commented Oct 28, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ilopezluna left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ericcurtin commented Oct 29, 2025

Uh oh!

doringeman commented Oct 30, 2025

Uh oh!

doringeman commented Oct 30, 2025

Uh oh!

ericcurtin commented Oct 30, 2025

Uh oh!

ericcurtin commented Oct 30, 2025

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ericcurtin commented Nov 1, 2025

Uh oh!

sourcery-ai bot commented Nov 1, 2025

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sourcery-ai bot commented Nov 1, 2025

Uh oh!

ericcurtin commented Nov 1, 2025

Uh oh!

ericcurtin commented Nov 3, 2025

Uh oh!

sourcery-ai bot commented Nov 3, 2025

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

sourcery-ai bot commented Nov 3, 2025

Uh oh!

ilopezluna Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

ilopezluna Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

ilopezluna Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

ilopezluna commented Nov 3, 2025

Uh oh!

Uh oh!

Reviewers

ericcurtin commented Oct 28, 2025 •

edited

Loading

ilopezluna left a comment •

edited

Loading