Conversation

hiento09 (Contributor)
This pull request updates the container startup command in the apps/jan-inference-model/Dockerfile to enhance model serving performance and resource management. The new command introduces several runtime options to better control memory usage, execution mode, and model input size.

Model serving configuration improvements (a sketch of the combined command follows this list):

  • Added --gpu-memory-utilization 0.65 to cap GPU memory usage at 65%, helping prevent out-of-memory errors and leaving headroom for other processes.
  • Enabled --enforce-eager to force eager execution mode, which can simplify debugging and improve compatibility, usually at some cost to throughput.
  • Set --max_model_len 32768 to allow a context length of up to 32,768 tokens (prompt plus generated output), supporting larger prompts or documents.
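
These flags match vLLM's OpenAI-compatible server CLI, so the combined CMD would look roughly like the sketch below. This is a minimal illustration under that assumption, not the actual Dockerfile from this PR: the base image, model name, and port are placeholders.

```dockerfile
# Hypothetical sketch only; the real apps/jan-inference-model/Dockerfile is not shown in this thread.
# Assumes the official vLLM OpenAI-compatible server image as the base.
FROM vllm/vllm-openai:latest

# The vllm/vllm-openai image sets the API server as its ENTRYPOINT,
# so CMD supplies only the arguments. Model name and port are placeholders.
CMD ["--model", "org/model-name", \
     "--port", "8000", \
     "--gpu-memory-utilization", "0.65", \
     "--enforce-eager", \
     "--max_model_len", "32768"]
```

On a base image without that entrypoint, the same flags would be passed to an explicit command instead, e.g. `python3 -m vllm.entrypoints.openai.api_server` with identical arguments.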

@hiento09 self-assigned this Aug 12, 2025

@Minh141120 (Member) commented:

LGTM!

@hiento09 merged commit 1518fb0 into dev Aug 12, 2025
1 check passed
@hiento09 deleted the fix/dockerfile-inference-model branch August 12, 2025 03:29
jjchen01 added a commit that referenced this pull request Aug 12, 2025
* feat: add ci workflows (#18)

* chore: update jan inference model Dockerfile cmd (#22)

---------

Co-authored-by: hiento09 <[email protected]>