Improvements for Multimodal (issue #133) and other minor fixes by danielferr85 · Pull Request #134 · NotPunchnox/rkllama

danielferr85 · 2026-03-19T03:44:32Z

Reduce the retention of prompt cache files from 7 days to 1 day (these files use a lot of disk space).
Improve image resizing for better recognition in multimodal models.
Restore clearing the cache after inference without affecting the user's response time (the cleanup runs after inference and is not waited on).
Fix the duplicate chat template (tokenizer + rkllm) passed to multimodal models. Now only the one generated by the tokenizer is used.
Fix closing the loop after an aborted inference.
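
For illustration only, here is a minimal sketch of the first and third items: pruning cache files older than one day, and starting that cleanup without making the caller wait. It is not the code from this PR. The names `prune_cache`, `prune_cache_in_background`, and `CACHE_MAX_AGE_SECONDS` are hypothetical, and the use of file modification time and a daemon thread are assumptions.

```python
import threading
import time
from pathlib import Path

# Assumed retention window: 1 day, matching the description above.
CACHE_MAX_AGE_SECONDS = 24 * 60 * 60


def prune_cache(cache_dir, max_age_seconds=CACHE_MAX_AGE_SECONDS, now=None):
    """Delete regular files in cache_dir whose mtime is older than max_age_seconds.

    Returns the sorted names of the files that were removed.
    """
    now = time.time() if now is None else now
    removed = []
    for entry in Path(cache_dir).iterdir():
        if entry.is_file() and now - entry.stat().st_mtime > max_age_seconds:
            entry.unlink()
            removed.append(entry.name)
    return sorted(removed)


def prune_cache_in_background(cache_dir):
    """Start the cleanup on a daemon thread and return immediately.

    The caller can send the inference response without waiting for the cleanup.
    """
    worker = threading.Thread(target=prune_cache, args=(cache_dir,), daemon=True)
    worker.start()
    return worker
```

A request handler could call `prune_cache_in_background(cache_dir)` right after the response has been produced. The returned thread is only needed if something (a test, for example) wants to `join()` it.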

Improvements for Multimodal (issue NotPunchnox#133) and other minor fixes

Improvements for Multimodal and other minor fixes

0f22e52

danielferr85 mentioned this pull request Mar 19, 2026

process hangs while use multimodel #133

Open

NotPunchnox approved these changes Mar 20, 2026

View reviewed changes

NotPunchnox merged commit 20fe932 into NotPunchnox:main Mar 20, 2026
1 of 2 checks passed

jaylfc added a commit to jaylfc/rkllama that referenced this pull request Apr 5, 2026

Merge pull request NotPunchnox#134 from danielferr85/main

c40219e

Improvements for Multimodal (issue NotPunchnox#133) and other minor fixes

NotPunchnox merged 1 commit into NotPunchnox:main from danielferr85:main
