feat: allow for more customization around embedding model #157
mergify[bot] merged 1 commit into opendatahub-io:main
holding for signoff from RAG and TrustyAI teams
Walkthrough: Environment variable references updated.
Estimated code review effort: 3 (Moderate), ~20 minutes
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
- distribution/README.md (1 hunk)
- distribution/run.yaml (2 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: build-test-push (linux/amd64)
- GitHub Check: Summary
🔇 Additional comments (3)
distribution/README.md (1)
17-17: Documentation update correctly reflects the environment variable rename. The change aligns with the run.yaml updates that use TRUSTYAI_EMBEDDING_MODEL for TrustyAI eval configurations.
distribution/run.yaml (2)
133-145: Verify eval provider configuration changes. Lines 133-134 appear to be new fields added to the trustyai_lmeval provider config. Please confirm:
- Are use_k8s and base_url new fields required by the updated provider version?
- Are the environment variable defaults correct?
Additionally, the changes to lines 135 and 144 correctly switch from EMBEDDING_MODEL to TRUSTYAI_EMBEDDING_MODEL, aligning with the PR objective and README update.
281-285: Verify embedding dimension and model_id defaults are appropriate. The externalization of embedding model metadata (lines 281-284) looks good and aligns with the PR objective. The defaults appear sensible:
- EMBEDDING_DIMENSION:=768 (standard dimension)
- EMBEDDING_MODEL:=granite-embedding-125m (matches previous hardcoded value)
- EMBEDDING_PROVIDER_MODEL_ID:=ibm-granite/granite-embedding-125m-english (matches previous hardcoded value)
However, please confirm that these defaults match the previously hardcoded values and are appropriate for the distribution's use case.
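To make the "previously hardcoded values are now defaults" behavior concrete, here is a minimal Python sketch of how a `${env.NAME:=default}` reference could resolve. It is not the resolver the distribution actually uses: the `resolve` and `resolve_fields` helpers, the regex, and the mapping of variables to field names are assumptions for illustration, and the sketch treats only an unset variable (not an empty one) as missing.

```python
import re

# Hypothetical pattern for ${env.NAME:=default}; the default may be empty.
_ENV_REF = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*):=([^}]*)\}")


def resolve(template: str, environ: dict[str, str]) -> str:
    """Replace each ${env.NAME:=default} with environ[NAME], or the default if NAME is unset."""

    def _substitute(match: re.Match) -> str:
        name, default = match.group(1), match.group(2)
        return environ.get(name, default)

    return _ENV_REF.sub(_substitute, template)


# Defaults quoted in the review comment above; the field names on the left
# are an assumed mapping, not copied from run.yaml.
FIELDS = {
    "embedding_dimension": "${env.EMBEDDING_DIMENSION:=768}",
    "model_id": "${env.EMBEDDING_MODEL:=granite-embedding-125m}",
    "provider_model_id": "${env.EMBEDDING_PROVIDER_MODEL_ID:=ibm-granite/granite-embedding-125m-english}",
}


def resolve_fields(environ: dict[str, str]) -> dict[str, str]:
    """Resolve every field template against the given environment mapping."""
    return {key: resolve(value, environ) for key, value in FIELDS.items()}


if __name__ == "__main__":
    # With nothing set, every field falls back to its default.
    print(resolve_fields({}))
    # Setting one variable overrides only that field.
    print(resolve_fields({"EMBEDDING_MODEL": "my-custom-embedding"}))
```

Under these assumptions, a reference with an empty default such as `${env.TRUSTYAI_EMBEDDING_MODEL:=}` resolves to an empty string when the variable is unset, which is why the TrustyAI setting stays blank unless it is explicitly provided.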
skamenan7 left a comment
LGTM, except for the issue CodeRabbit pointed out in "Comment on lines R281 to R284"
8dbcb31 to 2c1eb85
jgarciao left a comment
LGTM. Let's wait until Francisco and/or Bill answer about the question of the embedding model name
/lgtm
LGTM. I don't wanna approve till the question above has been resolved, otherwise the bot will merge it 👍🏽
Bot will not merge since I have the |
2c1eb85 to d2ac45e
make embedding dimension, model_id, and provider_model_id configurable fields. prev hardcoded values are now defaults.
update TrustyAI config to use TRUSTYAI_EMBEDDING_MODEL instead of EMBEDDING_MODEL
Signed-off-by: Nathan Weinberg <nweinber@redhat.com>
d2ac45e to 7b5e788
  module: llama_stack_provider_ragas.remote
  config:
-   embedding_model: ${env.EMBEDDING_MODEL:=}
+   embedding_model: ${env.TRUSTYAI_EMBEDDING_MODEL:=}
how is this model going to be served? i don't see it being optionally registered down below, is it expected?
This was not happening before so AFAIK yes this is expected
holding for the comments.
What does this PR do?
Make the embedding dimension, model_id, and provider_model_id configurable fields. The previously hardcoded values are now the defaults.
Update the TrustyAI config to use TRUSTYAI_EMBEDDING_MODEL instead of EMBEDDING_MODEL.