Fix #4168: Support Claude V3 response format in DefaultLLMImpl #4275
Conversation
Hi @mingshl @rithin-pullela-aws, could you please review my PR and suggest changes? Thanks :)
Oh great, this is going to use the same interface string, and it will auto-detect these two formats and parse accordingly. Can you also add ITs? This is the best way we can monitor and keep track of when the model interface fails. I ran your CIs, and I also added the resolved issues and linked the issue in the PR description.
Hi @mingshl @akolarkunnu @rithin-pullela-aws, please review the code changes and run the test CIs. I also covered the test coverage in the last PR.
Thanks for raising the PR! Added a few comments.
Hi @rithin-pullela-aws, sorry to bother you again, but if you have time, please review the changes and suggest edits; it will keep me occupied working on them.
Small comment about the IT:
It should be simple to fix the existing IT instead of writing a new one.
We can remove the BM25_SEARCH_REQUEST_WITH_CONVO_WITH_LLM_RESPONSE_TEMPLATE
from plugin/src/test/java/org/opensearch/ml/rest/RestMLRAGSearchProcessorIT.java
so the IT hits the code change.
private Map<String, Object> invokeBedrockInference(Map<String, Object> mockResponse) throws Exception {
    // Create DefaultLlmImpl and mock ML client
    DefaultLlmImpl connector = new DefaultLlmImpl("model_id", null); // Use getClient() from MLCommonsRestTestCase
This looks more like a unit test than an integration test.
Generally in Integration tests, we make an actual call to the LLM and verify the response is as expected.
It should be something like:
GET /<index_name>/_search?search_pipeline=rag_pipeline
{
"query": {
"match": {
"text": "Abraham Lincoln"
}
},
"ext": {
"generative_qa_parameters": {
"llm_model": "bedrock/anthropic-claude",
"llm_question": "who is lincoln",
"system_prompt": "null",
"user_instructions": "null",
"context_size": 5,
"message_size": 5,
"timeout": 60
}
}
}
and we need to verify this in the ITs.
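For the verification step, the IT could assert on the answer the RAG processor returns in the search response. A sketch of the expected response shape (the answer text is illustrative; the `retrieval_augmented_generation` ext key is the processor's standard output location):

```
{
  "hits": { ... },
  "ext": {
    "retrieval_augmented_generation": {
      "answer": "Abraham Lincoln was the 16th president of the United States."
    }
  }
}
```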
This was being tested previously in the ITs, but we added an llm_response_field to get it working.
In this PR: dc8403f#diff-413a5184ffbe5b2e3f86084003df503a3b1fb86ec76ecf81ecb28ba213d67bca a new llm_response_field
was added to solve the issue. We can just undo those changes in this PR to check whether the ITs pass after this change.
Now that the code handles it, we do not need this template: https://github.com/opensearch-project/ml-commons/blob/main/plugin/src/test/java/org/opensearch/ml/rest/RestMLRAGSearchProcessorIT.java#L473C33-L492
I believe that instead of these changes, we can update the other ITs.
This PR updates the DefaultLlmImpl class to properly parse responses from the Claude V3 family of Bedrock models. Previously, only the V2 response format (completion field) was supported. With this change, the parser now supports the V3 format (content -> text) while maintaining backward compatibility with V2.
Changes Made
Detects "content" field for Claude V3 responses.
Extracts text from the first element of the content list.
Adds extracted text to answers.
Added a new unit test in DefaultLlmImplTests.java:
Mocks a V3-style Bedrock response and validates output parsing.
Reformatted test files to satisfy Spotless formatting rules.
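The detection logic listed above can be sketched roughly as follows (a minimal standalone illustration of the format check, assuming the response body has already been deserialized into a Map; this is not the actual DefaultLlmImpl code, and the class and method names here are hypothetical):

```java
import java.util.List;
import java.util.Map;

// Hypothetical standalone sketch of the V2/V3 format detection described in
// this PR; the real logic lives inside DefaultLlmImpl.
public class ClaudeResponseParser {

    @SuppressWarnings("unchecked")
    public static String extractAnswer(Map<String, Object> dataMap) {
        if (dataMap.containsKey("completion")) {
            // Claude V2: answer is a plain string under "completion"
            return (String) dataMap.get("completion");
        }
        if (dataMap.containsKey("content")) {
            // Claude V3 (Messages API): answer is under content[0].text
            List<Map<String, Object>> content = (List<Map<String, Object>>) dataMap.get("content");
            return (String) content.get(0).get("text");
        }
        throw new IllegalArgumentException("Unrecognized Claude response format");
    }
}
```

Because the V2 check runs first, existing V2 responses keep working unchanged, which is the backward-compatibility property the description calls out.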
Why this change is needed
Claude V2 models are deprecated; V3 is the current standard.
Without this fix, requests to "bedrock/anthropic-claude" with V3 models fail to extract answers, causing RAG pipelines to break.
Ensures users don’t need to pass llmResponseField manually for default Bedrock Claude models.
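For context, the two Bedrock response shapes differ roughly as follows (abbreviated sketches with illustrative answer text; V2 returns the answer in a top-level completion field, while the V3 Messages API nests it inside a content list):

```
// Claude V2: answer under "completion"
{ "completion": "Abraham Lincoln was the 16th president." }

// Claude V3 (Messages API): answer under content[0].text
{
  "content": [
    { "type": "text", "text": "Abraham Lincoln was the 16th president." }
  ]
}
```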
How to test
Run all existing unit tests:
./gradlew clean build
Resolves #4168.