fix(core): use parsed mime_type for base64 file blocks in openai translator by anmolg1997 · Pull Request #36940 · langchain-ai/langchain

Anmol Jaiswal (anmolg1997) · 2026-04-22T07:21:25Z

The file branch of _convert_openai_format_to_data_block hard-codes mime_type="application/pdf", while the image branch right above it uses parsed["mime_type"] from the data URI. So a CSV sent via the OpenAI file block shape comes out with mime_type="application/pdf" in the v1 content block.

One-line change to read it off the parsed data URI, same as the image branch. _parse_data_uri returns None when the mime_type is missing, so parsed["mime_type"] is always set inside this branch.

Test added with a CSV and a text/plain data URI. Existing tests still pass since they use data:application/pdf;....

ccurme (ccurme) · 2026-04-26T19:33:08Z

Closing pending discussion on the issue.

codspeed-hq · 2026-04-29T01:12:20Z

Merging this PR will not alter performance

✅ 13 untouched benchmarks
⏩ 2 skipped benchmarks¹

_{Comparing anmolg1997:fix/openai-file-block-mime-type (98e9c04) with master (1662347)²}

2 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩
No successful run was found on master (85a5a04) during the generation of this report, so 1662347 was used instead as the comparison base. There might be some changes unrelated to this pull request in this report. ↩

Anmol Jaiswal (anmolg1997) · 2026-04-29T04:52:16Z

Pushed an update following the discussion on #36939.

Kept the original mime_type fix in _convert_openai_format_to_data_block, and added a small guard in convert_to_openai_data_block(api="chat/completions") that raises a clear ValueError for non-PDF base64 file blocks instead of letting OpenAI return a raw 400. Tests cover both behaviors. Responses API path is unchanged.

… file blocks The base64 file branch in `_convert_openai_format_to_data_block` was hard-coding `mime_type="application/pdf"`, while the image branch right above used `parsed["mime_type"]`. So a non-PDF data URI (e.g. CSV) passed via the OpenAI Chat Completions file block shape got silently relabeled as PDF in the v1 content block, which is wrong for non-OpenAI chat models that consume v1 blocks via shared `_normalize_messages`. Changes: 1. Use `parsed["mime_type"]` in the base64 file branch, matching the image branch right above it. 2. In `convert_to_openai_data_block(api="chat/completions")`, raise a clear `ValueError` when MIME is not `application/pdf`, pointing the caller to the Responses API. This keeps Chat Completions semantics intact and fails fast with a friendlier error than OpenAI's raw 400. 3. Regression tests for both behaviors. Fixes langchain-ai#36939

Anmol Jaiswal (anmolg1997) · 2026-05-08T05:12:14Z

Friendly ping. CI is green and the updated fix is in, per the discussion on #36939. Happy to tweak anything if needed before review.

Anmol Jaiswal (anmolg1997) requested a review from Eugene Yurtsev (eyurtsev) as a code owner April 22, 2026 07:21

github-actions Bot added core `langchain-core` package issues & PRs size: XS < 50 LOC labels Apr 22, 2026

langchain-automated-triage Bot added new-contributor external labels Apr 22, 2026

github-actions Bot added the missing-issue-link label Apr 22, 2026

This comment has been minimized.

Sign in to view

github-actions Bot closed this Apr 22, 2026

Anmol Jaiswal (anmolg1997) mentioned this pull request Apr 24, 2026

core: _convert_openai_format_to_data_block hard-codes mime_type on base64 file blocks #36939

Open

29 tasks

github-actions Bot reopened this Apr 26, 2026

github-actions Bot removed the missing-issue-link label Apr 26, 2026

ccurme (ccurme) changed the title ~~core[patch]: use parsed mime_type for base64 file blocks in openai translator~~ fix(core): use parsed mime_type for base64 file blocks in openai translator Apr 26, 2026

github-actions Bot added the fix For PRs that implement a fix label Apr 26, 2026

ccurme (ccurme) closed this Apr 26, 2026

ccurme (ccurme) reopened this Apr 29, 2026

Anmol Jaiswal (anmolg1997) force-pushed the fix/openai-file-block-mime-type branch from e81839c to 805b11c Compare April 29, 2026 04:51

github-actions Bot added size: S 50-199 LOC and removed size: XS < 50 LOC labels Apr 29, 2026

Anmol Jaiswal (anmolg1997) force-pushed the fix/openai-file-block-mime-type branch from 805b11c to 58f0d26 Compare April 29, 2026 04:53

Merge branch 'master' into fix/openai-file-block-mime-type

98e9c04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(core): use parsed mime_type for base64 file blocks in openai translator#36940

fix(core): use parsed mime_type for base64 file blocks in openai translator#36940
Anmol Jaiswal (anmolg1997) wants to merge 2 commits into
langchain-ai:masterfrom
anmolg1997:fix/openai-file-block-mime-type

Anmol Jaiswal (anmolg1997) commented Apr 22, 2026

Uh oh!

This comment has been minimized.

ccurme (ccurme) commented Apr 26, 2026

Uh oh!

codspeed-hq Bot commented Apr 29, 2026 •

edited

Loading

Uh oh!

Anmol Jaiswal (anmolg1997) commented Apr 29, 2026

Uh oh!

Anmol Jaiswal (anmolg1997) commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Anmol Jaiswal (anmolg1997) commented Apr 22, 2026

Uh oh!

This comment has been minimized.

ccurme (ccurme) commented Apr 26, 2026

Uh oh!

codspeed-hq Bot commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Footnotes

Uh oh!

Anmol Jaiswal (anmolg1997) commented Apr 29, 2026

Uh oh!

Anmol Jaiswal (anmolg1997) commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codspeed-hq Bot commented Apr 29, 2026 •

edited

Loading