fix(security): prevent binary file injection in text upload pipeline by Adar5 · Pull Request #244 · jenkinsci/resources-ai-chatbot-plugin

Adar5 · 2026-03-05T08:08:16Z

Description

Currently, the /upload endpoint relies strictly on the file extension to determine content type. This allows users to bypass validation by renaming binary files (images, compiled executables) to .txt.

When this happens, the backend processes the binary data as text, truncates it, and injects it into the LLM context window. This pollutes the RAG context and wastes LLM tokens/compute.

The Fix

Implemented a byte-level validator in file_service.py that inspects the first 1024 bytes of the uploaded file for null bytes (\x00).

If binary signatures are detected, it fast-fails with a 415 Unsupported Media Type.
File pointers are safely reset via .seek(0) for valid files to ensure no data loss.

Steps to Reproduce the Bug

Rename a .png file to test.txt.
Upload via the Swagger /upload endpoint.
Observe the API returning 200 OK and injecting binary headers into the chat_service.py context pipeline.

Testing

Verified valid .txt files still process correctly.
Verified disguised .png files are cleanly rejected with a 415 error.

sharma-sugurthi · 2026-03-07T02:11:23Z

the binary upload fix and reformulation loop fix are unrelated changes. i think spliting this into two PRs makes review and potential reverts much cleaner. @berviantoleo be clarify on this !!

sharma-sugurthi · 2026-03-07T02:13:08Z

And also content[:1024] only catches null bytes in the first 1KB. a crafted file with a clean ASCII header followed by binary payload would may bypass this. i suggest considering checking the full content @Adar5

…lot bypass

Adar5 · 2026-03-07T04:45:35Z

@sharma-sugurthi Thanks for the thorough review!

Splitting the PRs: Good catch on the commits. I had accidentally included the reformulation commit in this branch's history. I've rebased the branch to drop that commit, so this PR is now strictly scoped to the binary upload fix.

1KB Bypass: You are totally right about the polyglot bypass vulnerability. I've updated the logic to check the full content buffer for null bytes instead of slicing it, which closes that loophole safely.

The branch has been force-updated with both of these changes!

Adar5 requested a review from a team as a code owner March 5, 2026 08:08

berviantoleo added the enhancement For changelog: Minor enhancement. use `major-rfe` for changes to be highlighted label Mar 6, 2026

Adar5 added 4 commits March 7, 2026 10:01

fix(security): prevent binary file injection in text upload pipeline

3fd2ea5

style: fix pylint line length and test suite warnings

914b6d9

style: fix final line length warning on line 278

07f80ca

fix(security): scan full file content for null bytes to prevent polyg…

6fc9971

…lot bypass

Adar5 force-pushed the fix/binary-upload-spoofing branch from 351edb3 to 6fc9971 Compare March 7, 2026 04:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(security): prevent binary file injection in text upload pipeline#244

fix(security): prevent binary file injection in text upload pipeline#244
Adar5 wants to merge 4 commits intojenkinsci:mainfrom
Adar5:fix/binary-upload-spoofing

Adar5 commented Mar 5, 2026 •

edited

Loading

Uh oh!

sharma-sugurthi commented Mar 7, 2026

Uh oh!

sharma-sugurthi commented Mar 7, 2026

Uh oh!

Adar5 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Adar5 commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

The Fix

Steps to Reproduce the Bug

Testing

Uh oh!

sharma-sugurthi commented Mar 7, 2026

Uh oh!

sharma-sugurthi commented Mar 7, 2026

Uh oh!

Adar5 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adar5 commented Mar 5, 2026 •

edited

Loading