feat: Use structured outputs for more control over response #5195

dbz10 · 2025-03-09T13:32:54Z

What this PR does / why we need it:

Hi,

This pull request leverages structured outputs to provide more granular control + guarantees over the responses from LLM providers. In a few of the usages in this project we expect a specific json object back. While mealie is already providing the expected json schema in the prompt this PR goes one step further and enforces that this schema is (allegedly) guaranteed to be respected in the response.

In theory this makes LLM calling more reliable and possibly makes prompting a bit more ergonomic in the future as well as pleading with LLMs to please for the love of all that is holy return json in the desired format or else my whole family will die should be less necessary.

Which issue(s) this PR fixes:

No specific related issue that I saw.

Special notes for your reviewer:

I tried to join the discord server but the invite was invalid. Happy to hear if I should have communicated prior to submitting this PR somewhere.

Testing

I ran unit test using task py:check in the devcontainer and got

1230 passed, 18 skipped, 153 warnings in 3069.26s (0:51:09)

I don't believe the tests actually executed any calls to OpenAI, especially since I didn't provide my API key, so I'm happy to take feedback or suggestions on additional testing I can do to validate this PR. I'm just getting started with mealie and don't have an extensive database of recipes nor URLs, etc, to test it on so would appreciate any suggestions how to go about that.

michael-genson · 2025-03-13T04:33:28Z

Looks good! I will try to find some time this weekend to toy around with it and make sure it works as expected, but based on my understanding of how structured outputs works this should work great.

I don't believe the tests actually executed any calls to OpenAI

This is correct, we mock the call to OpenAI (since otherwise, as you said, you'd need to provide a key). I'll test manually with my key when I get the time (and welcome others to do the same!).

michael-genson · 2025-03-13T04:34:46Z

I tried to join the discord server but the invite was invalid

Where was the invalid invite? We should get that updated. Here's a new one: https://discord.gg/qA9zCWB5ay

dbz10 · 2025-03-16T13:30:00Z

I tried to join the discord server but the invite was invalid

Where was the invalid invite? We should get that updated. Here's a new one: https://discord.gg/qA9zCWB5ay

Ok sorry in retrospect I think this was my own issue - web browser version of Discord had gotten logged out and all it said was 'whoops unable to accept invite'. after logging in I was able to join via the link.

dbz10 · 2025-03-16T13:37:11Z

Looks good! I will try to find some time this weekend to toy around with it and make sure it works as expected, but based on my understanding of how structured outputs works this should work great.

I don't believe the tests actually executed any calls to OpenAI

This is correct, we mock the call to OpenAI (since otherwise, as you said, you'd need to provide a key). I'll test manually with my key when I get the time (and welcome others to do the same!).

Yeah I think I can put in a bit more effort here and test it myself as well since I realized it's not a pre-requisite to already have user data in mealie to do so 😅

Just to confirm my understanding of where openai functionality is currently used so that I know what functionality to test out -

scrape a recipe from pointing at a url (scraping + parsing
generate a recipe from an image

is there anything else I should test? I see there's a prompt for parsing ingredients but it wasnt super obvious if this is called as part of the previous two, or if there's an independent path for calling that

dbz10 · 2025-03-16T14:14:21Z

Ok I need to do some troubleshooting it seems. Sorry for the premature PR. Will work on it a bit
mealie | TypeError: You tried to pass a `BaseModel` class to `chat.completions.create()`; You must use `beta.chat.completions.parse()` instead

…mpletions parse

dbz10 · 2025-03-16T14:55:41Z

Ok first of all I apologize for the half baked initial PR.
After digging into it a bit more I found that the structured output format is not compatible with optional arguments (see 'all fields must be required' here) meaning that if we want to use this, we need to remove some of the default values in the OpenAI{x} pydantic models.

That being said, with some additional changes + using the beta chat.completions.parse API, I was able to run recipe generation from a screenshot of a recipe.

@michael-genson lmk what you think, whether you think it's still worth it to try using structured outputs given that the blast radius of the changes is a bit larger than what I originally expected.

death2all110 · 2025-04-25T02:35:43Z

Any word on this?

michael-genson · 2025-04-25T04:53:03Z

Apologies, been meaning to review this but have been busy with life and such. We've also been focused on the Nuxt 3 upgrade, which is blocking all frontend PRs, however since this only touches the backend we aren't strictly blocked by the Nuxt upgrade.

death2all110 · 2025-04-26T00:11:21Z

@michael-genson No worries man. Thank you.

Speaking of Nuxt, there is an issue with going back after going into a recipe resets your scroll position to the top: #5350

feat: Use structured outputs for more control over response

9fb5f52

github-actions bot added the feature label Mar 9, 2025

michael-genson self-assigned this Mar 13, 2025

dbz10 marked this pull request as draft March 16, 2025 14:14

fix: Make pydantic models compatible with openai and use beta chat co…

77f52dd

…mpletions parse

dbz10 marked this pull request as ready for review March 16, 2025 15:02

Merge branch 'mealie-next' into feat/openai-structured-outputs

b32af68

michael-genson mentioned this pull request Mar 30, 2025

[BUG] - Mealie failing to convert image to recipe when using ollama #5292

Closed

6 tasks

Merge branch 'mealie-next' into feat/openai-structured-outputs

8f5bff2

dbz10 closed this Jun 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: Use structured outputs for more control over response #5195

feat: Use structured outputs for more control over response #5195

Uh oh!

dbz10 commented Mar 9, 2025 •

edited

Loading

Uh oh!

michael-genson commented Mar 13, 2025 •

edited

Loading

Uh oh!

michael-genson commented Mar 13, 2025

Uh oh!

dbz10 commented Mar 16, 2025

Uh oh!

dbz10 commented Mar 16, 2025

Uh oh!

dbz10 commented Mar 16, 2025 •

edited

Loading

Uh oh!

dbz10 commented Mar 16, 2025 •

edited

Loading

Uh oh!

death2all110 commented Apr 25, 2025

Uh oh!

michael-genson commented Apr 25, 2025

Uh oh!

death2all110 commented Apr 26, 2025

Uh oh!

Uh oh!

Uh oh!

feat: Use structured outputs for more control over response #5195

feat: Use structured outputs for more control over response #5195

Uh oh!

Conversation

dbz10 commented Mar 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Testing

Uh oh!

michael-genson commented Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

michael-genson commented Mar 13, 2025

Uh oh!

dbz10 commented Mar 16, 2025

Uh oh!

dbz10 commented Mar 16, 2025

Uh oh!

dbz10 commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbz10 commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

death2all110 commented Apr 25, 2025

Uh oh!

michael-genson commented Apr 25, 2025

Uh oh!

death2all110 commented Apr 26, 2025

Uh oh!

Uh oh!

dbz10 commented Mar 9, 2025 •

edited

Loading

michael-genson commented Mar 13, 2025 •

edited

Loading

dbz10 commented Mar 16, 2025 •

edited

Loading

dbz10 commented Mar 16, 2025 •

edited

Loading