Skip to content

Suggestion: add WFGY 16 Problem Map as a RAG debugging checklist #3100

@onestardao

Description

@onestardao

Hi, thanks a lot for open sourcing baml and for the work on bringing stronger "engineering" into prompt engineering.

I wanted to share one external resource that might pair well with baml users who are building real RAG and multi step workflows.

What it is

The map defines 16 concrete failure modes such as:

  • ingestion and parsing drift
  • chunking and context contract mismatch
  • vector store fragmentation and metric problems
  • retrieval order issues and prompt injection
  • evaluator mismatch for long horizon chains

For each failure type there is a short description, symptoms that users can observe, and a minimal fix strategy. The idea is to treat it as a checklist you can run through when a RAG or agentic workflow behaves in strange ways.

Why I think it might fit baml

baml already helps enforce stronger structure and contracts at the code and prompt level. A lot of real world failures still come from data and retrieval issues around the model. The 16 Problem Map is designed to complement that by giving teams a shared language for the typical RAG and vector store breakpoints.

This could be useful for:

  • teams using baml to design production agents and wanting a more systematic debug playbook
  • docs readers who search for "why is my RAG broken" and need a high level map of where to look

Possible integration

Totally up to you, but some low friction options might be:

  • a short bullet under a "Further reading" or "RAG debugging" section in the docs
  • one sentence mention in any place where you already talk about failure modes, reliability or context management

If this feels out of scope for the project that is completely fine. Either way, thank you for the work on baml and for pushing structured prompt engineering forward.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions