Hello team!
The Problem:
I've been trying to load the SillyTavern documentation into Open WebUI's knowledge base (RAG), but the current structure makes it difficult for LLMs to parse context correctly due to:
- Nested directories losing context when flattened.
- Duplicate filenames (e.g., multiple
index.md files).
- YAML frontmatter and Docusaurus tags (!!!) adding token noise.
The Suggestion:
It would be great to have a "clean" version of the SillyTavern docs for RAG users.
Proof of Concept:
I whacked together a workflow that flattens filenames (to e.g., SillyTavern_Installation_index.md) and strips the metadata using sed: Pavdig/SillyTavern-Docs-RAG.
Maybe something similar could be made, but officially so that users could simply download the ZIP?
Hello team!
The Problem:
I've been trying to load the SillyTavern documentation into Open WebUI's knowledge base (RAG), but the current structure makes it difficult for LLMs to parse context correctly due to:
index.mdfiles).The Suggestion:
It would be great to have a "clean" version of the SillyTavern docs for RAG users.
Proof of Concept:
I whacked together a workflow that flattens filenames (to e.g.,
SillyTavern_Installation_index.md) and strips the metadata usingsed: Pavdig/SillyTavern-Docs-RAG.Maybe something similar could be made, but officially so that users could simply download the ZIP?