-
Notifications
You must be signed in to change notification settings - Fork 63
feat(llama_stack): centralize vector/RAG config and shared helpers #1266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
jgarciao
merged 6 commits into
opendatahub-io:main
from
jgarciao:refactor-vector-stores-add-dataset
Mar 20, 2026
Merged
Changes from 5 commits
Commits
Show all changes
6 commits
Select commit
Hold shift + click to select a range
c3b8c98
feat(llama_stack): centralize vector/RAG config and shared helpers
jgarciao 0270be8
Merge branch 'main' into refactor-vector-stores-add-dataset
jgarciao c94f91d
fix: enhance doc_sources handling in vector_store fixture
jgarciao 5b230b0
fix: delete unused constant
jgarciao f52b283
feat: add README.md in the dataset folder
jgarciao fbb2881
fix: prevent symlink/traversal attacks
jgarciao File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,17 @@ | ||
| # Llama Stack test fixtures (internal) | ||
|
|
||
| These files are for **internal Open Data Hub / OpenShift AI integration tests** only. We use them to hit **[Llama Stack](https://github.com/meta-llama/llama-stack) vector store APIs**—think ingest, indexing, search, and the plumbing around that—not as a shipped dataset or for model training. | ||
|
|
||
| ## IBM finance PDFs (`corpus/finance/`) | ||
|
|
||
| The PDFs here are IBM **quarterly earnings press releases** (the same material IBM posts for investors). If you need to replace or refresh them, download the official PDFs from IBM’s site: | ||
|
|
||
| [Quarterly earnings announcements](https://www.ibm.com/investor/financial-reporting/quarterly-earnings) (choose year and quarter, then open the press release PDF). | ||
|
|
||
| ## PDF edge cases (`corpus/pdf-testing/`) | ||
|
|
||
| This folder is for **weird PDFs on purpose**: password-protected files, digitally signed ones (e.g. PAdES), and similar cases so we can test how ingestion and parsers behave when the file is not a plain “print to PDF” document. | ||
|
|
||
| ## Small print | ||
|
|
||
| Not for external distribution as a “dataset.” PDFs stay under their publishers’ terms; don’t reuse them outside this test context without checking those terms. |
Binary file added
BIN
+114 KB
tests/llama_stack/dataset/corpus/finance/ibm-1q25-earnings-press-release-unencrypted.pdf
Binary file not shown.
Binary file added
BIN
+130 KB
tests/llama_stack/dataset/corpus/finance/ibm-2q25-earnings-press-release-unencrypted.pdf
Binary file not shown.
Binary file added
BIN
+132 KB
tests/llama_stack/dataset/corpus/finance/ibm-3q25-earnings-press-release-unencrypted.pdf
Binary file not shown.
Binary file added
BIN
+130 KB
tests/llama_stack/dataset/corpus/finance/ibm-4q25-earnings-press-release-unencrypted.pdf
Binary file not shown.
Binary file added
BIN
+272 KB
tests/llama_stack/dataset/corpus/pdf-testing/ibm-4q25-press-release-encrypted.pdf
Binary file not shown.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.