feat: Add Nvidia e2e beginner notebook and tool calling notebook #1964

JashG · 2025-04-16T03:28:06Z

What does this PR do?

This PR contains two sets of notebooks that serve as reference material for developers getting started with Llama Stack using the NVIDIA Provider. Developers should be able to execute these notebooks end-to-end, pointing to their NeMo Microservices deployment.

beginner_e2e/: Notebook that walks through a beginner end-to-end workflow that covers creating datasets, running inference, customizing and evaluating models, and running safety checks.
tool_calling/: Notebook that is ported over from the Data Flywheel & Tool Calling notebook that is referenced in the NeMo Microservices docs. I updated the notebook to use the Llama Stack client wherever possible, and added relevant instructions.

Test Plan

Both notebook folders contain READMEs with pre-requisites. To manually test these notebooks, you'll need to have a deployment of the NeMo Microservices Platform and update the config.py file with your deployment's information.
I've run through these notebooks manually end-to-end to verify each step works.

hardikjshah · 2025-04-25T18:22:08Z

Thank you for putting this together, it is quite thorough and gives a pretty comprehensive e2e experience to the user.

Some thoughts --

Maybe we reduce the complexity of these notebooks by reducing some steps. ( for eg. uploading to HF can be done from the get go so that user does not have to worry about that step )
We are trying to showcase both direct calls to NIM as well as Llama Stack , which can create a lot of confusion. For eg. registration of a customized model first requires user to ensure NIM has the model loaded and then they also have to register it with Llama Stack. Seems unnecessary and maybe we can simplify this with a single entry point of Llama Stack and the complexity is hidden for the user.
nit: lets rename the /tmp directory to sample_data or something similar
Registration of a benchmark seems not thought through properly since all params are passed in metadata instead of using the APIs properly. Lets work together on making this cleaner instead of passing an entire bag of params in metadata.

Happy to approve this once the conflicts are resolved and take followups for some of the items above so as to get this in and iterate properly in smaller pieces.

JashG · 2025-04-28T17:18:39Z

@hardikjshah Thanks Hardik for your feedback. These were modeled from existing notebooks we have, but I'm definitely happy to look at how we can simplify these + add a diagram as a follow-up.

Re: point 2

For eg. registration of a customized model first requires user to ensure NIM has the model loaded and then they also have to register it with Llama Stack. Seems unnecessary and maybe we can simplify this with a single entry point of Llama Stack and the complexity is hidden for the user

NIM periodically updates its internal list of models, automatically in the background. To run inference on a customized model with Llama Stack, the user needs to:

Make sure NIM has picked up the model (no manual action needed)
Manually register the model with Llama Stack

Maybe at model registration time (step 2), we first internally check if the model has been registered in NIM before registering it with Llama Stack. Is that sort of what you are suggesting?

JashG · 2025-04-30T16:17:54Z

@hardikjshah FYI I moved out a fix in this PR to its own PR. Otherwise, this PR is ready to merge.

hardikjshah · 2025-05-14T22:07:38Z

@JashG looks good, can you merge the latest changes and look into the tests that are not passing. This looks good from my pov once those are resolved

JashG · 2025-05-19T13:37:26Z

@hardikjshah Thanks Hardik! The test failures seem unrelated - looks like 2 tests failed to start. I've updated the branch and they're passing now.

dglogo · 2025-05-27T22:44:17Z

Anything blocking the merge here? thanks

bbrowning · 2025-06-13T19:04:40Z

This looks reasonable to me - the documentation is quite extensive and the example python notebooks / data files are not overly large as far as file sizes for a git repo. I'm not entirely sure the link in doc_template.md will render in our website properly from your distribution template to the notebooks, but that's not any reason to hold this up and we can figure that out later.

I see some previous feedback was already addressed, but I'll admit I'm a bit nervous to approve / merge because the checks last ran 2 weeks ago. Would you mind updating this with the latest changes from main, which will trigger the checks again? If things are good I'm happy to merge.

Thanks!

JashG · 2025-06-16T14:18:13Z

@bbrowning Thanks for the review! Branch is up-to-date and ready to merge.

bbrowning

This looks like a great addition to our example notebooks showing some end-to-end examples with the NVIDIA distribution. Thank you!

I haven't had a chance to run the entire notebooks end-to-end myself, but did take a look at the Llama Stack Client usage within them and things look reasonable. As this has been open for quite a while, I'm ok to go ahead and merge this and then do any tweaks or follow-ups as needed later if we've tweaked some of these APIs since this was written.

jgulabrai added 9 commits April 3, 2025 11:19

In progress: Add NVIDIA e2e notebook

861962f

In-progress: e2e notebook with partial Eval integration

c04ab01

Updates to notebook; use direct requests to NeMo where needed

57813f5

Add back Guardrails section

a671b33

Clear notebook output

7faec23

Add high-level instructions

84e85e8

fix: Use NAMESPACE global variable

1a76c55

Merge branch 'main' into nvidia-e2e-notebook

7cdd2a0

feat: NVIDIA beginner e2e notebook

6927cdf

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 16, 2025

fix typo

75f7789

JashG changed the title ~~DRAFT: Nvidia e2e notebook~~ feat: DRAFT: Nvidia e2e notebook Apr 16, 2025

jgulabrai added 3 commits April 17, 2025 10:23

Ensure sampling_params param is included in run_eval calls

0d9d333

Clean up instructions and implementation; reorganize notebooks

4131e81

Add changes

8fd656d

JashG changed the title ~~feat: DRAFT: Nvidia e2e notebook~~ feat: DRAFT: Nvidia e2e notebooks: beginner notebook and tool calling notebook Apr 18, 2025

Fix variable name

e24959e

jgulabrai added 4 commits April 28, 2025 12:00

Merge branch 'main' into nvidia-e2e-notebook

73275f0

Rename tmp dir to sample_data; remove print statements

e649616

Minor unit test updates

c7ab6ee

Merge branch 'main' into nvidia-e2e-notebook

c3d8940

JashG changed the title ~~feat: DRAFT: Nvidia e2e notebooks: beginner notebook and tool calling notebook~~ feat: Add Nvidia e2e beginner notebook and tool calling notebook Apr 28, 2025

Remove unused env vars; change the other tmp folder name; fix examples

29f57d5

JashG marked this pull request as ready for review April 28, 2025 17:10

JashG requested review from ashwinb, yanxi0830, hardikjshah, dltn and raghotham as code owners April 28, 2025 17:10

JashG requested review from sixianyi0721, ehhuang, terrytangyuan, SLR722 and leseb as code owners April 28, 2025 17:10

jgulabrai added 4 commits April 29, 2025 12:57

fix: Consistently prefix customized models with the namespace

2f60f3c

Add reference to notebook in docs

96afc98

fix: Update datasets metadata field from provider to provider_id

f8f59c8

fix unit tests

bfbaf09

JashG mentioned this pull request Apr 30, 2025

fix: Fix messages format in NVIDIA safety check request body #2063

Merged

Merge branch 'main' into nvidia-e2e-notebook

012dd68

jgulabrai added 2 commits May 6, 2025 11:12

Merge branch 'main' into nvidia-e2e-notebook

b1d941e

fix: missing key

4999c8f

Merge branch 'main' into nvidia-e2e-notebook

51b68b4

JashG requested a review from bbrowning as a code owner May 28, 2025 21:47

jgulabrai added 3 commits May 28, 2025 17:48

Merge branch 'main' into nvidia-e2e-notebook

f5cb965

Merge branch 'main' into nvidia-e2e-notebook

6a004e9

Merge branch 'main' into nvidia-e2e-notebook

1a492ad

JashG requested a review from reluctantfuturist as a code owner June 16, 2025 13:45

Merge branch 'main' into nvidia-e2e-notebook

bd64bc9

bbrowning approved these changes Jun 16, 2025

View reviewed changes

bbrowning merged commit 40e2c97 into meta-llama:main Jun 16, 2025
114 checks passed

bbrowning mentioned this pull request Jun 16, 2025

fix: broken links on nvidia distro docs when rendered #2446

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add Nvidia e2e beginner notebook and tool calling notebook #1964

feat: Add Nvidia e2e beginner notebook and tool calling notebook #1964

Uh oh!

JashG commented Apr 16, 2025 •

edited

Loading

Uh oh!

hardikjshah commented Apr 25, 2025

Uh oh!

JashG commented Apr 28, 2025 •

edited

Loading

Uh oh!

JashG commented Apr 30, 2025

Uh oh!

hardikjshah commented May 14, 2025

Uh oh!

JashG commented May 19, 2025 •

edited

Loading

Uh oh!

dglogo commented May 27, 2025

Uh oh!

bbrowning commented Jun 13, 2025

Uh oh!

JashG commented Jun 16, 2025

Uh oh!

bbrowning left a comment

Uh oh!

Uh oh!

Uh oh!

feat: Add Nvidia e2e beginner notebook and tool calling notebook #1964

feat: Add Nvidia e2e beginner notebook and tool calling notebook #1964

Uh oh!

Conversation

JashG commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

hardikjshah commented Apr 25, 2025

Uh oh!

JashG commented Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JashG commented Apr 30, 2025

Uh oh!

hardikjshah commented May 14, 2025

Uh oh!

JashG commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dglogo commented May 27, 2025

Uh oh!

bbrowning commented Jun 13, 2025

Uh oh!

JashG commented Jun 16, 2025

Uh oh!

bbrowning left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

JashG commented Apr 16, 2025 •

edited

Loading

JashG commented Apr 28, 2025 •

edited

Loading

JashG commented May 19, 2025 •

edited

Loading