update and polish llama2 trtllm_guide.md #129

ziqif-nv · 2025-02-12T00:03:42Z

Update the guide so that it works and reads more clearly, based on my experience reproducing it.

NOTE: Without the config added in the FILL_TEMPLATE_SCRIPT section, the tritonserver deployment would fail due to protobuf parsing errors.

oandreeva-nv

LGTM, thanks for the polishing!

krishung5

Lgtm as well, thanks for the update.

update and polish llama2 trtllm_guide.md

84a7fed

ziqif-nv requested review from oandreeva-nv and krishung5 February 12, 2025 00:03

oandreeva-nv approved these changes Feb 12, 2025

krishung5 approved these changes Feb 12, 2025

ziqif-nv merged commit 8bd14d1 into main Feb 12, 2025
3 checks passed

ziqif-nv deleted the ziqif_llama2_trtllm branch February 12, 2025 00:19

fdf3d186-88d5 pushed a commit to fdf3d186-88d5/triton-inference-server that referenced this pull request Mar 21, 2025

update and polish llama2 trtllm_guide.md (triton-inference-server#129)

1f3f231
