Commit 39472d8

Authored by shanmugamr1992, tdene, ArEsKay3, Shanmugam Ramasamy, and i-riyad
Shanmugamr1992/megatron inference ultra (#3784)
Co-authored-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Co-authored-by: Robert Kirby <rkirby@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <shanmugamr@cw-dfw-cs-001-login-01.cm.cluster>
Co-authored-by: rislam <rislam@nvidia.com>
1 parent 8f539df commit 39472d8

4 files changed: +288 -53 lines changed


megatron/core/inference/engines/dynamic_engine.py

Lines changed: 4 additions & 1 deletion
@@ -884,7 +884,10 @@ def _add_request(
             self.failed_request_ids.append(request_id)
             if self.rank == 0:
                 warnings.warn(
-                    f"Request {request_id} failed to be added to the engine due to errors."
+                    f"Request {request_id} failed to be added to the engine due to errors. "
+                    f"Prompt Tokens: {len(request.prompt_tokens)} "
+                    f"Tokens to generate: {request.sampling_params.num_tokens_to_generate} "
+                    f"Max sequence length: {self.context.max_sequence_length} "
                 )
 
         return self.requests[request_id].future
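
For reviewers who want to see what the extended warning looks like at runtime, the following is a minimal standalone sketch. Only the message format is taken from the hunk above; `SamplingParams`, `Request`, and `format_add_request_warning` are hypothetical stand-ins written for illustration and are not Megatron's actual classes or helpers.

```python
import warnings
from dataclasses import dataclass
from typing import List


@dataclass
class SamplingParams:
    """Hypothetical stand-in exposing only the field the warning reads."""
    num_tokens_to_generate: int


@dataclass
class Request:
    """Hypothetical stand-in exposing only the fields the warning reads."""
    prompt_tokens: List[int]
    sampling_params: SamplingParams


def format_add_request_warning(request_id: int, request: Request, max_sequence_length: int) -> str:
    # Hypothetical helper: builds the same string the diff passes to warnings.warn.
    # Adjacent f-strings are concatenated, so each piece ends with a space.
    return (
        f"Request {request_id} failed to be added to the engine due to errors. "
        f"Prompt Tokens: {len(request.prompt_tokens)} "
        f"Tokens to generate: {request.sampling_params.num_tokens_to_generate} "
        f"Max sequence length: {max_sequence_length} "
    )


# Illustrative values only: a 4000-token prompt asking for 200 new tokens.
request = Request(prompt_tokens=list(range(4000)), sampling_params=SamplingParams(200))
message = format_add_request_warning(7, request, max_sequence_length=4096)

# Record the warning instead of letting it go to stderr, so it can be inspected.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    warnings.warn(message)

print(str(caught[0].message))
```

With all three numbers in one message, a reader can compare the prompt length and requested generation length against the maximum sequence length directly from the log, without re-running the request.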
