feat: [draft do not merge] Random dataset with specified input and output sequence length #1453

guyueh1 · 2025-10-31T20:04:56Z

What does this PR do ?

Random dataset following specified input and output sequence length

Issues

Usage

Use the following flags for fixed ISL/OSL eval

uv run examples/run_eval_random_dataset.py \
+data.input_len_or_input_len_generator=1000 \
generation.ignore_eos=true \
generation.vllm_cfg.max_model_len=3000

Use the following flags for fixed ISL/OSL GRPO

uv run examples/run_grpo_random_dataset.py \
+data.input_len_or_input_len_generator=1000 \
policy.generation.ignore_eos=true \
policy.generation.output_len_or_output_len_generator=2000

Use the following flags for random ISL/OSL GRPO with mean + stdv

uv run examples/run_grpo_random_dataset.py \
grpo.val_at_start=false \
grpo.val_period=0 \
policy.max_total_sequence_length=8000 \
+data.input_len_or_input_len_generator.mean=1000 \
+data.input_len_or_input_len_generator.std=100 \
+policy.generation.output_len_or_output_len_generator.mean=2000 \
+policy.generation.output_len_or_output_len_generator.std=1000 \
policy.generation.ignore_eos=True

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

Summary by CodeRabbit

Release Notes

New Features
- Added example scripts for GRPO training and evaluation workflows with support for random datasets
- Introduced ignore_eos flag in generation configurations for flexible EOS token handling
- Added output length configuration options for generation control
- Implemented dummy environment for testing and evaluation scenarios
- Added FP8 quantization support for MoE (Mixture of Experts) modules
Refactor
- Extended generation configuration to support configurable sequence length generation
- Enhanced model initialization with improved timing instrumentation for evaluation workflows

Signed-off-by: Guyue Huang <[email protected]>

guyueh1 added 9 commits October 27, 2025 11:56

Fix process_weights_after_loading for fp8 dense

41abdf1

Signed-off-by: Guyue Huang <[email protected]>

Support moe fp8 rollout qwen

9ba1bd2

Signed-off-by: Guyue Huang <[email protected]>

save

57ea880

Signed-off-by: Guyue Huang <[email protected]>

Merge branch 'main' into random_input_output_len

a44b438

Signed-off-by: Guyue Huang <[email protected]>

Fix eval

1dcea62

Signed-off-by: Guyue Huang <[email protected]>

Merge branch 'fp8_moe_rollout' into inf_bench

546ac9a

Add timer for eval step

1003155

Signed-off-by: Guyue Huang <[email protected]>

Fix grpo

fcff955

Signed-off-by: Guyue Huang <[email protected]>

fix

02698c9

Signed-off-by: Guyue Huang <[email protected]>

guyueh1 requested review from a team as code owners October 31, 2025 20:04

guyueh1 changed the title ~~feat: Random dataset with specified input and output sequence length~~ feat: [draft do not merge] Random dataset with specified input and output sequence length Oct 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: [draft do not merge] Random dataset with specified input and output sequence length #1453

feat: [draft do not merge] Random dataset with specified input and output sequence length #1453

Uh oh!

guyueh1 commented Oct 31, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: [draft do not merge] Random dataset with specified input and output sequence length #1453

Are you sure you want to change the base?

feat: [draft do not merge] Random dataset with specified input and output sequence length #1453

Uh oh!

Conversation

guyueh1 commented Oct 31, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Issues

Usage

Before your PR is "Ready for review"

Additional Information

Summary by CodeRabbit

Release Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

guyueh1 commented Oct 31, 2025 •

edited by coderabbitai bot

Loading