Skip to content

Add Qwen3.5-9B MedQA RL example#15

Open
Honyant wants to merge 1 commit into
MedARC-AI:mainfrom
Honyant:feature/qwen35-9b-medqa-example
Open

Add Qwen3.5-9B MedQA RL example#15
Honyant wants to merge 1 commit into
MedARC-AI:mainfrom
Honyant:feature/qwen35-9b-medqa-example

Conversation

@Honyant
Copy link
Copy Markdown

@Honyant Honyant commented Apr 27, 2026

Adds examples/medqa/rl.toml — 100-step GRPO config for Qwen3.5-9B on MedQA-USMLE-4-options, with the 9B-VLM-specific knobs (filesystem broadcast, max_async_level=1, optim_cpu_offload, no compile).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant