-
Notifications
You must be signed in to change notification settings - Fork 108
RULER remove answer prefix #953
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Cheng-Ping Hsieh <[email protected]>
WalkthroughThe RULER data preparation script updates remove generation-specific configs, change how question text is composed, drop generation fields from outputs, adjust max sequence length calculation, reduce per-task sample counts, and stop writing tokens_to_generate into generated per-task init.py files. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
participant U as CLI/Caller
participant P as prepare.py
participant T as Task Loader
participant W as Writer
U->>P: run RULER prepare
P->>T: load tasks and templates
Note over P,T: max_seq_length computed earlier<br/>(subtract template_tokens)
loop per task (limited to 100 samples)
P->>T: read original_entry
Note right of P: question = input + answer_prefix<br/>(no generation config)
P->>W: write prepared sample (no "generation" field)
end
P->>W: write per-task __init__.py<br/>(no tokens_to_generate in DEFAULT_SETTINGS)
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~20 minutes Poem
Pre-merge checks and finishing touches❌ Failed checks (1 warning, 1 inconclusive)
✅ Passed checks (1 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: CodeRabbit UI Review profile: CHILL Plan: Pro 📒 Files selected for processing (1)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
🔇 Additional comments (4)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's pause this for two weeks.
Summary by CodeRabbit
New Features
Refactor
Chores