fix: baseline trial counter and duplicate check bugs #118
Fixes two critical bugs in baseline trial integration (PR openshift-psap#111):

1. **Baseline trials not counted in completed_trials**
   - Baseline trials were creating Optuna trial objects but not incrementing the self.completed_trials counter
   - This caused the optimization loop to think no trials had run
   - Result: System would try to run n_trials MORE trials after baseline
   - If trials failed quickly, this created thousands of phantom failed trials
   - Fix: Increment self.completed_trials after each baseline completion (success, failure, timeout, or exception)

2. **Baseline trials blocking optimization trials in duplicate check**
   - Failed baseline trials (with empty params {}) were being included in duplicate parameter detection
   - Optimization trials with empty params were incorrectly flagged as duplicates of the failed baseline
   - Result: Optimization trials skipped with "duplicate parameters" warning
   - Fix: Exclude baseline trials from duplicate check by checking the is_baseline user attribute
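For readers without the diff at hand, the two fixes can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the project's actual code: `TrialRecord`, `StudyControllerSketch`, `record_baseline`, `is_duplicate`, and `remaining_trials` are hypothetical names, and only `completed_trials`, `n_trials`, and the `is_baseline` user attribute come from the description above. It also assumes that baseline trials count against the `n_trials` budget.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class TrialRecord:
    """Hypothetical stand-in for an Optuna trial: params plus user attributes."""
    params: Dict[str, Any]
    user_attrs: Dict[str, Any] = field(default_factory=dict)
    state: str = "COMPLETE"


class StudyControllerSketch:
    """Hypothetical, simplified controller showing only the two fixes."""

    def __init__(self, n_trials: int) -> None:
        self.n_trials = n_trials
        self.completed_trials = 0
        self.trials: List[TrialRecord] = []

    def record_baseline(self, state: str) -> None:
        # Baseline trials run with empty params and are tagged is_baseline.
        self.trials.append(
            TrialRecord(params={}, user_attrs={"is_baseline": True}, state=state)
        )
        # Fix 1: count the baseline whatever its outcome
        # (success, failure, timeout, or exception).
        self.completed_trials += 1

    def is_duplicate(self, params: Dict[str, Any]) -> bool:
        # Fix 2: ignore baseline trials when looking for repeated params.
        return any(
            trial.params == params
            for trial in self.trials
            if not trial.user_attrs.get("is_baseline", False)
        )

    def remaining_trials(self) -> int:
        # Assumption: the loop stops once completed_trials reaches n_trials.
        return max(self.n_trials - self.completed_trials, 0)


controller = StudyControllerSketch(n_trials=5)
controller.record_baseline(state="FAIL")
print(controller.remaining_trials())  # one trial of the budget is used
print(controller.is_duplicate({}))    # failed baseline does not block {}
```

Filtering on the user attribute rather than on the empty params keeps the check correct even if a baseline later carries non-empty params.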
Additional context used (CodeRabbit code graph analysis): auto_tune_vllm/core/study_controller.py
ephoris
left a comment
LGTM!
thameem-abbas
left a comment
LGTM