Allow for setting separate log_dir
#17
Signed-off-by: Olaf Lipinski <[email protected]>
…v helper scripts, separate test_utils.sh from infra_utils.sh
…erent trainer fall back modes - updated schedule generation documentation to include new FTS `log_dir` param ref - added mlflow to default test extra directive - added tests for new FTS `log_dir` param - disabled a flaky (10%) 2D model parallel test with limited marginal utility
…e-based trainer refs - updated autodoc_typehints config to be less verbose - added yaml (multi)representer for PretrainedConfig object types - bumped docker image to use test channel (2.7.0-rc2)
- remove duplicate multi-gpu test directive
/azp run
Azure Pipelines successfully started running 1 pipeline(s).
Thanks for your valuable contribution @olipinski! Apologies for the delayed response; I've had limited bandwidth to dedicate to Finetuning Scheduler in the last few weeks. I've updated your PR as follows (along with including a convenience rebase of some previously in-flight changes to ensure PyTorch 2.7.0-rc2 is accommodated)
Arguably, Lightning should handle this scenario by updating its […]
Thanks again for your contribution. Feel free to reach out anytime if you have other issues or want to share more about your use case. Best of luck with your work and let me know if you have any questions or further suggestions!
…for PretrainedConfig object types, update docker image to use cuda 12.6.3
Thanks for merging, @speediedan! No worries about the delay. I'm glad the PR was useful and led to some additional refactoring and enhancements. Thanks for your continued work on FTS!
What does this PR do?
Allows for setting a separate `log_dir` from the trainer `log_dir`.
Fixes #16
Fixes Lightning-AI/pytorch-lightning/issue#20615
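For readers skimming the thread, here is a minimal sketch of the fallback idea behind a separate `log_dir`: use the explicitly configured directory when one is given, otherwise fall back to whatever the trainer reports. The helper name `resolve_log_dir` and its parameters are hypothetical and for illustration only; this is not the code merged in this PR, and the actual FTS fallback behavior may differ.

```python
from pathlib import Path
from typing import Optional, Union

PathLike = Union[str, Path]


def resolve_log_dir(
    callback_log_dir: Optional[PathLike],
    trainer_log_dir: Optional[PathLike],
    default_root_dir: PathLike,
) -> Path:
    """Pick the directory a callback should write its artifacts to.

    Hypothetical helper (an assumption, not the FTS implementation).
    Preference order:
      1. the directory explicitly given to the callback,
      2. the trainer's log directory, if it is set,
      3. the trainer's default root directory as a last resort.
    """
    for candidate in (callback_log_dir, trainer_log_dir):
        if candidate is not None:
            return Path(candidate)
    return Path(default_root_dir)


if __name__ == "__main__":
    # An explicit callback directory takes precedence over the trainer's.
    print(resolve_log_dir("fts_logs", "lightning_logs/version_0", "."))
    # With no explicit directory, the trainer's log directory is used.
    print(resolve_log_dir(None, "lightning_logs/version_0", "."))
    # If the trainer has no log directory either, use the root directory.
    print(resolve_log_dir(None, None, "."))
```

The ordering puts the most specific setting first, so existing setups that never pass a separate directory keep writing wherever the trainer already logs.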
Does your PR introduce any breaking changes? If yes, please list them.
None
PR review
Anyone in the community is welcome to review the PR.
Did you have fun?
Make sure you had fun coding 🙃