-
Notifications
You must be signed in to change notification settings - Fork 509
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Differing component implementation logic across recipes
best practice
Things we should be doing but aren't
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2307
opened Jan 29, 2025 by
EugenHotaj
Saving multiple checkpoints per epoch
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2285
opened Jan 21, 2025 by
EugenHotaj
Roadmap for other parallelisms
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2280
opened Jan 19, 2025 by
rahul-sarvam
Llama3.2 vision does not run with distributed state dict
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2277
opened Jan 17, 2025 by
acisseJZhong
The current instantiation does not trigger the initialization of submodules
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2273
opened Jan 16, 2025 by
dz1iang
DPO after / on top of LoRA tuning
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2272
opened Jan 16, 2025 by
albertbn
About the CLS token for the llama3_2_vision_encoder
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2268
opened Jan 15, 2025 by
dfloreaa
Expose FSDP2 MixedPrecisionPolicy params
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2267
opened Jan 14, 2025 by
EugenHotaj
adding support for LR schedule for full distributed finetune
best practice
Things we should be doing but aren't
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
triaged
This issue has been assigned an owner and appropriate label
#2263
opened Jan 13, 2025 by
tginart
Request: adding Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
triaged
This issue has been assigned an owner and appropriate label
py.typed
for type checkers
better engineering
#2258
opened Jan 13, 2025 by
jamesbraza
Qlora uses more memory than regular lora
triaged
This issue has been assigned an owner and appropriate label
#2255
opened Jan 11, 2025 by
AndrewMead10
Very slow convergence with bf16
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2254
opened Jan 11, 2025 by
EugenHotaj
Finetuning Llama 3.1 8B Base Model on ChatML Format Dataset – Loss Reaches NaN After 2000 Steps
triaged
This issue has been assigned an owner and appropriate label
#2246
opened Jan 10, 2025 by
abdul-456
Overriding kv cache entries in torchtune models
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2241
opened Jan 9, 2025 by
telgamal-1
Finetune meta-llama/Llama-Guard-3-1B
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2237
opened Jan 8, 2025 by
jingzhaoou
Hugging Face from_pretrained() using merged weights KeyError: 'base_model_name_or_path'
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2224
opened Jan 2, 2025 by
chg0901
How to use train and test split with the recipes?
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2222
opened Jan 1, 2025 by
7rabbit
packed errors
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2218
opened Dec 31, 2024 by
chg0901
[feature request] support input/output to fsspec path
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2217
opened Dec 31, 2024 by
leoleoasd
Model request. Phi4
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2190
opened Dec 20, 2024 by
krammnic
Add multiprocess dataset packing
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2180
opened Dec 19, 2024 by
bratao
GPU Middle Class?
discussion
Start a discussion
distributed
Anything related to distributed env (multi-GPU, multi-node)
triaged
This issue has been assigned an owner and appropriate label
#2161
opened Dec 16, 2024 by
EugenHotaj
Move Things we should be doing but aren't
triaged
This issue has been assigned an owner and appropriate label
update_recipe_state
to its own util
best practice
#2158
opened Dec 13, 2024 by
joecummings
ProTip!
Adding no:label will show everything without a label.