-
Notifications
You must be signed in to change notification settings - Fork 378
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Error while running inference with generate_v2.py after one generation
#1755
opened Oct 5, 2024 by
Vattikondadheeraj
recipe for hyperparameter sweep
enhancement
New feature or request
#1752
opened Oct 4, 2024 by
felipemello1
lr scheduler is not optional
enhancement
New feature or request
#1751
opened Oct 4, 2024 by
felipemello1
Can we support class weight in the CEWithChunkedOutputLoss class
enhancement
New feature or request
#1746
opened Oct 2, 2024 by
ye-jin-shop
saving 70B checkpoint takes 1000s for full finetuning 8GPU
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
discussion
Start a discussion
#1735
opened Oct 1, 2024 by
felipemello1
Multiple GPU low performance
question
Further information is requested
#1734
opened Oct 1, 2024 by
jetstudio-io
update config defaults dataset.packed and log_peak_memory_stats
community help wanted
We would love the community's help completing this issue
#1732
opened Oct 1, 2024 by
felipemello1
when finetuning pretrained models with lora+chat template, embeddings are not updated
#1725
opened Sep 30, 2024 by
felipemello1
Unsupported Ops in MPS: _upsample_bilinear2d_aa
discussion
Start a discussion
#1723
opened Sep 30, 2024 by
Jack-Khuu
mistral-nemo support
community help wanted
We would love the community's help completing this issue
enhancement
New feature or request
#1716
opened Sep 30, 2024 by
win4r
mistral-small support
community help wanted
We would love the community's help completing this issue
enhancement
New feature or request
#1715
opened Sep 30, 2024 by
win4r
torch.distributed.elastic.multiprocessing.errors.ChildFailedError
bug
Something isn't working
#1710
opened Sep 29, 2024 by
Vattikondadheeraj
torchtune quantization has different model output comparing with document
#1701
opened Sep 27, 2024 by
elfisworking
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.