Update run-docs to avoid code duplication #1439

Merged
merged 29 commits into from
Jan 30, 2025
Changes from 1 commit
Commits
29 commits
e90ee12
Update run-docs to avoid duplicate code
mikekgfb Dec 23, 2024
ddb3773
Update run-docs
mikekgfb Dec 24, 2024
d834661
Merge branch 'main' into patch-35
mikekgfb Dec 27, 2024
9e82bbc
Update build_native.sh
mikekgfb Dec 29, 2024
6087a58
Update run-docs
mikekgfb Jan 1, 2025
347c64e
Merge branch 'main' into patch-35
mikekgfb Jan 6, 2025
92a2f8a
Merge branch 'main' into patch-35
mikekgfb Jan 15, 2025
d602eed
Merge branch 'main' into patch-35
mikekgfb Jan 17, 2025
f0df24e
Merge branch 'main' into patch-35
mikekgfb Jan 18, 2025
a3772f1
Merge branch 'main' into patch-35
mikekgfb Jan 22, 2025
f670dc9
Merge branch 'main' into patch-35
mikekgfb Jan 23, 2025
158b3e6
Merge branch 'pytorch:main' into patch-35
mikekgfb Jan 23, 2025
dcb2a60
Update run-docs
mikekgfb Jan 23, 2025
adcb28a
Update run-docs
mikekgfb Jan 24, 2025
053058d
Merge branch 'main' into patch-35
Jack-Khuu Jan 24, 2025
5e21fff
Merge branch 'main' into patch-35
Jack-Khuu Jan 24, 2025
680937b
Merge branch 'main' into patch-35
mikekgfb Jan 27, 2025
bd594fb
Update README.md
mikekgfb Jan 28, 2025
1015de7
Update quantization.md
mikekgfb Jan 28, 2025
02dd5db
Update run-docs
mikekgfb Jan 28, 2025
da1b98d
Update run.cpp
mikekgfb Jan 28, 2025
f3ee3e4
Update run.cpp
mikekgfb Jan 28, 2025
5629e29
Create cuda-32.json
mikekgfb Jan 28, 2025
902a5da
Create mobile-32.json
mikekgfb Jan 28, 2025
0ac7096
Update run-docs
mikekgfb Jan 28, 2025
4d97e78
Update run-docs
mikekgfb Jan 28, 2025
c787e1a
Update run-readme-pr-mps.yml
mikekgfb Jan 29, 2025
156ceda
Update run.cpp
mikekgfb Jan 29, 2025
b77ddf3
Update run.cpp
mikekgfb Jan 30, 2025
Create mobile-32.json
add gs=32 variant of mobile for tests
mikekgfb authored Jan 28, 2025
commit 902a5dab1aa79c16dfa49cb81135198e75cfd4ae
4 changes: 4 additions & 0 deletions torchchat/quant_config/mobile-32.json
@@ -0,0 +1,4 @@
{
"embedding": {"bitwidth": 4, "groupsize" : 32},
"linear:a8w4dq": {"groupsize" : 32}
}
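
As a rough illustration of what this config expresses, the sketch below parses the file contents shown in the diff and reports the group size for each scheme. It is not torchchat's loader: `group_sizes` and `groups_per_row` are hypothetical helpers, the row width of 4096 is an arbitrary example value, and reading `groupsize` as "number of consecutive weights that share quantization parameters" is an assumption based on the usual meaning of group-wise quantization.

```python
import json

# Contents of torchchat/quant_config/mobile-32.json as added in this commit.
MOBILE_32 = """
{
"embedding": {"bitwidth": 4, "groupsize" : 32},
"linear:a8w4dq": {"groupsize" : 32}
}
"""


def group_sizes(config_text):
    """Map each scheme name in the config to its groupsize (hypothetical helper)."""
    config = json.loads(config_text)
    return {scheme: options["groupsize"] for scheme, options in config.items()}


def groups_per_row(row_width, groupsize):
    """Count the groups in one weight row, assuming equal-sized groups (hypothetical helper)."""
    if row_width % groupsize != 0:
        raise ValueError("row width must be a multiple of groupsize")
    return row_width // groupsize


sizes = group_sizes(MOBILE_32)
print(sizes)

# Illustrative only: 4096 is an assumed row width, not a value from the PR.
print(groups_per_row(4096, 32))
```

Under that assumption, a smaller group size means more groups per row, so more scales to store but finer-grained quantization. That is one plausible reason to keep a gs=32 variant for tests alongside the existing mobile config.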