Commit 53392e6

Update int4_quantization_in_keras.py
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
1 parent f75951e commit 53392e6

1 file changed: +1 -1 lines changed

guides/int4_quantization_in_keras.py

Lines changed: 1 addition & 1 deletion
@@ -37,7 +37,7 @@
 
 ## Benefits
 * Memory / bandwidth bound models: When the implementation spends most of its time on memory I/O,
-reducing the computation time does not reduce their overall runtime. INT4 reduces bytes
+reducing the computation time does not reduce its overall runtime. INT4 reduces bytes
 moved by ~8x vs `float32`, improving cache behavior and reducing memory stalls;
 this often helps more than increasing raw FLOPs.
 * Accuracy: Many architectures retain acceptable accuracy with INT4; encoder-only models
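
The "~8x vs `float32`" figure in the hunk above can be checked with simple byte arithmetic: a `float32` weight takes 4 bytes, while two 4-bit values fit in one byte. The following is a minimal NumPy sketch of that arithmetic, not the packing layout Keras uses internally. The helper `pack_int4` and the single per-tensor `scale` are hypothetical names introduced here for illustration.

```python
import numpy as np


def pack_int4(values: np.ndarray) -> np.ndarray:
    """Pack signed 4-bit values (range [-8, 7]) two per byte.

    Hypothetical helper for illustration only; it is not a Keras API.
    """
    flat = values.ravel().astype(np.int8)
    if flat.size % 2 != 0:
        raise ValueError("expected an even number of values")
    # Keep the low 4 bits (two's complement nibble) of each value.
    nibbles = (flat & 0x0F).astype(np.uint8)
    # Even-indexed values go in the low nibble, odd-indexed in the high nibble.
    return (nibbles[0::2] | (nibbles[1::2] << 4)).astype(np.uint8)


# 1024 float32 weights occupy 4096 bytes.
weights_fp32 = np.random.default_rng(0).standard_normal(1024).astype(np.float32)

# Assumed symmetric per-tensor quantization to the int4 range.
scale = np.abs(weights_fp32).max() / 7.0
quantized = np.clip(np.round(weights_fp32 / scale), -8, 7).astype(np.int8)

packed = pack_int4(quantized)
ratio = weights_fp32.nbytes / packed.nbytes
print(weights_fp32.nbytes, packed.nbytes, ratio)  # 4096 512 8.0
```

The ratio is exactly 8 only for the packed weights themselves. Stored scales (and zero points, if any) add a small overhead, which is presumably why the guide says "~8x".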

0 commit comments