Skip to content

Commit 518312d

Browse files
address reviews
1 parent 629ede3 commit 518312d

File tree

3 files changed

+4
-4
lines changed

3 files changed

+4
-4
lines changed

guides/ipynb/writing_quantization_compatible_layers.ipynb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -191,7 +191,7 @@
191191
"\n",
192192
"- `self._kernel` as an INT8 vector of shape `(input_dim,)` (the same shape as\n",
193193
" the original full-precision kernel).\n",
194-
"- `self.scale` as the scalar quantization scale in the layer's compute dtype,\n",
194+
"- `self.scale` as the scalar quantization scale in the layer's variable dtype,\n",
195195
" which is FP32 in this case."
196196
]
197197
},
@@ -801,4 +801,4 @@
801801
},
802802
"nbformat": 4,
803803
"nbformat_minor": 0
804-
}
804+
}

guides/md/writing_quantization_compatible_layers.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -137,7 +137,7 @@ INT8 variables. It should allocate:
137137

138138
- `self._kernel` as an INT8 vector of shape `(input_dim,)` (the same shape as
139139
the original full-precision kernel).
140-
- `self.scale` as the scalar quantization scale in the layer's compute dtype,
140+
- `self.scale` as the scalar quantization scale in the layer's variable dtype,
141141
which is FP32 in this case.
142142

143143

guides/writing_quantization_compatible_layers.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -135,7 +135,7 @@ def quantize(self, mode, **kwargs):
135135
136136
- `self._kernel` as an INT8 vector of shape `(input_dim,)` (the same shape as
137137
the original full-precision kernel).
138-
- `self.scale` as the scalar quantization scale in the layer's compute dtype,
138+
- `self.scale` as the scalar quantization scale in the layer's variable dtype,
139139
which is FP32 in this case.
140140
"""
141141

0 commit comments

Comments
 (0)