Skip to content
This repository was archived by the owner on Oct 25, 2024. It is now read-only.

Commit a864bb2

Browse files
changwangssXinyuYe-Intelpre-commit-ci[bot]
authored
Migrate SQ and WOQ to INC 3.x API. (#1606)
Signed-off-by: changwangss <[email protected]> Co-authored-by: Ye, Xinyu <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent c263d09 commit a864bb2

File tree

21 files changed

+1809
-1816
lines changed

21 files changed

+1809
-1816
lines changed

.github/checkgroup.yml

-34
Original file line numberDiff line numberDiff line change
@@ -30,40 +30,6 @@ subprojects:
3030
- "optimize-unit-test-PR-test"
3131
- "Genreate-OptimizeUT-Report"
3232

33-
- id: "NeuralChat Unit Test"
34-
paths:
35-
- ".github/workflows/unit-test-neuralchat.yml"
36-
- ".github/workflows/script/unitTest/run_unit_test_neuralchat.sh"
37-
- "intel_extension_for_transformers/neural_chat/**"
38-
- "requirements.txt"
39-
- "setup.py"
40-
- "intel_extension_for_transformers/transformers/llm/finetuning/**"
41-
- "intel_extension_for_transformers/transformers/llm/quantization/**"
42-
- "intel_extension_for_transformers/transformers/**"
43-
- "intel_extension_for_transformers/langchain/**"
44-
- "!intel_extension_for_transformers/neural_chat/docs/**"
45-
- "!intel_extension_for_transformers/neural_chat/examples/**"
46-
- "!intel_extension_for_transformers/neural_chat/assets/**"
47-
- "!intel_extension_for_transformers/neural_chat/README.md"
48-
checks:
49-
- "neuralchat-unit-test-baseline"
50-
- "neuralchat-unit-test-PR-test"
51-
- "Generate-NeuralChat-Report"
52-
53-
- id: "Engine Unit Test workflow"
54-
paths:
55-
- ".github/workflows/unit-test-engine.yml"
56-
- "requirements.txt"
57-
- "setup.py"
58-
- intel_extension_for_transformers/transformers/**
59-
- "intel_extension_for_transformers/transformers/runtime/**"
60-
- "!intel_extension_for_transformers/transformers/runtime/kernels/**"
61-
- "!intel_extension_for_transformers/transformers/runtime/third_party/**"
62-
- "!intel_extension_for_transformers/transformers/runtime/docs/**"
63-
checks:
64-
- "engine-unit-test-baseline"
65-
- "engine-unit-test-PR-test"
66-
- "Genreate-Engine-Report"
6733

6834
# - id: "Windows Binary Test"
6935
# paths:

.github/workflows/script/unitTest/env_setup.sh

+1
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ until [ "$n" -ge 5 ]; do
1313
git clone https://github.com/intel/neural-compressor.git /neural-compressor
1414
cd /neural-compressor
1515
pip install -r requirements.txt
16+
pip install -r requirements_pt.txt
1617
python setup.py install && break
1718
n=$((n + 1))
1819
sleep 5

examples/huggingface/pytorch/text-generation/quantization/README.md

+2-8
Original file line numberDiff line numberDiff line change
@@ -36,21 +36,18 @@ OMP_NUM_THREADS=<physical cores num> numactl -m <node N> -C <cpu list> python ru
3636
--model <MODEL_NAME_OR_PATH> \
3737
--sq \
3838
--output_dir <SQ_MODEL_SAVE_PATH> \ # Default is "./saved_results."
39-
--int8 \
4039
--benchmark \
4140
--batch_size 1
4241
# load SQ model quantied by itrex and do benchmark.
4342
OMP_NUM_THREADS=<physical cores num> numactl -m <node N> -C <cpu list> python run_generation_sq.py \
4443
--model <SQ_MODEL_SAVE_PATH> \
45-
--int8 \
4644
--benchmark \
4745
--batch_size 1
4846
# load SQ model quantied configure.json and do benchmark.
4947
python run_generation_sq.py \
5048
--model <MODEL_NAME_OR_PATH> \
5149
--output_dir <SQ_MODEL_SAVE_PATH> \
52-
--int8 \
53-
--restore \
50+
--restore_sq_model_from_json \
5451
--benchmark \
5552
--batch_size 1
5653
```
@@ -68,23 +65,20 @@ python run_generation_sq.py \
6865
--model <MODEL_NAME_OR_PATH> \
6966
--sq \
7067
--output_dir <SQ_MODEL_SAVE_PATH> \ # Default is "./saved_results."
71-
--int8 \
7268
--accuracy \
7369
--batch_size 56
7470

7571
# load SQ model quantied by itrex and do benchmark.
7672
python run_generation_sq.py \
7773
--model <SQ_MODEL_SAVE_PATH> \
78-
--int8 \
7974
--accuracy \
8075
--batch_size 56
8176

8277
# load SQ model quantied configure.json and do benchmark.
8378
python run_generation_sq.py \
8479
--model <MODEL_NAME_OR_PATH> \
8580
--output_dir <SQ_MODEL_SAVE_PATH> \
86-
--int8 \
87-
--restore \
81+
--restore_sq_model_from_json \
8882
--accuracy \
8983
--batch_size 56
9084

0 commit comments

Comments
 (0)