[TorchFX] Experimental quantization using torch.ao quantizer #32

daniil-lyakhov · 2024-10-23T12:53:44Z

Experimental quantization:

Quantizer and RangeEstimator algo are presented
Experimental PostTrainingPipeline with NNCFQuantizer support is presented

Changes

Reason for changes

Related tickets

Tests

### Changes Redundant code is removed from `tests/cross_fw/test_templates/test_quantizer_config.py` ### Reason for changes To decrease amount of code in tests

### Changes - Added BF16/FP16 models to PTQ tests scope. ### Reason for changes - Extend e2e with different precisions. ### Related tickets - 147481 ### Tests - manual post_training_quantization/562/ - passed

### Changes Removed TensorBackendType. ### Reason for changes Got TensorBackendType and TensorBackend with the same definition

Workaround for issue huggingface/optimum-intel#993

Reverts openvinotoolkit#2922 ### Related tickets 158536

### Changes Removed requirements.txt ### Reason for changes - requirements.txt that install package is not expected as default behavior to install module in editable mode. - editable mode is deprecated - requirements.txt not actual after pyproject.toml

### Changes Update readme.md and installation.md

### Changes - Add pattern based on file extension to gitignore for examples - Use pathlib style instead os.path - Fix issue with creating ../nncf/~/.cache/datasets directory - Replace tqdm to nncf.common.logging.track_progress - Remove unused function arguments and if conditions - Clean logs, disable progress bar, remove log of validataion - Pass list of command arguments to subprocess instead of string - Remove nncf dependencies from tests/cross_fw/examples - Add `--reuse-venv` to use venv from example directory - Add progress bar to collect_values_for_each_item_using_prepared_model ### Tests https://github.com/openvinotoolkit/nncf/actions/runs/11900403097 nightly/job/test_examples/618 nightly/job/windows/job/test-examples/278/

…notoolkit#3027) ### Changes * Bias fusing is removed from default transformations * `constant_folding` is updated to remove inplace operations without users * `extract_model` is updated to support original model output as a subgraph output ### Reason for changes To make it possible to apply quantization the same way it done by X86Quantizer ### Related tickets openvinotoolkit#2766 110985 ### Tests * All int8 references are updated and checked manually * `test_constant_folding` and `test_constant_folding_with_constraints` are updated with a constant subgraph which contains an inplace op (`relu_`) * `test_model_extraction_with_original_output` is introduced * conformance test post_training_quantization/557 have finished successfully

github-actions bot added the experimental label Oct 23, 2024

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from 3b1865c to e1c1fba Compare October 23, 2024 14:27

github-actions bot added NNCF PT NNCF PTQ labels Oct 23, 2024

daniil-lyakhov changed the title ~~WIP experimental quantization~~ [TorchFX] Experimental quantization using torch.ao quantizer Oct 23, 2024

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 2 times, most recently from bb8f475 to e6bf1d5 Compare November 28, 2024 10:15

[Tests] A redundant code is removed (openvinotoolkit#3119)

76e0ffc

### Changes Redundant code is removed from `tests/cross_fw/test_templates/test_quantizer_config.py` ### Reason for changes To decrease amount of code in tests

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 3 times, most recently from efd3367 to d1941f3 Compare November 28, 2024 17:32

KodiaqQ and others added 3 commits November 28, 2024 20:35

Added BF16 & FP16 models to PTQ tests (openvinotoolkit#2922)

9c5220a

### Changes - Added BF16/FP16 models to PTQ tests scope. ### Reason for changes - Extend e2e with different precisions. ### Related tickets - 147481 ### Tests - manual post_training_quantization/562/ - passed

remove unused class (openvinotoolkit#3123)

7c64c3d

### Changes Removed TensorBackendType. ### Reason for changes Got TensorBackendType and TensorBackend with the same definition

Fix imports in FP8 example (openvinotoolkit#3117)

03c0540

Workaround for issue huggingface/optimum-intel#993

github-actions bot added the NNCF Common label Dec 2, 2024

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from 7701b88 to bea7751 Compare December 2, 2024 13:17

KodiaqQ and others added 5 commits December 2, 2024 16:28

Revert "Added BF16 & FP16 models to PTQ tests" (openvinotoolkit#3124)

0799634

Reverts openvinotoolkit#2922 ### Related tickets 158536

Update readme and installation (openvinotoolkit#3111)

75ba2f7

### Changes Update readme.md and installation.md

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from aea0bdf to 52e80c8 Compare December 4, 2024 09:59

github-actions bot added documentation Improvements or additions to documentation dependencies NNCF TF NNCF ONNX NNCF OpenVINO labels Dec 4, 2024

daniil-lyakhov added 3 commits December 4, 2024 11:04

WIP experimental quantization

75bbe9b

Experimental quantization

0c406cc

Reuse MinMax algo instead of copy-paste

8e001e3

Correct use of transform_for_annotation

52e80c8

daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 2 times, most recently from 9178921 to 43bc251 Compare December 5, 2024 12:32

daniil-lyakhov added 2 commits December 5, 2024 13:38

Comments/fixes

43bc251

batchwise_statistics

e285dc6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TorchFX] Experimental quantization using torch.ao quantizer #32

[TorchFX] Experimental quantization using torch.ao quantizer #32

daniil-lyakhov commented Oct 23, 2024 •

edited

Loading

[TorchFX] Experimental quantization using torch.ao quantizer #32

Are you sure you want to change the base?

[TorchFX] Experimental quantization using torch.ao quantizer #32

Conversation

daniil-lyakhov commented Oct 23, 2024 • edited Loading

Changes

Reason for changes

Related tickets

Tests

daniil-lyakhov commented Oct 23, 2024 •

edited

Loading