Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TorchFX] Experimental quantization using torch.ao quantizer #32

Open
wants to merge 15 commits into
base: develop
Choose a base branch
from

Conversation

daniil-lyakhov
Copy link
Owner

@daniil-lyakhov daniil-lyakhov commented Oct 23, 2024

Experimental quantization:

  • Quantizer and RangeEstimator algo are presented
  • Experimental PostTrainingPipeline with NNCFQuantizer support is presented

Changes

Reason for changes

Related tickets

Tests

@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from 3b1865c to e1c1fba Compare October 23, 2024 14:27
@daniil-lyakhov daniil-lyakhov changed the title WIP experimental quantization [TorchFX] Experimental quantization using torch.ao quantizer Oct 23, 2024
@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 2 times, most recently from bb8f475 to e6bf1d5 Compare November 28, 2024 10:15
### Changes

Redundant code is removed from
`tests/cross_fw/test_templates/test_quantizer_config.py`

### Reason for changes

To decrease amount of code in tests
@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 3 times, most recently from efd3367 to d1941f3 Compare November 28, 2024 17:32
KodiaqQ and others added 3 commits November 28, 2024 20:35
### Changes

- Added BF16/FP16 models to PTQ tests scope.

### Reason for changes

- Extend e2e with different precisions.

### Related tickets

- 147481

### Tests

- manual post_training_quantization/562/ - passed
### Changes

Removed TensorBackendType.
### Reason for changes

Got TensorBackendType and TensorBackend with the same definition
@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from 7701b88 to bea7751 Compare December 2, 2024 13:17
KodiaqQ and others added 5 commits December 2, 2024 16:28
### Changes

Removed requirements.txt

### Reason for changes

- requirements.txt that install package is not expected as default
behavior to install module in editable mode.
- editable mode is deprecated 
- requirements.txt not actual after pyproject.toml
### Changes

Update readme.md and installation.md
### Changes

- Add pattern based on file extension to gitignore for examples
- Use pathlib style instead os.path
- Fix issue with creating ../nncf/~/.cache/datasets directory
- Replace tqdm to nncf.common.logging.track_progress
- Remove unused function arguments and if conditions
- Clean logs, disable progress bar, remove log of validataion
- Pass list of command arguments to subprocess instead of string
- Remove nncf dependencies from tests/cross_fw/examples
- Add `--reuse-venv` to use venv from example directory
- Add progress bar to collect_values_for_each_item_using_prepared_model

### Tests
https://github.com/openvinotoolkit/nncf/actions/runs/11900403097
nightly/job/test_examples/618
nightly/job/windows/job/test-examples/278/
…notoolkit#3027)

### Changes

* Bias fusing is removed from default transformations
* `constant_folding` is updated to remove inplace operations without
users
* `extract_model` is updated to support original model output as a
subgraph output

### Reason for changes

To make it possible to apply quantization the same way it done by
X86Quantizer

### Related tickets

openvinotoolkit#2766
110985

### Tests
* All int8 references are updated and checked manually
* `test_constant_folding` and `test_constant_folding_with_constraints`
are updated with a constant subgraph which contains an inplace op
(`relu_`)
* `test_model_extraction_with_original_output` is introduced
* conformance test post_training_quantization/557 have finished
successfully
@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch from aea0bdf to 52e80c8 Compare December 4, 2024 09:59
@daniil-lyakhov daniil-lyakhov force-pushed the dl/fx/experimental_quantization branch 2 times, most recently from 9178921 to 43bc251 Compare December 5, 2024 12:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants