
Support QDQ transformations with com.microsoft.Quantize/Dequantize ops #17127

Merged: 57 commits on Aug 25, 2023

Conversation

@adrianlizarraga (Contributor) commented Aug 12, 2023

Description

  • Enables int32 support for com.microsoft.DequantizeLinear (contrib op)
  • Makes the zero_point input optional for Quantize/Dequantize contrib ops
  • Enables QDQ transformations with the Quantize/Dequantize contrib ops
  • Updates tests: EnsureUniqueDQForNodeUnitTests, QDQTransformerTests, TransposeOptimizerTests
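
As a rough illustration of the zero_point change, here is a plain-NumPy sketch of the Quantize/Dequantize semantics (illustrative only, not the contrib op implementation): when `zero_point` is omitted, the ops behave as if it were 0, and DequantizeLinear additionally accepts int32 input.

```python
import numpy as np

def quantize_linear(x, scale, zero_point=None, qmin=-128, qmax=127):
    # When zero_point is omitted it defaults to 0 -- the optional-input
    # behavior this PR enables for the com.microsoft contrib ops.
    zp = 0 if zero_point is None else zero_point
    return np.clip(np.round(x / scale) + zp, qmin, qmax).astype(np.int32)

def dequantize_linear(q, scale, zero_point=None):
    zp = 0 if zero_point is None else zero_point
    # int32 values are accepted here, mirroring the contrib op's new
    # int32 support for DequantizeLinear.
    return (q.astype(np.int64) - zp) * scale

x = np.array([0.5, -0.25, 1.0])
q = quantize_linear(x, scale=0.01)
# Omitting zero_point is equivalent to passing zero_point=0.
assert np.array_equal(q, quantize_linear(x, 0.01, zero_point=0))
assert np.allclose(dequantize_linear(q, 0.01), x)
```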

Testing

List of tested graph transformations:

  • QDQSelectorActionTransformer
    • qdq_transformer_test.cc
  • QDQS8ToU8Transformer
    • qdq_transformer_test.cc
  • DoubleQDQPairsRemover
    • qdq_transformer_test.cc
  • IdenticalChildrenConsolidation
    • qdq_transformer_test.cc
  • QDQPropagation
    • qdq_transformer_test.cc
  • QDQFinalCleanup
    • qdq_transformer_test.cc
  • ClipQuantFusion
    • qdq_transformer_test.cc
  • ReluQuantFusion
    • qdq_transformer_test.cc
  • EnsureUniqueDQForNodeUnit
    • ensure_unique_dq_for_node_unit_test.cc
  • TransposeOptimizer
    • transpose_optimizer_test.cc
  • CommonSubexpressionElimination
    • graph_transform_test.cc
  • ConstantFolding
    • graph_transform_test.cc
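
For intuition on one of the listed transformations, DoubleQDQPairsRemover: re-quantizing already-quantized values with the same scale and zero-point is a no-op, so the inner DQ→Q pair of a Q→DQ→Q→DQ chain can be dropped. A minimal NumPy sketch of that property (illustrative only, not the optimizer's code):

```python
import numpy as np

QMIN, QMAX = -128, 127  # signed 8-bit range

def q(x, scale, zp):
    # Affine quantization: round, shift by zero-point, clip to range.
    return np.clip(np.round(x / scale) + zp, QMIN, QMAX)

def dq(v, scale, zp):
    return (v - zp) * scale

x = np.linspace(-2.0, 2.0, 17)
s, z = 0.05, 3
once = dq(q(x, s, z), s, z)                      # Q -> DQ
twice = dq(q(dq(q(x, s, z), s, z), s, z), s, z)  # Q -> DQ -> Q -> DQ
# The second Q/DQ pair with identical parameters changes nothing.
assert np.array_equal(once, twice)
```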

Motivation and Context

We need to support mixed 16-bit/8-bit precision QDQ models (microsoft#17015). This PR is the first step toward that goal: the QDQ contrib ops must work with our optimizations/transformations.
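
For context on why 16-bit activations matter, a back-of-the-envelope sketch (not from the PR): with the same affine quantization formula, int16 has a far finer step than int8 over the same range, so round-trip error drops by orders of magnitude.

```python
import numpy as np

def quantize(x, scale, zp, qmin, qmax):
    return np.clip(np.round(x / scale) + zp, qmin, qmax)

def dequantize(q, scale, zp):
    return (q - zp) * scale

x = np.array([0.1234, -0.5678, 0.9, -1.0])
s8, s16 = 1.0 / 127, 1.0 / 32767  # scales covering roughly [-1, 1]
x8 = dequantize(quantize(x, s8, 0, -128, 127), s8, 0)
x16 = dequantize(quantize(x, s16, 0, -32768, 32767), s16, 0)
err8 = np.abs(x - x8).max()    # bounded by s8 / 2, about 3.9e-3
err16 = np.abs(x - x16).max()  # bounded by s16 / 2, about 1.5e-5
assert err16 < err8
```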

@adrianlizarraga adrianlizarraga marked this pull request as ready for review August 14, 2023 17:21
@adrianlizarraga adrianlizarraga requested a review from a team as a code owner August 14, 2023 17:21
skottmckay previously approved these changes Aug 24, 2023
edgchen1 previously approved these changes Aug 24, 2023
@yufenglee (Member) commented:
QuantizeLinear, 1,

I think you are adding int16 support. I don't see the change.


Refers to: onnxruntime/core/graph/contrib_ops/quantization_defs.cc:144 in commit 375d3a2.

@adrianlizarraga (Contributor, Author) commented Aug 25, 2023:

I think you are adding int16 support. I don't see the change.

@yufenglee The work is being broken down into separate/smaller PRs. This specific PR focuses on making sure contrib QDQ ops can be optimized in the same manner as ONNX ops (please refer to the PR description for details).

The next PR (linked in the description) adds int16 support, but I'd like to get this one merged in before starting reviews on it.

@yufenglee (Member) left a comment:

:shipit:

@adrianlizarraga adrianlizarraga merged commit 5a83a67 into main Aug 25, 2023
99 checks passed
@adrianlizarraga adrianlizarraga deleted the adrianl/contrib-qdq-optimizations branch August 25, 2023 16:57
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024

Co-authored-by: Edward Chen <[email protected]>
Co-authored-by: Scott McKay <[email protected]>
6 participants