Regenerate `.mlirbc` files for tests and benchmarks after LLVM integrate #17330 #17344

bjacob · 2024-05-10T15:43:43Z

Summary of what happened:

LLVM integrate Integrate both llvm-project@2083e97e (+1 ↩️, +1 🍒) and torch-mlir@bce800a3 #17330 had to disable a number of e2e tests/benchmarks. Specifically, all tests compiling a .mlirbc source that contains a tensor.expand_shape op.
The reason is that [MLIR] Generalize expand_shape to take shape as explicit input llvm/llvm-project#90040 was a compatibility-breaking change to this op. The MLIR bytecode format version was not bumped, so it results in a cryptic error: Integrate both llvm-project@2083e97e (+1 ↩️, +1 🍒) and torch-mlir@bce800a3 #17330 (comment)

This issue is about re-enabling these tests. First, all these .mlirbc files need to be re-generated with tools rebuilt after llvm/llvm-project#90040.

The text was updated successfully, but these errors were encountered:

…800a3 (#17330) * torch-mlir integrated at bce800a. * llvm-project integrated at 2083e97e plus local changes: * Reverted llvm/llvm-project#89131 locally: while this change is good in its own right, the `vector.interleave` that it generates (instead of `vector.shuffle`) are not handled by some GPU codegen lowerings. * Filed #17346. * Cherry-picked Bazel build fix: llvm/llvm-project#91654 * Several e2e tests have been temporarily disabled, follow-up work is needed to reenable them: #17344 --------- Co-authored-by: MaheshRavishankar <[email protected]> Co-authored-by: Scott Todd <[email protected]>

ScottTodd · 2024-05-10T19:55:15Z

was a compatibility-breaking change to this op. The MLIR bytecode format version was not bumped

That's expected. The bytecode format version is not tied to any particular dialect, and the tensor dialect makes no guarantees about its format or ops (unlike, say, VHLO from StableHLO). We've been fairly lucky recently in avoiding similar breaks.

ScottTodd · 2024-05-14T20:11:10Z

I tried to regenerate the .mlirbc files for https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/pytorch/models/resnet50 and https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/pytorch/models/opt-125M, but hit issues with both. Need to apply more rigor to those frontend workflows.

resnet50 crashed in the compiler at mlir::iree_compiler::IREE::Util::serializeResourceRawData D:\dev\projects\iree\compiler\src\iree\compiler\Dialect\Util\IR\UtilAttrs.cpp:230:0
opt-125M regressed in PyTorch at some point - I can't even export it to torch-mlir (marked as failing, along with nearly all other models, at https://github.com/nod-ai/e2eshark-reports/blob/main/2024-05-07/turbine_reports/statusreport.md)

ScottTodd · 2024-05-15T16:12:03Z

Looking through the history of https://github.com/iree-org/iree/commits/main/build_tools/python/e2e_test_framework/models/matmul.py, I can't tell how to regenerate those matmul test files. Quite a few PRs with completely empty descriptions :/

Possibly https://github.com/iree-org/iree-experimental/tree/main/iree-torch/library, https://github.com/iree-org/iree-experimental/tree/main/iree-jax/library, etc.?

We might be able to download the existing files, edit them manually to use the new tensor.expand_shape syntax, then upload them and update the URLs. I'm not sure who has access to https://storage.googleapis.com/iree-model-artifacts/ anymore though. We could push there if someone still has access or push elsewhere (a github repo with LFS, Azure, etc.)

bjacob · 2024-05-16T01:56:02Z

@mariecwhite, we're going to need help here! Context in the issue description above.

ScottTodd · 2024-05-30T16:01:40Z

For https://github.com/iree-org/iree/tree/main/experimental/regression_suite/tests/pregenerated, tests are still disabled:

iree/.github/workflows/pkgci_regression_test.yml

Lines 190 to 197 in 2587078

    
                 # TODO(#17344): regenerate .mlirbc files, test plat_rdna3_rocm on rocm 
        
                 # # In-tree tests 
        
                 # - name: Run experimental/regression_suite tests 
        
                 #   run: | 
        
                 #     source ${VENV_DIR}/bin/activate 
        
                 #     pytest \ 
        
                 #       -rA -s -m "plat_host_cpu and presubmit" \ 
        
                 #       experimental/regression_suite

Instructions for regenerating are at https://github.com/nod-ai/SHARK-Turbine/tree/main/models/turbine_models/custom_models#instructions, but that code hasn't been touched in a while, so it might need other updates too.

ScottTodd · 2024-05-30T21:09:51Z

I tried to regenerate the .mlirbc files for https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/pytorch/models/resnet50 and https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/pytorch/models/opt-125M, but hit issues with both. Need to apply more rigor to those frontend workflows.

resnet50 crashed in the compiler at mlir::iree_compiler::IREE::Util::serializeResourceRawData D:\dev\projects\iree\compiler\src\iree\compiler\Dialect\Util\IR\UtilAttrs.cpp:230:0

opt-125M regressed in PyTorch at some point - I can't even export it to torch-mlir (marked as failing, along with nearly all other models, at https://github.com/nod-ai/e2eshark-reports/blob/main/2024-05-07/turbine_reports/statusreport.md)

Okay, so https://github.com/nod-ai/SHARK-TestSuite/blob/main/.github/workflows/test_e2eshark.yml (what is generating reports like these) is actually pinned to a very old PyTorch version (2.1.0, 8 months old) by using the requirements files in https://github.com/nod-ai/SHARK-Turbine/tree/torch_2.1/core (that repo itself has also moved to https://github.com/iree-org/iree-turbine).

When I try those pinned versions I get RuntimeError: Windows not yet supported for torch.compile
When I try with PyTorch 2.4.0 I get TypeError: forward() got an unexpected keyword argument 'constraints'

…800a3 (iree-org#17330) * torch-mlir integrated at bce800a. * llvm-project integrated at 2083e97e plus local changes: * Reverted llvm/llvm-project#89131 locally: while this change is good in its own right, the `vector.interleave` that it generates (instead of `vector.shuffle`) are not handled by some GPU codegen lowerings. * Filed iree-org#17346. * Cherry-picked Bazel build fix: llvm/llvm-project#91654 * Several e2e tests have been temporarily disabled, follow-up work is needed to reenable them: iree-org#17344 --------- Co-authored-by: MaheshRavishankar <[email protected]> Co-authored-by: Scott Todd <[email protected]>

mariecwhite · 2024-06-27T04:50:26Z

Sorry this got buried in my inbox. I've updated the matmul MLIR files and included instructions on how to regenerate them here: #17748

…800a3 (iree-org#17330) * torch-mlir integrated at bce800a. * llvm-project integrated at 2083e97e plus local changes: * Reverted llvm/llvm-project#89131 locally: while this change is good in its own right, the `vector.interleave` that it generates (instead of `vector.shuffle`) are not handled by some GPU codegen lowerings. * Filed iree-org#17346. * Cherry-picked Bazel build fix: llvm/llvm-project#91654 * Several e2e tests have been temporarily disabled, follow-up work is needed to reenable them: iree-org#17344 --------- Co-authored-by: MaheshRavishankar <[email protected]> Co-authored-by: Scott Todd <[email protected]> Signed-off-by: Lubo Litchev <[email protected]>

bjacob added a commit to bjacob/iree that referenced this issue May 10, 2024

disable more tests and reference iree-org#17344

6983a4c

bjacob mentioned this issue May 10, 2024

Integrate both llvm-project@2083e97e (+1 ↩️, +1 🍒) and torch-mlir@bce800a3 #17330

Merged

bjacob changed the title ~~Regenerate .mlirbc files in e2e tests/benchmarks disabled in LLVM integrate#17330~~ Regenerate .mlirbc files in e2e tests/benchmarks disabled in LLVM integrate #17330 May 10, 2024

ScottTodd changed the title ~~Regenerate .mlirbc files in e2e tests/benchmarks disabled in LLVM integrate #17330~~ Regenerate .mlirbc files for tests and benchmarks after LLVM integrate #17330 May 30, 2024

ScottTodd mentioned this issue Jun 27, 2024

Update dotprod microbenchmark artifacts #17748

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regenerate `.mlirbc` files for tests and benchmarks after LLVM integrate #17330 #17344

Regenerate `.mlirbc` files for tests and benchmarks after LLVM integrate #17330 #17344

bjacob commented May 10, 2024

ScottTodd commented May 10, 2024 •

edited

Loading

ScottTodd commented May 14, 2024

ScottTodd commented May 15, 2024

bjacob commented May 16, 2024

ScottTodd commented May 30, 2024

ScottTodd commented May 30, 2024

mariecwhite commented Jun 27, 2024 •

edited

Loading

Regenerate .mlirbc files for tests and benchmarks after LLVM integrate #17330 #17344

Regenerate .mlirbc files for tests and benchmarks after LLVM integrate #17330 #17344

Comments

bjacob commented May 10, 2024

ScottTodd commented May 10, 2024 • edited Loading

ScottTodd commented May 14, 2024

ScottTodd commented May 15, 2024

bjacob commented May 16, 2024

ScottTodd commented May 30, 2024

ScottTodd commented May 30, 2024

mariecwhite commented Jun 27, 2024 • edited Loading

Regenerate `.mlirbc` files for tests and benchmarks after LLVM integrate #17330 #17344

Regenerate `.mlirbc` files for tests and benchmarks after LLVM integrate #17330 #17344

ScottTodd commented May 10, 2024 •

edited

Loading

mariecwhite commented Jun 27, 2024 •

edited

Loading