C# Sample Error with Phi 3.5 GPU/DML model - Non-zero status code returned while running SkipSimplifiedLayerNormalization node #1074

Closed
AshD opened this issue Nov 19, 2024 · 2 comments

AshD commented Nov 19, 2024

Describe the bug
Running https://github.com/microsoft/onnxruntime-genai/tree/main/examples/csharp/HelloPhi3V sample
with https://huggingface.co/microsoft/Phi-3.5-vision-instruct-onnx/tree/main/gpu/gpu-int4-rtn-block-32

throws Microsoft.ML.OnnxRuntimeGenAI.OnnxRuntimeGenAIException: 'Non-zero status code returned while running SkipSimplifiedLayerNormalization node. Name:'/model/layers.0/post_attention_layernorm/SkipLayerNorm' Status Message: D:\a_work\1\s\include\onnxruntime\core/framework/op_kernel_context.h:42 onnxruntime::OpKernelContext::Input Missing Input: model.layers.0.post_attention_layernorm.weight'

The exception is thrown on the call to generator.ComputeLogits().
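For reference, here is a minimal sketch of the code path that hits this, loosely following the HelloPhi3V sample (the model path, image path, prompt, and max_length below are placeholders, not the sample's exact values):

```csharp
using System;
using Microsoft.ML.OnnxRuntimeGenAI;

// Placeholder paths and prompt for illustration only.
var modelPath = @"C:\models\Phi-3.5-vision-instruct-onnx\gpu\gpu-int4-rtn-block-32";
var imagePath = @"C:\images\example.png";

using var model = new Model(modelPath);
using var processor = new MultiModalProcessor(model);
using var tokenizerStream = processor.CreateStream();

var prompt = "<|user|>\n<|image_1|>\nDescribe the image.<|end|>\n<|assistant|>\n";
using var images = Images.Load(new[] { imagePath });
using var inputs = processor.ProcessImages(prompt, images);

using var generatorParams = new GeneratorParams(model);
generatorParams.SetSearchOption("max_length", 3072);
generatorParams.SetInputs(inputs);

using var generator = new Generator(model, generatorParams);
while (!generator.IsDone())
{
    generator.ComputeLogits();   // OnnxRuntimeGenAIException is thrown here with the GPU model
    generator.GenerateNextToken();
    Console.Write(tokenizerStream.Decode(generator.GetSequence(0)[^1]));
}
```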

Is the above model not compatible with Microsoft.ML.OnnxRuntimeGenAI?

The CPU model seems to work fine.
https://huggingface.co/microsoft/Phi-3.5-vision-instruct-onnx/tree/main/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4

I am looking to use the DML model for Phi-3.5 vision. I assumed this was it. https://huggingface.co/microsoft/Phi-3.5-vision-instruct-onnx/tree/main/gpu/gpu-int4-rtn-block-32

To Reproduce
Steps to reproduce the behavior:

  1. Run the HelloPhi3V sample
  2. Throws above exception
  3. Replace the Microsoft.ML.OnnxRuntimeGenAI package with the Microsoft.ML.OnnxRuntimeGenAI.DirectML NuGet package (see the project-file sketch after this list)
  4. Still throws the above exception
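A sketch of the package swap from step 3, as it would look in the project file (version 0.5.1 matches the setup below; only one of the GenAI flavors should be referenced at a time):

```xml
<ItemGroup>
  <!-- Original reference, replaced with the DirectML flavor -->
  <!-- <PackageReference Include="Microsoft.ML.OnnxRuntimeGenAI" Version="0.5.1" /> -->
  <PackageReference Include="Microsoft.ML.OnnxRuntimeGenAI.DirectML" Version="0.5.1" />
</ItemGroup>
```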

Desktop (please complete the following information):

  • OS: Windows 11, .NET 9, Visual Studio 2022 latest
  • Microsoft.ML.OnnxRuntimeGenAI.DirectML v0.5.1 or Microsoft.ML.OnnxRuntimeGenAI v0.5.1 packages
kunal-vaishnavi (Contributor) commented

> throws Microsoft.ML.OnnxRuntimeGenAI.OnnxRuntimeGenAIException: 'Non-zero status code returned while running SkipSimplifiedLayerNormalization node. Name:'/model/layers.0/post_attention_layernorm/SkipLayerNorm' Status Message: D:\a_work\1\s\include\onnxruntime\core/framework/op_kernel_context.h:42 onnxruntime::OpKernelContext::Input Missing Input: model.layers.0.post_attention_layernorm.weight'

There's a known ONNX Runtime regression for SkipSimplifiedLayerNormalization in v1.20.0. You can downgrade to an older ONNX Runtime version or use a nightly ONNX Runtime build to resolve this until a patch is released after MS Ignite.

> I am looking to use the DML model for Phi-3.5 vision. I assumed this was it. https://huggingface.co/microsoft/Phi-3.5-vision-instruct-onnx/tree/main/gpu/gpu-int4-rtn-block-32

In case you run into DML issues with ONNX Runtime GenAI v0.5.1, there's also a known ONNX Runtime GenAI regression specific to DML. The fix has been merged here. You can downgrade to ONNX Runtime GenAI v0.5.0, build from source, or wait until a patch is released after MS Ignite.
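As a rough illustration of the downgrade option (assuming a PackageReference-style project and the DirectML flavor discussed above; adjust the package name if you use a different flavor):

```xml
<ItemGroup>
  <!-- Pin ONNX Runtime GenAI back to 0.5.0 until the post-Ignite patch ships -->
  <PackageReference Include="Microsoft.ML.OnnxRuntimeGenAI.DirectML" Version="0.5.0" />
</ItemGroup>
```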


AshD commented Nov 19, 2024

Thanks @kunal-vaishnavi, version 0.5.0 fixes this issue.

AshD closed this as completed Nov 19, 2024