You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for reporting. This regression was introduced in NVIDIA/TransformerEngine#1033 and it should be fixed with NVIDIA/TransformerEngine#1329. Hopefully the integration test in that PR will help catch these kinds of errors in the future.
Describe the bug
Megatron-LM doesn't compatible with transformer-engine 1.13.
in transformer-engine:
https://github.com/NVIDIA/TransformerEngine/blob/2643ba1df43397cc84c9da5fe719a66d87ad9a0a/transformer_engine/pytorch/module/layernorm.py#L62-L70
new parameter normalized_shape was declared, but in Megatron-LM:
Megatron-LM/megatron/core/extensions/transformer_engine.py
Lines 64 to 70 in 1b8fce7
the new parameter normalized_shape is missing.
To Reproduce
When directly installing transformer-engine with
transformer engine version will be 1.13, and when you run the scripts in examples, the error will appear.
Environment (please complete the following information):
Fix right now:
install transformer-engine with:
The text was updated successfully, but these errors were encountered: