You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
How can i use Muon with llama model? I run it with Llama, 64 A100
model = LlamaForCausalLM.from_pretrained("meta-llama/Llama-2-7b")
grouped_parameters = [
p for p in model.parameters() if p.requires_grad
]
optimizer = Muon(grouped_parameters)
But it got wrong
[rank3]: File "/xxxxxxxxxxxxxxxxxxxxxxxxxxxxx/optimizer/Muon.py", line 104, in <listcomp>
[rank3]: params = [p for p in group['params'] if self.state[p]['use_muon']]
[rank3]: KeyError: 'use_muon'
When I print the params,it seems that the params in self.state not equal group['params']
The text was updated successfully, but these errors were encountered:
How can i use
Muon
with llama model? I run it with Llama, 64 A100But it got wrong
When I print the params,it seems that the params in
self.state
not equalgroup['params']
The text was updated successfully, but these errors were encountered: