LLaMA 13B model trained with the trlX PPO trainer and saved in Hugging Face format has strange keys at inference; no result is shown and no error is raised, the result seems to disappear #584
Labels
bug
🐛 Describe the bug
trainer = trlx.train(
    reward_fn=reward_fn,
    prompts=prompts,
    eval_prompts=["习近平女儿"] * 4,
    config=config,
)
The saved model does not run inference correctly: no error is raised, but no result is shown either, and the process exits with code 0.
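To narrow down which keys are the "strange" ones, it may help to compare the key names in the saved checkpoint against the prefixes a plain causal LM would be expected to use. The following is only a minimal sketch in pure Python: the helper `partition_keys`, the example key names, and the prefixes `model.`, `lm_head.` and `v_head.` are all hypothetical and used for illustration, not taken from the actual checkpoint.

```python
def partition_keys(keys, expected_prefixes):
    """Split checkpoint key names into expected and unexpected groups."""
    prefixes = tuple(expected_prefixes)
    expected, unexpected = [], []
    for key in keys:
        # str.startswith accepts a tuple and matches any of its prefixes
        if key.startswith(prefixes):
            expected.append(key)
        else:
            unexpected.append(key)
    return expected, unexpected


# Hypothetical key names, for illustration only; replace them with the
# key names read from the saved checkpoint.
example_keys = [
    "model.embed_tokens.weight",
    "model.layers.0.self_attn.q_proj.weight",
    "lm_head.weight",
    "v_head.0.weight",
]

expected, unexpected = partition_keys(example_keys, ["model.", "lm_head."])
print("unexpected keys:", unexpected)
```

If the unexpected keys turn out to share a common prefix, that would suggest they come from an extra module added during PPO training rather than from the base model, but this is an assumption that would need to be checked against the real checkpoint.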
Which trlX version are you using?
No response
Additional system and package information
No response