simple-llama3

A simple tested pytorch implementation of llama3 without fairscale.

If you want to understand the transformer model I recommend you to read my implementation of a vanilla transformer, since I rehuse some code here.