v1.0.0
First stable release!
Fixed
- Fix bug preventing the embeddings model from being traversed on nn.Apply.
- Fix incorrect use of self-attention cache when used for cross-attention.
Changed
- Optimize implementation of some Dense matrix functions, especially on
amd64 with AVX.