This is printed when I call functional.scaled_dot_product_attention:
[W914 13:25:36.000000000 sdp_utils.cpp:555] Warning: 1Torch was not compiled with flash attention. (function operator ())
I'm on Windows with TorchSharp-cuda-windows=0.103.0.
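For reference, a minimal sketch that should trigger the same warning, assuming a CUDA device; the tensor shapes are placeholders, not sizes taken from this issue:

```csharp
using TorchSharp;
using static TorchSharp.torch;
using F = TorchSharp.torch.nn.functional;

// Placeholder shapes: (batch, heads, seq_len, head_dim).
var q = randn(new long[] { 1, 8, 128, 64 }, device: CUDA);
var k = randn(new long[] { 1, 8, 128, 64 }, device: CUDA);
var v = randn(new long[] { 1, 8, 128, 64 }, device: CUDA);

// On libtorch builds that lack the flash-attention kernel (typical of the
// Windows binaries), this logs the sdp_utils.cpp warning and falls back to
// another SDPA backend; the call itself still succeeds.
var y = F.scaled_dot_product_attention(q, k, v, is_casual: true);
```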
Can you show the actual line of code you used? Are you getting the warning at runtime, or at compile/interpret time?
I don't see this warning when using a CausalSelfAttention layer inside a transformer architecture.
This is the line of code I used:

// "Flash" attention
var y = F.scaled_dot_product_attention(q, k, v, is_casual: true);

where q, k, v are the query, key, and value tensors produced by a causal attention linear layer.
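For context, a nanoGPT-style sketch of such a layer is below; the class, field names (c_attn, c_proj, n_head, n_embd), and shapes are illustrative assumptions, not the actual code from this issue:

```csharp
using TorchSharp;
using static TorchSharp.torch;
using static TorchSharp.torch.nn;
using F = TorchSharp.torch.nn.functional;

// Illustrative causal self-attention module; names and sizes are assumptions.
sealed class CausalSelfAttention : Module<Tensor, Tensor>
{
    private readonly Module<Tensor, Tensor> c_attn;
    private readonly Module<Tensor, Tensor> c_proj;
    private readonly long n_head;

    public CausalSelfAttention(long n_embd, long n_head) : base(nameof(CausalSelfAttention))
    {
        this.n_head = n_head;
        c_attn = Linear(n_embd, 3 * n_embd); // one fused projection for q, k, v
        c_proj = Linear(n_embd, n_embd);     // output projection
        RegisterComponents();
    }

    public override Tensor forward(Tensor x)
    {
        long B = x.shape[0], T = x.shape[1], C = x.shape[2];

        // Split the fused projection into q, k, v and reshape each to
        // (B, n_head, T, head_dim) for multi-head attention.
        var qkv = c_attn.forward(x).split(C, 2);
        var q = qkv[0].view(B, T, n_head, C / n_head).transpose(1, 2);
        var k = qkv[1].view(B, T, n_head, C / n_head).transpose(1, 2);
        var v = qkv[2].view(B, T, n_head, C / n_head).transpose(1, 2);

        // The call that emits the sdp_utils.cpp warning when libtorch lacks
        // the flash-attention kernel; is_casual is TorchSharp's spelling of
        // the parameter in 0.103.0.
        var y = F.scaled_dot_product_attention(q, k, v, is_casual: true);

        // Merge the heads back and apply the output projection.
        return c_proj.forward(y.transpose(1, 2).contiguous().view(B, T, C));
    }
}
```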