-
Notifications
You must be signed in to change notification settings - Fork 125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[NDTensors] Circumvent scalar indexing to improve GPU performance #1216
Conversation
…rs.jl into kmp5/enhancements/more_gpu
## TODO here it looks at the elements of S so convert to CPU when on GPU | ||
## Could write this as a GPU impl which just converts S to array. S | ||
## is not used again so we don't need to convert back to GPU. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same suggestion here as above for truncate!!
, I think we should use leaf_parenttype
dispatch.
D[n] = R[n, n] | ||
end | ||
n = size(R)[1] | ||
D = diag(R, (n - Nd)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure this is doing what you think it is doing, diag(M, k)
selects the k
th off-diagonal of the matrix.
I went with:
D[1:Nd] = diag(R)[1:Nd]
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah I see, thanks!
A lot of this PR is superseded by #1215. |
@mtfishman yes I'll close this PR and open a new one if I find something awry. Thanks! |
Description
As a continued effort to make the ITensors efficient on GPU in this branch I am identifying and fixing areas that were missed in previous PRs. First I am looking at the profiling of GPU based SVD and will also address issues in 1193
TODO