-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Save twiddles #380
Save twiddles #380
Conversation
ce789e0
to
3418d99
Compare
3b7159e
to
97eb1d5
Compare
3418d99
to
0a4a8e1
Compare
97eb1d5
to
8e89407
Compare
0a4a8e1
to
70d7f2d
Compare
8e89407
to
71cf1a2
Compare
1754cef
to
dbdee89
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 0 of 1 files reviewed, 1 unresolved discussion (waiting on @spapinistarkware)
src/core/backend/avx512/fft.rs
line 276 at r2 (raw file):
/// # Safety /// This function is safe. pub unsafe fn vecwise_ibutterflies(
this is only used on the first 4 layers? you can't use it in the middle of the FFT?
Code quote:
vecwise_ibutterflies
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 0 of 1 files reviewed, 1 unresolved discussion (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 276 at r2 (raw file):
Previously, ilyalesokhin-starkware wrote…
this is only used on the first 4 layers? you can't use it in the middle of the FFT?
In this implementation, no.
It's good for ffts up to 2^28 elements, which I think is good for now.
igher than that, we would also need to transpose inside the vec, which I currently don't do.
5054180
to
83b4507
Compare
dbdee89
to
7df206b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 1 of 1 files at r3, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @spapinistarkware)
83b4507
to
63d67e8
Compare
7df206b
to
19779ce
Compare
63d67e8
to
06dd0af
Compare
19779ce
to
becaa49
Compare
what is this? Code quote: const INDICES_FROM_T1: __m512i = unsafe {
core::mem::transmute([
0b0001, 0b0001, 0b0000, 0b0000, 0b0011, 0b0011, 0b0010, 0b0010, 0b0101, 0b0101, 0b0100,
0b0100, 0b0111, 0b0111, 0b0110, 0b0110,
])
};
const NEGATION_MASK: __m512i = unsafe {
core::mem::transmute([0i32, -2, -2, 0, 0, -2, -2, 0, 0, -2, -2, 0, 0, -2, -2, 0])
};
let t = _mm512_permutexvar_epi32(INDICES_FROM_T1, t);
let t = _mm512_xor_epi32(t, NEGATION_MASK); |
54fd21a
to
7ea6c55
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 0 of 1 files reviewed, 1 unresolved discussion (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 274 at r4 (raw file):
Previously, ilyalesokhin-starkware wrote…
what is this?
// See the comments in its body for more info.
887603e
to
50b6cbb
Compare
becaa49
to
5615ce4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed all commit messages.
Reviewable status: 0 of 1 files reviewed, 2 unresolved discussions (waiting on @spapinistarkware)
src/core/backend/avx512/fft.rs
line 328 at r5 (raw file):
}; let t = _mm512_permutexvar_epi32(INDICES_FROM_T1, t); let t = _mm512_xor_epi32(t, NEGATION_MASK);
can you move this to a function?
compute_layer0_twiddles or something like that?
Code quote:
const INDICES_FROM_T1: __m512i = unsafe {
core::mem::transmute([
0b0001, 0b0001, 0b0000, 0b0000, 0b0011, 0b0011, 0b0010, 0b0010, 0b0101, 0b0101, 0b0100,
0b0100, 0b0111, 0b0111, 0b0110, 0b0110,
])
};
const NEGATION_MASK: __m512i = unsafe {
core::mem::transmute([0i32, -2, -2, 0, 0, -2, -2, 0, 0, -2, -2, 0, 0, -2, -2, 0])
};
let t = _mm512_permutexvar_epi32(INDICES_FROM_T1, t);
let t = _mm512_xor_epi32(t, NEGATION_MASK);
50b6cbb
to
fc239c1
Compare
5615ce4
to
436d4b5
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 0 of 1 files reviewed, 2 unresolved discussions (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 328 at r5 (raw file):
Previously, ilyalesokhin-starkware wrote…
can you move this to a function?
compute_layer0_twiddles or something like that?
Done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 1 of 1 files at r6, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @spapinistarkware)
Merge activity
|
<!-- Reviewable:start --> This change is [<img src="https://reviewable.io/review_button.svg" height="34" align="absmiddle" alt="Reviewable"/>](https://reviewable.io/reviews/starkware-libs/stwo/380) <!-- Reviewable:end -->
436d4b5
to
20a6d22
Compare
This change is