Implement special swizzles for masks and remove `{to,from}_bitmask_vector` #423

calebzulawski · 2024-06-05T18:05:15Z

Bitmask vectors are a bit of a compromise, to allow bitmasks to be useful on vectors with length >64. To work well with generics they are `Simd<u8, N>`, which is wasteful: that is 8x bigger than they need to be.

Additionally, in #422, I'm finding that bitmask vector codegen doesn't work for non-powers-of-two, and even with a workaround, seems to have incorrect codegen on a handful of architectures.

In this PR I add an extract swizzle, and implement all of the special swizzles for masks, which I believe allows us to remove bitmask vectors. With a hypothetical 128-element vector, you could do:

let bitmasks: [u64; 2] = [mask.to_bitmask(), mask.extract::<64, 64>().to_bitmask()];
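To make the idea concrete without depending on the nightly-only portable-simd API, here is a minimal sketch on stable Rust in which a plain `[bool; 128]` stands in for a hypothetical 128-element mask. The helpers `to_bitmask`, `extract`, and `bitmasks_128` are illustrative stand-ins written for this example, not the crate's implementation, and the sketch extracts the first 64 lanes explicitly rather than relying on how the real `to_bitmask` treats wider masks.

```rust
/// Stand-in for a mask-to-integer conversion: packs up to 64 lanes
/// into a u64, with lane 0 in the least significant bit.
fn to_bitmask(lanes: &[bool]) -> u64 {
    assert!(lanes.len() <= 64, "at most 64 lanes fit in a u64");
    lanes
        .iter()
        .enumerate()
        .fold(0u64, |bits, (i, &set)| bits | ((set as u64) << i))
}

/// Stand-in for an extract swizzle: copies LEN lanes starting at START.
fn extract<const START: usize, const LEN: usize>(lanes: &[bool]) -> [bool; LEN] {
    let mut out = [false; LEN];
    out.copy_from_slice(&lanes[START..START + LEN]);
    out
}

/// Splits a 128-lane mask into two 64-bit bitmasks, low half first.
fn bitmasks_128(mask: &[bool; 128]) -> [u64; 2] {
    [
        to_bitmask(&extract::<0, 64>(mask)),
        to_bitmask(&extract::<64, 64>(mask)),
    ]
}

fn main() {
    let mut mask = [false; 128];
    mask[0] = true;
    mask[3] = true;
    mask[64] = true;
    mask[127] = true;

    let bitmasks = bitmasks_128(&mask);
    // Lanes 0 and 3 land in the first word, lanes 64 and 127 in the second.
    assert_eq!(bitmasks, [0b1001, (1u64 << 63) | 1]);
}
```

The packed form needs 16 bytes for 128 lanes, whereas a `u8`-per-lane bitmask vector of the same length would occupy 128 bytes.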

programmerjake

looks good enough, though I think we should re-add conversions from arbitrary sized Mask from/to integers to make >64 element vectors more ergonomic, maybe when Rust finally gains uint<N> types.

RalfJung · 2024-06-08T15:05:19Z

> Additionally, in #422, I'm finding that bitmask vector codegen doesn't work for non-powers-of-two, and even with a workaround, seems to have incorrect codegen on a handful of architectures.

Is this a bug in the code, or in codegen? If the code behaves correctly with Miri but gives the wrong result with codegen then that seems like a critical bug to me that needs investigation.

RalfJung · 2024-06-08T15:08:39Z

crates/core_simd/src/masks/full_masks.rs

-        unsafe {
-            // Compute the bitmask
-            let mut bytes: <LaneCount<N> as SupportedLaneCount>::BitMask =
-                core::intrinsics::simd::simd_bitmask(self.0);


This intrinsic still has some other uses... are those working fine with non-power-of-2?

Answered here:

We still use the intrinsic, but currently (not yet synced to rust-lang/rust) we only use integer return types

RalfJung · 2024-06-08T15:42:58Z

> looks good enough, though I think we should re-add conversions from arbitrary sized Mask from/to integers to make >64 element vectors more ergonomic, maybe when Rust finally gains uint<N> types.

Are vectors with more than 64 elements a thing? Miri currently supports simd_bitmask and simd_select_bitmask only for vectors up to size 64, which I thought was sufficient since this crate also only goes up to 64.

RalfJung · 2024-06-08T17:36:13Z

> Additionally, in #422, I'm finding that bitmask vector codegen doesn't work for non-powers-of-two, and even with a workaround, seems to have incorrect codegen on a handful of architectures.

Did things break only on big-endian or also on little-endian?
If the trouble was big-endian-only, it could be related to the behavior of simd_bitmask not matching what to_bitmask_vector expected -- see rust-lang/rust#126171.
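As a purely illustrative sketch of how a bit-order mismatch of this kind can produce wrong results, the following stable-Rust example packs the same eight lanes under two conventions. It does not reproduce the actual behavior of `simd_bitmask` on any target; `pack_lsb_first` and `pack_msb_first` are hypothetical helpers written for this example.

```rust
/// Packs eight lanes with lane 0 in the least significant bit.
fn pack_lsb_first(lanes: &[bool; 8]) -> u8 {
    lanes
        .iter()
        .enumerate()
        .fold(0u8, |bits, (i, &set)| bits | ((set as u8) << i))
}

/// Packs eight lanes with lane 0 in the most significant bit.
fn pack_msb_first(lanes: &[bool; 8]) -> u8 {
    lanes
        .iter()
        .fold(0u8, |bits, &set| (bits << 1) | (set as u8))
}

fn main() {
    // Only lanes 0 and 1 are set.
    let lanes = [true, true, false, false, false, false, false, false];

    assert_eq!(pack_lsb_first(&lanes), 0b0000_0011);
    assert_eq!(pack_msb_first(&lanes), 0b1100_0000);

    // The two conventions are bit-reversals of each other, so code that
    // assumes one while receiving the other reads the wrong lanes.
    assert_eq!(pack_lsb_first(&lanes).reverse_bits(), pack_msb_first(&lanes));
}
```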

programmerjake · 2024-06-09T04:55:40Z

> Are vectors with more than 64 elements a thing?

yes, e.g. RISC-V V theoretically supports vectors with thousands of elements. NEC's SX-Aurora supports f64x64 so presumably supports f32x128 too (though idk for sure).

> Miri currently supports simd_bitmask and simd_select_bitmask only for vectors up to size 64, which I thought was sufficient since this crate also only goes up to 64.

portable-simd supports at most 64 elements only because that's where we stopped for now, since we currently have to implement some things for each size manually; the long-term plan is for the max size to be much larger. This is kinda like how Rust used to implement Debug only for arrays with <= 32 elements, simply because the compiler wasn't good enough yet.

calebzulawski added 2 commits June 5, 2024 13:51

Add extend special swizzle fn, and implement special swizzle fns for masks

675401b

Remove bitmask vectors in favor of extracting bitmasks

3733375

calebzulawski requested review from programmerjake and workingjubilee June 5, 2024 18:05

Fix clippy lints

bd92b7c

programmerjake approved these changes Jun 5, 2024

calebzulawski merged commit 8c31005 into master Jun 5, 2024
64 checks passed

calebzulawski deleted the bitmask-again-again-again branch June 5, 2024 23:46

RalfJung reviewed Jun 8, 2024

calebzulawski mentioned this pull request Jun 8, 2024

portable-simd: add test for non-power-of-2 bitmask rust-lang/miri#3655

Merged

This was referenced Jun 9, 2024

simd_bitmask: support vectors larger than 64 elements rust-lang/miri#3658

Open

What should SIMD bitmasks look like? rust-lang/rust#126217

Open
