Use alloca to address memory layout effects #744

PSeitz · 2023-11-22T12:10:24Z

replace collect with Vec::reserve
the iter collect pollutes the call stack (10 levels of calls)
offset tests with alloca
As mentioned in Eliminate memory layout bias when measuring (with LLVM stabilizer) #334, randomness in memory layout has a huge impact on the bench results. When I benchmark with criterion it's not uncommon to see effects of +-30%. Tests seem much more stable with alloca, there's still random 6% jumps in one test I've been running.

Adresses #334

the iter collect pollutes the call stack (10 levels off calls)

samueltardieu

I like this changes. They should give better stability by increasing testing time variability.

samueltardieu · 2024-03-29T07:59:15Z

src/routine.rs

@@ -277,6 +293,8 @@ where
            }

            b.iters = b.iters.wrapping_mul(2);
+            b.iters = b.iters.min(64); // To make sure we offset the test at least with 0-64 bytes
+                                       // wit alloca


Nit: typo ("wit" -> "with"). Also, the comment is typically placed above the line, not on the side.

samueltardieu · 2024-03-29T08:06:11Z

src/routine.rs

+                    stack_alloc, /* how much bytes we want to allocate */
+                    |_memory: &mut [core::mem::MaybeUninit<u8>] /* dynamically stack allocated slice itself */| {


Maybe stack_alloc should be inlined here, to prevent getting a warning on platforms not selected by the cfg attribute. Using a name such as _shifting_stack_space or similar instead of _memory would be self-documenting.

FilipAndersson245 · 2024-04-01T20:42:12Z

Seems quite straightforward and if this helps in reducing random changes between runs I'm all for this.

waywardmonkeys · 2024-07-27T16:13:32Z

@PSeitz Thanks for your contribution! Are you interested in updating this PR for the review comments or should we take it on?

PSeitz added 4 commits May 26, 2023 13:16

replace collect with Vec::reserve

d0af36c

the iter collect pollutes the call stack (10 levels off calls)

offset tests with alloca

cf60ffc

use alloca only in windows and unix

17159c7

limit stackalloc to page size 4096

e6f98ee

samueltardieu suggested changes Mar 29, 2024

View reviewed changes

waywardmonkeys mentioned this pull request Jul 27, 2024

Next release? #724

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use alloca to address memory layout effects #744

Use alloca to address memory layout effects #744

PSeitz commented Nov 22, 2023

samueltardieu left a comment

samueltardieu Mar 29, 2024

samueltardieu Mar 29, 2024

FilipAndersson245 commented Apr 1, 2024

waywardmonkeys commented Jul 27, 2024

		stack_alloc, /* how much bytes we want to allocate */
		\|_memory: &mut [core::mem::MaybeUninit<u8>] /* dynamically stack allocated slice itself */\| {

Use alloca to address memory layout effects #744

Are you sure you want to change the base?

Use alloca to address memory layout effects #744

Conversation

PSeitz commented Nov 22, 2023

samueltardieu left a comment

Choose a reason for hiding this comment

samueltardieu Mar 29, 2024

Choose a reason for hiding this comment

samueltardieu Mar 29, 2024

Choose a reason for hiding this comment

FilipAndersson245 commented Apr 1, 2024

waywardmonkeys commented Jul 27, 2024