Matching engine with <1ns (~800as) average (4 core 8 GB linux x86_64 small virtual machine) per order fill and/or insert
Uses preallocated on the stack order storage with local "wink-out" allocator and simd ultra fast (deterministic) bitmap linear search
0.997 - 0.999 CPUs used (perf stat)