Version 2
Refactored & reduced shared memory usage (depends on TFHEpp for the parameter set select, runnable on old GPUs like GTX 1060Ti but slow)
Refactored & reduced shared memory usage (depends on TFHEpp for the parameter set select, runnable on old GPUs like GTX 1060Ti but slow)