You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is my test result. I turned off IOMMU and ACS. It improved a little bit compared to before, but it is still very low. This rate is not even as good as the 3090x8 I used before.
This is my test result. I turned off IOMMU and ACS. It improved a little bit compared to before, but it is still very low. This rate is not even as good as the 3090x8 I used before.
NCCL_ALGO=RING CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 numactl --cpunodebind=0,1,4,5 --membind=0,1,4,5 ./build/all_reduce_perf -b 256 -e 1G -w 20 -n 100 -f 2 -g 8
nThread 1 nGpus 8 minBytes 256 maxBytes 1073741824 step: 2(factor) warmup iters: 20 iters: 100 agg iters: 1 validation: 1 graph: 0
Using devices
Rank 0 Group 0 Pid 6151 on ubuntu device 0 [0x01] NVIDIA GeForce RTX 4090
Rank 1 Group 0 Pid 6151 on ubuntu device 1 [0x25] NVIDIA GeForce RTX 4090
Rank 2 Group 0 Pid 6151 on ubuntu device 2 [0x41] NVIDIA GeForce RTX 4090
Rank 3 Group 0 Pid 6151 on ubuntu device 3 [0x61] NVIDIA GeForce RTX 4090
Rank 4 Group 0 Pid 6151 on ubuntu device 4 [0x81] NVIDIA GeForce RTX 4090
Rank 5 Group 0 Pid 6151 on ubuntu device 5 [0xa1] NVIDIA GeForce RTX 4090
Rank 6 Group 0 Pid 6151 on ubuntu device 6 [0xc1] NVIDIA GeForce RTX 4090
Rank 7 Group 0 Pid 6151 on ubuntu device 7 [0xe1] NVIDIA GeForce RTX 4090
out-of-place in-place
size count type redop root time algbw busbw #wrong time algbw busbw #wrong
(B) (elements) (us) (GB/s) (GB/s) (us) (GB/s) (GB/s)
134217728 33554432 float sum -1 45706 2.94 5.14 0 45696 2.94 5.14 0
268435456 67108864 float sum -1 91494 2.93 5.13 0 91512 2.93 5.13 0
536870912 134217728 float sum -1 183860 2.92 5.11 0 183851 2.92 5.11 0
1073741824 268435456 float sum -1 371272 2.89 5.06 0 371306 2.89 5.06 0
Out of bounds values : 0 OK
Avg bus bandwidth : 2.99014
The text was updated successfully, but these errors were encountered: