[CI] Fix dreamer run in SOTA tests #2627

Merged (1 commit) on Dec 3, 2024

@vmoens (Contributor) commented Dec 3, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot bot commented Dec 3, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2627

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 8 Unrelated Failures

As of commit e33b963 with merge base 3da76f0:

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Dec 3, 2024
ghstack-source-id: dfe3ab6fe0d29fcdcaf57f31f84d04e07e36bad3
Pull Request resolved: #2627
@facebook-github-bot added the CLA Signed label Dec 3, 2024
@vmoens merged commit e33b963 into gh/vmoens/48/base Dec 3, 2024
48 of 58 checks passed
@vmoens deleted the gh/vmoens/48/head branch December 3, 2024 14:53

github-actions bot commented Dec 3, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}9$.

Detailed results (columns, in order: Name, Max, Mean, Ops, Ops on Repo HEAD, Change):
test_simple 0.4328s 0.4267s 2.3435 Ops/s 2.2424 Ops/s $\color{#35bf28}+4.51\%$
test_transformed 0.6253s 0.6182s 1.6177 Ops/s 1.6553 Ops/s $\color{#d91a1a}-2.27\%$
test_serial 1.3643s 1.3552s 0.7379 Ops/s 0.7345 Ops/s $\color{#35bf28}+0.47\%$
test_parallel 1.3824s 1.3171s 0.7593 Ops/s 0.7522 Ops/s $\color{#35bf28}+0.94\%$
test_step_mdp_speed[True-True-True-True-True] 0.1773ms 30.4046μs 32.8897 KOps/s 33.7104 KOps/s $\color{#d91a1a}-2.43\%$
test_step_mdp_speed[True-True-True-True-False] 49.3920μs 17.8111μs 56.1447 KOps/s 56.5193 KOps/s $\color{#d91a1a}-0.66\%$
test_step_mdp_speed[True-True-True-False-True] 42.5900μs 17.0949μs 58.4969 KOps/s 57.5494 KOps/s $\color{#35bf28}+1.65\%$
test_step_mdp_speed[True-True-True-False-False] 54.4020μs 10.0753μs 99.2529 KOps/s 101.1427 KOps/s $\color{#d91a1a}-1.87\%$
test_step_mdp_speed[True-True-False-True-True] 91.0100μs 32.4960μs 30.7730 KOps/s 32.1033 KOps/s $\color{#d91a1a}-4.14\%$
test_step_mdp_speed[True-True-False-True-False] 58.3590μs 19.6243μs 50.9572 KOps/s 51.6361 KOps/s $\color{#d91a1a}-1.31\%$
test_step_mdp_speed[True-True-False-False-True] 85.4450μs 18.9270μs 52.8344 KOps/s 55.1473 KOps/s $\color{#d91a1a}-4.19\%$
test_step_mdp_speed[True-True-False-False-False] 40.5670μs 11.9899μs 83.4039 KOps/s 85.3039 KOps/s $\color{#d91a1a}-2.23\%$
test_step_mdp_speed[True-False-True-True-True] 90.9700μs 34.0681μs 29.3530 KOps/s 30.0519 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[True-False-True-True-False] 55.3740μs 21.7631μs 45.9492 KOps/s 47.6147 KOps/s $\color{#d91a1a}-3.50\%$
test_step_mdp_speed[True-False-True-False-True] 45.5850μs 18.9214μs 52.8503 KOps/s 54.2248 KOps/s $\color{#d91a1a}-2.53\%$
test_step_mdp_speed[True-False-True-False-False] 43.9720μs 11.9650μs 83.5772 KOps/s 85.5743 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[True-False-False-True-True] 83.6970μs 35.7944μs 27.9373 KOps/s 28.7687 KOps/s $\color{#d91a1a}-2.89\%$
test_step_mdp_speed[True-False-False-True-False] 53.6310μs 23.5785μs 42.4116 KOps/s 43.9332 KOps/s $\color{#d91a1a}-3.46\%$
test_step_mdp_speed[True-False-False-False-True] 0.6027ms 20.8723μs 47.9104 KOps/s 50.0478 KOps/s $\color{#d91a1a}-4.27\%$
test_step_mdp_speed[True-False-False-False-False] 50.0640μs 13.7596μs 72.6767 KOps/s 74.4318 KOps/s $\color{#d91a1a}-2.36\%$
test_step_mdp_speed[False-True-True-True-True] 92.6130μs 34.2046μs 29.2359 KOps/s 30.2202 KOps/s $\color{#d91a1a}-3.26\%$
test_step_mdp_speed[False-True-True-True-False] 63.8490μs 21.9373μs 45.5844 KOps/s 47.0352 KOps/s $\color{#d91a1a}-3.08\%$
test_step_mdp_speed[False-True-True-False-True] 76.4570μs 20.9518μs 47.7285 KOps/s 46.9703 KOps/s $\color{#35bf28}+1.61\%$
test_step_mdp_speed[False-True-True-False-False] 38.1220μs 13.0878μs 76.4068 KOps/s 76.8590 KOps/s $\color{#d91a1a}-0.59\%$
test_step_mdp_speed[False-True-False-True-True] 73.5080μs 35.6411μs 28.0575 KOps/s 28.4927 KOps/s $\color{#d91a1a}-1.53\%$
test_step_mdp_speed[False-True-False-True-False] 69.2200μs 23.4633μs 42.6197 KOps/s 43.7652 KOps/s $\color{#d91a1a}-2.62\%$
test_step_mdp_speed[False-True-False-False-True] 2.7263ms 22.9449μs 43.5827 KOps/s 45.4368 KOps/s $\color{#d91a1a}-4.08\%$
test_step_mdp_speed[False-True-False-False-False] 65.7830μs 14.9636μs 66.8288 KOps/s 67.7154 KOps/s $\color{#d91a1a}-1.31\%$
test_step_mdp_speed[False-False-True-True-True] 86.6620μs 37.8212μs 26.4402 KOps/s 27.2834 KOps/s $\color{#d91a1a}-3.09\%$
test_step_mdp_speed[False-False-True-True-False] 0.1019ms 24.9854μs 40.0233 KOps/s 40.2923 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[False-False-True-False-True] 71.5240μs 22.8229μs 43.8156 KOps/s 44.2898 KOps/s $\color{#d91a1a}-1.07\%$
test_step_mdp_speed[False-False-True-False-False] 40.5060μs 14.9517μs 66.8819 KOps/s 67.6072 KOps/s $\color{#d91a1a}-1.07\%$
test_step_mdp_speed[False-False-False-True-True] 0.1199ms 38.9876μs 25.6492 KOps/s 25.9855 KOps/s $\color{#d91a1a}-1.29\%$
test_step_mdp_speed[False-False-False-True-False] 65.5630μs 26.8316μs 37.2695 KOps/s 37.7905 KOps/s $\color{#d91a1a}-1.38\%$
test_step_mdp_speed[False-False-False-False-True] 63.5790μs 24.6175μs 40.6215 KOps/s 41.3368 KOps/s $\color{#d91a1a}-1.73\%$
test_step_mdp_speed[False-False-False-False-False] 60.1970μs 16.5397μs 60.4606 KOps/s 61.4471 KOps/s $\color{#d91a1a}-1.61\%$
test_values[generalized_advantage_estimate-True-True] 12.9417ms 9.4562ms 105.7503 Ops/s 105.3054 Ops/s $\color{#35bf28}+0.42\%$
test_values[vec_generalized_advantage_estimate-True-True] 39.5531ms 36.1476ms 27.6643 Ops/s 29.6197 Ops/s $\textbf{\color{#d91a1a}-6.60\%}$
test_values[td0_return_estimate-False-False] 0.2284ms 0.1936ms 5.1654 KOps/s 5.3718 KOps/s $\color{#d91a1a}-3.84\%$
test_values[td1_return_estimate-False-False] 27.8563ms 24.3971ms 40.9884 Ops/s 42.3523 Ops/s $\color{#d91a1a}-3.22\%$
test_values[vec_td1_return_estimate-False-False] 38.8140ms 36.1651ms 27.6510 Ops/s 29.3310 Ops/s $\textbf{\color{#d91a1a}-5.73\%}$
test_values[td_lambda_return_estimate-True-False] 37.9142ms 34.4483ms 29.0290 Ops/s 29.2960 Ops/s $\color{#d91a1a}-0.91\%$
test_values[vec_td_lambda_return_estimate-True-False] 37.8489ms 35.9903ms 27.7853 Ops/s 29.4753 Ops/s $\textbf{\color{#d91a1a}-5.73\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 12.3680ms 8.2935ms 120.5761 Ops/s 123.2987 Ops/s $\color{#d91a1a}-2.21\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3214ms 2.0229ms 494.3439 Ops/s 495.0325 Ops/s $\color{#d91a1a}-0.14\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6043ms 0.3566ms 2.8040 KOps/s 2.7521 KOps/s $\color{#35bf28}+1.89\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 50.2356ms 49.1005ms 20.3664 Ops/s 23.7264 Ops/s $\textbf{\color{#d91a1a}-14.16\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.2140ms 3.0310ms 329.9292 Ops/s 325.7523 Ops/s $\color{#35bf28}+1.28\%$
test_dqn_speed[False-None] 2.2136ms 1.4086ms 709.9388 Ops/s 673.6772 Ops/s $\textbf{\color{#35bf28}+5.38\%}$
test_dqn_speed[False-backward] 2.0572ms 1.8950ms 527.7145 Ops/s 487.7709 Ops/s $\textbf{\color{#35bf28}+8.19\%}$
test_dqn_speed[True-None] 1.3402ms 0.4727ms 2.1155 KOps/s 2.0790 KOps/s $\color{#35bf28}+1.76\%$
test_dqn_speed[True-backward] 0.9458ms 0.9029ms 1.1076 KOps/s 1.1040 KOps/s $\color{#35bf28}+0.33\%$
test_dqn_speed[reduce-overhead-None] 0.7999ms 0.4711ms 2.1226 KOps/s 2.0698 KOps/s $\color{#35bf28}+2.55\%$
test_dqn_speed[reduce-overhead-backward] 0.9397ms 0.8933ms 1.1194 KOps/s 1.0235 KOps/s $\textbf{\color{#35bf28}+9.38\%}$
test_ddpg_speed[False-None] 3.4584ms 2.8758ms 347.7289 Ops/s 344.2907 Ops/s $\color{#35bf28}+1.00\%$
test_ddpg_speed[False-backward] 4.1034ms 4.0094ms 249.4142 Ops/s 248.8268 Ops/s $\color{#35bf28}+0.24\%$
test_ddpg_speed[True-None] 1.4557ms 0.9997ms 1.0003 KOps/s 998.0711 Ops/s $\color{#35bf28}+0.22\%$
test_ddpg_speed[True-backward] 1.9443ms 1.8912ms 528.7760 Ops/s 516.0584 Ops/s $\color{#35bf28}+2.46\%$
test_ddpg_speed[reduce-overhead-None] 1.2298ms 0.9999ms 1.0001 KOps/s 1.0021 KOps/s $\color{#d91a1a}-0.20\%$
test_ddpg_speed[reduce-overhead-backward] 2.0033ms 1.9072ms 524.3310 Ops/s 528.7951 Ops/s $\color{#d91a1a}-0.84\%$
test_sac_speed[False-None] 8.6859ms 8.0212ms 124.6696 Ops/s 124.4608 Ops/s $\color{#35bf28}+0.17\%$
test_sac_speed[False-backward] 11.6456ms 10.7209ms 93.2754 Ops/s 93.2029 Ops/s $\color{#35bf28}+0.08\%$
test_sac_speed[True-None] 2.4396ms 1.8200ms 549.4394 Ops/s 530.0337 Ops/s $\color{#35bf28}+3.66\%$
test_sac_speed[True-backward] 3.6413ms 3.5475ms 281.8879 Ops/s 281.4674 Ops/s $\color{#35bf28}+0.15\%$
test_sac_speed[reduce-overhead-None] 2.3591ms 1.8208ms 549.2096 Ops/s 538.8040 Ops/s $\color{#35bf28}+1.93\%$
test_sac_speed[reduce-overhead-backward] 3.9593ms 3.5057ms 285.2516 Ops/s 283.3807 Ops/s $\color{#35bf28}+0.66\%$
test_redq_speed[False-None] 14.6361ms 12.9922ms 76.9690 Ops/s 77.4042 Ops/s $\color{#d91a1a}-0.56\%$
test_redq_speed[False-backward] 25.3073ms 22.6856ms 44.0808 Ops/s 44.5901 Ops/s $\color{#d91a1a}-1.14\%$
test_redq_speed[True-None] 6.5815ms 4.5070ms 221.8758 Ops/s 221.8233 Ops/s $\color{#35bf28}+0.02\%$
test_redq_speed[True-backward] 13.6863ms 11.9053ms 83.9960 Ops/s 79.6683 Ops/s $\textbf{\color{#35bf28}+5.43\%}$
test_redq_speed[reduce-overhead-None] 6.1390ms 4.5634ms 219.1363 Ops/s 213.2741 Ops/s $\color{#35bf28}+2.75\%$
test_redq_speed[reduce-overhead-backward] 13.1481ms 12.1379ms 82.3868 Ops/s 79.2107 Ops/s $\color{#35bf28}+4.01\%$
test_redq_deprec_speed[False-None] 13.7400ms 12.6586ms 78.9975 Ops/s 75.9584 Ops/s $\color{#35bf28}+4.00\%$
test_redq_deprec_speed[False-backward] 22.1812ms 18.2353ms 54.8387 Ops/s 52.6043 Ops/s $\color{#35bf28}+4.25\%$
test_redq_deprec_speed[True-None] 4.1573ms 3.5331ms 283.0378 Ops/s 275.8791 Ops/s $\color{#35bf28}+2.59\%$
test_redq_deprec_speed[True-backward] 9.0334ms 8.1535ms 122.6465 Ops/s 110.4297 Ops/s $\textbf{\color{#35bf28}+11.06\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.1213ms 3.5329ms 283.0548 Ops/s 274.2959 Ops/s $\color{#35bf28}+3.19\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.9842ms 7.8741ms 126.9989 Ops/s 117.8128 Ops/s $\textbf{\color{#35bf28}+7.80\%}$
test_td3_speed[False-None] 9.8047ms 7.9351ms 126.0230 Ops/s 121.9502 Ops/s $\color{#35bf28}+3.34\%$
test_td3_speed[False-backward] 11.4417ms 10.3640ms 96.4878 Ops/s 93.3904 Ops/s $\color{#35bf28}+3.32\%$
test_td3_speed[True-None] 1.8735ms 1.7028ms 587.2724 Ops/s 579.8203 Ops/s $\color{#35bf28}+1.29\%$
test_td3_speed[True-backward] 3.6010ms 3.3024ms 302.8082 Ops/s 299.6955 Ops/s $\color{#35bf28}+1.04\%$
test_td3_speed[reduce-overhead-None] 1.8300ms 1.6956ms 589.7449 Ops/s 575.2730 Ops/s $\color{#35bf28}+2.52\%$
test_td3_speed[reduce-overhead-backward] 3.4471ms 3.3000ms 303.0308 Ops/s 292.7167 Ops/s $\color{#35bf28}+3.52\%$
test_cql_speed[False-None] 37.9848ms 36.0722ms 27.7222 Ops/s 27.1588 Ops/s $\color{#35bf28}+2.07\%$
test_cql_speed[False-backward] 49.5019ms 45.9133ms 21.7802 Ops/s 21.4589 Ops/s $\color{#35bf28}+1.50\%$
test_cql_speed[True-None] 16.4098ms 15.5603ms 64.2661 Ops/s 64.4520 Ops/s $\color{#d91a1a}-0.29\%$
test_cql_speed[True-backward] 23.8431ms 22.2471ms 44.9498 Ops/s 44.5677 Ops/s $\color{#35bf28}+0.86\%$
test_cql_speed[reduce-overhead-None] 16.2560ms 15.5408ms 64.3467 Ops/s 64.3343 Ops/s $\color{#35bf28}+0.02\%$
test_cql_speed[reduce-overhead-backward] 23.2747ms 22.2871ms 44.8690 Ops/s 44.6414 Ops/s $\color{#35bf28}+0.51\%$
test_a2c_speed[False-None] 7.9544ms 7.2748ms 137.4607 Ops/s 78.7304 Ops/s $\textbf{\color{#35bf28}+74.60\%}$
test_a2c_speed[False-backward] 20.9707ms 14.6338ms 68.3351 Ops/s 69.0650 Ops/s $\color{#d91a1a}-1.06\%$
test_a2c_speed[True-None] 4.9606ms 4.1709ms 239.7592 Ops/s 236.4197 Ops/s $\color{#35bf28}+1.41\%$
test_a2c_speed[True-backward] 11.1584ms 10.7487ms 93.0343 Ops/s 93.3729 Ops/s $\color{#d91a1a}-0.36\%$
test_a2c_speed[reduce-overhead-None] 4.6620ms 4.1764ms 239.4407 Ops/s 235.8840 Ops/s $\color{#35bf28}+1.51\%$
test_a2c_speed[reduce-overhead-backward] 11.7198ms 11.4210ms 87.5577 Ops/s 93.1465 Ops/s $\textbf{\color{#d91a1a}-6.00\%}$
test_ppo_speed[False-None] 10.2534ms 7.9627ms 125.5851 Ops/s 133.5642 Ops/s $\textbf{\color{#d91a1a}-5.97\%}$
test_ppo_speed[False-backward] 16.1809ms 15.2997ms 65.3609 Ops/s 66.8191 Ops/s $\color{#d91a1a}-2.18\%$
test_ppo_speed[True-None] 4.3515ms 3.6683ms 272.6066 Ops/s 268.3054 Ops/s $\color{#35bf28}+1.60\%$
test_ppo_speed[True-backward] 9.8997ms 9.5652ms 104.5453 Ops/s 103.5480 Ops/s $\color{#35bf28}+0.96\%$
test_ppo_speed[reduce-overhead-None] 4.3092ms 3.6654ms 272.8193 Ops/s 270.2319 Ops/s $\color{#35bf28}+0.96\%$
test_ppo_speed[reduce-overhead-backward] 9.8880ms 9.5924ms 104.2489 Ops/s 103.5912 Ops/s $\color{#35bf28}+0.63\%$
test_reinforce_speed[False-None] 7.4550ms 6.4578ms 154.8521 Ops/s 152.8227 Ops/s $\color{#35bf28}+1.33\%$
test_reinforce_speed[False-backward] 11.3903ms 9.7260ms 102.8167 Ops/s 100.3304 Ops/s $\color{#35bf28}+2.48\%$
test_reinforce_speed[True-None] 3.1049ms 2.6366ms 379.2736 Ops/s 373.5053 Ops/s $\color{#35bf28}+1.54\%$
test_reinforce_speed[True-backward] 9.7842ms 9.0093ms 110.9960 Ops/s 115.4468 Ops/s $\color{#d91a1a}-3.86\%$
test_reinforce_speed[reduce-overhead-None] 2.9278ms 2.6321ms 379.9214 Ops/s 375.9281 Ops/s $\color{#35bf28}+1.06\%$
test_reinforce_speed[reduce-overhead-backward] 8.9886ms 8.5954ms 116.3416 Ops/s 115.6699 Ops/s $\color{#35bf28}+0.58\%$
test_iql_speed[False-None] 34.1581ms 32.1519ms 31.1024 Ops/s 31.1072 Ops/s $\color{#d91a1a}-0.02\%$
test_iql_speed[False-backward] 46.9841ms 45.0289ms 22.2080 Ops/s 22.1198 Ops/s $\color{#35bf28}+0.40\%$
test_iql_speed[True-None] 11.8922ms 10.5714ms 94.5951 Ops/s 94.0961 Ops/s $\color{#35bf28}+0.53\%$
test_iql_speed[True-backward] 23.3229ms 21.6247ms 46.2434 Ops/s 43.7567 Ops/s $\textbf{\color{#35bf28}+5.68\%}$
test_iql_speed[reduce-overhead-None] 11.8968ms 10.6705ms 93.7162 Ops/s 90.4020 Ops/s $\color{#35bf28}+3.67\%$
test_iql_speed[reduce-overhead-backward] 22.5567ms 21.5454ms 46.4137 Ops/s 45.2381 Ops/s $\color{#35bf28}+2.60\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.6093ms 4.8858ms 204.6769 Ops/s 193.9209 Ops/s $\textbf{\color{#35bf28}+5.55\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.3387ms 0.5054ms 1.9788 KOps/s 1.9007 KOps/s $\color{#35bf28}+4.11\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.9447ms 0.4872ms 2.0527 KOps/s 2.0228 KOps/s $\color{#35bf28}+1.48\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.0038ms 4.6007ms 217.3566 Ops/s 203.3259 Ops/s $\textbf{\color{#35bf28}+6.90\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9934ms 0.4939ms 2.0247 KOps/s 1.9868 KOps/s $\color{#35bf28}+1.91\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7119ms 0.4709ms 2.1235 KOps/s 2.0455 KOps/s $\color{#35bf28}+3.81\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.9018ms 1.6204ms 617.1450 Ops/s 605.2874 Ops/s $\color{#35bf28}+1.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.5905ms 1.5838ms 631.4104 Ops/s 621.6782 Ops/s $\color{#35bf28}+1.57\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.5335ms 4.7799ms 209.2094 Ops/s 197.7682 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.0969ms 0.6424ms 1.5567 KOps/s 1.5318 KOps/s $\color{#35bf28}+1.63\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8132ms 0.6126ms 1.6324 KOps/s 1.6046 KOps/s $\color{#35bf28}+1.73\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.3014ms 4.6932ms 213.0762 Ops/s 203.5677 Ops/s $\color{#35bf28}+4.67\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.2516ms 0.5043ms 1.9829 KOps/s 1.9096 KOps/s $\color{#35bf28}+3.84\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6898ms 0.4828ms 2.0712 KOps/s 1.9949 KOps/s $\color{#35bf28}+3.83\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.8989ms 4.6383ms 215.5980 Ops/s 204.1108 Ops/s $\textbf{\color{#35bf28}+5.63\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.4511ms 0.4973ms 2.0109 KOps/s 1.9602 KOps/s $\color{#35bf28}+2.58\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6915ms 0.4743ms 2.1084 KOps/s 2.0436 KOps/s $\color{#35bf28}+3.17\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.0670ms 4.8346ms 206.8406 Ops/s 196.4503 Ops/s $\textbf{\color{#35bf28}+5.29\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8787ms 0.6389ms 1.5652 KOps/s 1.5075 KOps/s $\color{#35bf28}+3.83\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 8.5032ms 0.6259ms 1.5977 KOps/s 1.6077 KOps/s $\color{#d91a1a}-0.62\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 9.3543ms 4.2573ms 234.8890 Ops/s 234.6179 Ops/s $\color{#35bf28}+0.12\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.5573ms 2.3427ms 426.8663 Ops/s 435.6653 Ops/s $\color{#d91a1a}-2.02\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 5.8845ms 1.3352ms 748.9543 Ops/s 794.4284 Ops/s $\textbf{\color{#d91a1a}-5.72\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4097s 12.3692ms 80.8460 Ops/s 35.4474 Ops/s $\textbf{\color{#35bf28}+128.07\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.7500ms 2.3518ms 425.2050 Ops/s 430.0265 Ops/s $\color{#d91a1a}-1.12\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 4.9084ms 1.3278ms 753.1453 Ops/s 812.3374 Ops/s $\textbf{\color{#d91a1a}-7.29\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.8631ms 4.4073ms 226.8942 Ops/s 215.7065 Ops/s $\textbf{\color{#35bf28}+5.19\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.7899ms 2.5891ms 386.2378 Ops/s 406.9078 Ops/s $\textbf{\color{#d91a1a}-5.08\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 4.0316ms 1.5382ms 650.1156 Ops/s 675.1085 Ops/s $\color{#d91a1a}-3.70\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.8719ms 11.0738ms 90.3036 Ops/s 88.2380 Ops/s $\color{#35bf28}+2.34\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.1167ms 14.4546ms 69.1822 Ops/s 68.3288 Ops/s $\color{#35bf28}+1.25\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.6241ms 19.7225ms 50.7036 Ops/s 49.3308 Ops/s $\color{#35bf28}+2.78\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 15.7971ms 14.6517ms 68.2512 Ops/s 67.2246 Ops/s $\color{#35bf28}+1.53\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.9612ms 19.8593ms 50.3543 Ops/s 50.9364 Ops/s $\color{#d91a1a}-1.14\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.6040ms 15.8971ms 62.9045 Ops/s 63.1580 Ops/s $\color{#d91a1a}-0.40\%$
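
Reading the table above, the Change column appears to be the relative difference between Ops and Ops on Repo HEAD, and the 15 green and 9 red rows rendered in bold all exceed 5% in magnitude, which matches the Improved and Worsened counts in the summary. The Python sketch below reproduces that reading for a few rows. The helper names `parse_ops`, `relative_change`, and `classify`, and the 5% threshold, are assumptions inferred from the numbers shown here, not taken from the benchmark tooling.

```python
import re

# Unit prefixes that actually appear in the "Ops" columns of the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '1.0003 KOps/s' into operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*(K?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognised throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(pr_ops: str, head_ops: str) -> float:
    """Percentage change of the PR throughput relative to repo HEAD."""
    pr, head = parse_ops(pr_ops), parse_ops(head_ops)
    return (pr - head) / head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the bold rows."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# A few rows copied from the CPU table: (name, Ops, Ops on Repo HEAD).
rows = [
    ("test_simple", "2.3435 Ops/s", "2.2424 Ops/s"),
    ("test_ddpg_speed[True-None]", "1.0003 KOps/s", "998.0711 Ops/s"),
    ("test_a2c_speed[False-None]", "137.4607 Ops/s", "78.7304 Ops/s"),
    ("test_gae_speed[vec_generalized_advantage_estimate-True-32-512]",
     "20.3664 Ops/s", "23.7264 Ops/s"),
]

for name, pr_ops, head_ops in rows:
    change = relative_change(pr_ops, head_ops)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because the table prints rounded throughput values, a recomputed percentage can differ from the reported one in the last digit for some rows. Large swings on a single run, such as the `test_a2c_speed[False-None]` row, are worth re-running before reading them as real changes.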


github-actions bot commented Dec 3, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}17$.

Detailed results (columns, in order: Name, Max, Mean, Ops, Ops on Repo HEAD, Change):
test_simple 0.7543s 0.7474s 1.3379 Ops/s 1.2930 Ops/s $\color{#35bf28}+3.47\%$
test_transformed 1.0922s 1.0169s 0.9834 Ops/s 0.9828 Ops/s $\color{#35bf28}+0.06\%$
test_serial 2.2451s 2.1788s 0.4590 Ops/s 0.4621 Ops/s $\color{#d91a1a}-0.68\%$
test_parallel 2.0762s 2.0160s 0.4960 Ops/s 0.5047 Ops/s $\color{#d91a1a}-1.72\%$
test_step_mdp_speed[True-True-True-True-True] 0.2198ms 39.8184μs 25.1140 KOps/s 24.9816 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[True-True-True-True-False] 0.1679ms 22.8721μs 43.7214 KOps/s 42.9124 KOps/s $\color{#35bf28}+1.89\%$
test_step_mdp_speed[True-True-True-False-True] 52.7210μs 21.8822μs 45.6993 KOps/s 45.7862 KOps/s $\color{#d91a1a}-0.19\%$
test_step_mdp_speed[True-True-True-False-False] 59.4910μs 12.6447μs 79.0845 KOps/s 76.2454 KOps/s $\color{#35bf28}+3.72\%$
test_step_mdp_speed[True-True-False-True-True] 0.1195ms 41.9149μs 23.8578 KOps/s 23.2183 KOps/s $\color{#35bf28}+2.75\%$
test_step_mdp_speed[True-True-False-True-False] 0.1268ms 24.9005μs 40.1598 KOps/s 39.3416 KOps/s $\color{#35bf28}+2.08\%$
test_step_mdp_speed[True-True-False-False-True] 52.8910μs 24.6127μs 40.6294 KOps/s 41.0919 KOps/s $\color{#d91a1a}-1.13\%$
test_step_mdp_speed[True-True-False-False-False] 43.4800μs 14.9602μs 66.8442 KOps/s 66.4569 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[True-False-True-True-True] 78.0610μs 45.2548μs 22.0971 KOps/s 22.4639 KOps/s $\color{#d91a1a}-1.63\%$
test_step_mdp_speed[True-False-True-True-False] 70.2310μs 27.4373μs 36.4467 KOps/s 36.6718 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[True-False-True-False-True] 61.8310μs 24.6051μs 40.6420 KOps/s 40.3023 KOps/s $\color{#35bf28}+0.84\%$
test_step_mdp_speed[True-False-True-False-False] 51.2110μs 14.8557μs 67.3140 KOps/s 65.8263 KOps/s $\color{#35bf28}+2.26\%$
test_step_mdp_speed[True-False-False-True-True] 76.0910μs 46.6556μs 21.4337 KOps/s 21.5131 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[True-False-False-True-False] 60.0210μs 28.7496μs 34.7831 KOps/s 34.3785 KOps/s $\color{#35bf28}+1.18\%$
test_step_mdp_speed[True-False-False-False-True] 57.0910μs 25.8582μs 38.6725 KOps/s 37.3933 KOps/s $\color{#35bf28}+3.42\%$
test_step_mdp_speed[True-False-False-False-False] 59.0000μs 16.8481μs 59.3538 KOps/s 58.0674 KOps/s $\color{#35bf28}+2.22\%$
test_step_mdp_speed[False-True-True-True-True] 96.3910μs 44.4107μs 22.5171 KOps/s 22.4722 KOps/s $\color{#35bf28}+0.20\%$
test_step_mdp_speed[False-True-True-True-False] 60.6110μs 27.5701μs 36.2712 KOps/s 36.3512 KOps/s $\color{#d91a1a}-0.22\%$
test_step_mdp_speed[False-True-True-False-True] 0.1323ms 28.0889μs 35.6012 KOps/s 35.4086 KOps/s $\color{#35bf28}+0.54\%$
test_step_mdp_speed[False-True-True-False-False] 43.9910μs 16.5289μs 60.5003 KOps/s 60.2471 KOps/s $\color{#35bf28}+0.42\%$
test_step_mdp_speed[False-True-False-True-True] 72.1710μs 46.5990μs 21.4597 KOps/s 21.0869 KOps/s $\color{#35bf28}+1.77\%$
test_step_mdp_speed[False-True-False-True-False] 0.1044ms 29.3803μs 34.0364 KOps/s 34.2529 KOps/s $\color{#d91a1a}-0.63\%$
test_step_mdp_speed[False-True-False-False-True] 3.4217ms 30.5265μs 32.7584 KOps/s 32.8555 KOps/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[False-True-False-False-False] 69.7710μs 18.5991μs 53.7662 KOps/s 52.5326 KOps/s $\color{#35bf28}+2.35\%$
test_step_mdp_speed[False-False-True-True-True] 0.1165ms 49.1431μs 20.3487 KOps/s 20.3834 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[False-False-True-True-False] 68.2410μs 31.2490μs 32.0010 KOps/s 31.4821 KOps/s $\color{#35bf28}+1.65\%$
test_step_mdp_speed[False-False-True-False-True] 61.6610μs 30.2414μs 33.0672 KOps/s 32.6129 KOps/s $\color{#35bf28}+1.39\%$
test_step_mdp_speed[False-False-True-False-False] 0.2018ms 18.5988μs 53.7669 KOps/s 52.6887 KOps/s $\color{#35bf28}+2.05\%$
test_step_mdp_speed[False-False-False-True-True] 0.2118ms 50.4115μs 19.8368 KOps/s 20.0223 KOps/s $\color{#d91a1a}-0.93\%$
test_step_mdp_speed[False-False-False-True-False] 60.6010μs 33.4621μs 29.8846 KOps/s 29.8478 KOps/s $\color{#35bf28}+0.12\%$
test_step_mdp_speed[False-False-False-False-True] 0.2103ms 31.9271μs 31.3214 KOps/s 31.8070 KOps/s $\color{#d91a1a}-1.53\%$
test_step_mdp_speed[False-False-False-False-False] 0.2180ms 20.4116μs 48.9917 KOps/s 48.7438 KOps/s $\color{#35bf28}+0.51\%$
test_values[generalized_advantage_estimate-True-True] 26.3804ms 25.3795ms 39.4018 Ops/s 40.2026 Ops/s $\color{#d91a1a}-1.99\%$
test_values[vec_generalized_advantage_estimate-True-True] 99.1797ms 2.8870ms 346.3849 Ops/s 359.7201 Ops/s $\color{#d91a1a}-3.71\%$
test_values[td0_return_estimate-False-False] 0.1096ms 81.7958μs 12.2256 KOps/s 12.2575 KOps/s $\color{#d91a1a}-0.26\%$
test_values[td1_return_estimate-False-False] 58.8006ms 56.6771ms 17.6438 Ops/s 18.0571 Ops/s $\color{#d91a1a}-2.29\%$
test_values[vec_td1_return_estimate-False-False] 1.3906ms 1.0889ms 918.3593 Ops/s 916.1390 Ops/s $\color{#35bf28}+0.24\%$
test_values[td_lambda_return_estimate-True-False] 94.4629ms 90.2744ms 11.0773 Ops/s 11.2458 Ops/s $\color{#d91a1a}-1.50\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2968ms 1.0871ms 919.8522 Ops/s 917.2390 Ops/s $\color{#35bf28}+0.28\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.6096ms 25.3060ms 39.5164 Ops/s 38.0758 Ops/s $\color{#35bf28}+3.78\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0544ms 0.7626ms 1.3113 KOps/s 1.3029 KOps/s $\color{#35bf28}+0.64\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.8333ms 0.6803ms 1.4700 KOps/s 1.4335 KOps/s $\color{#35bf28}+2.54\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6889ms 1.4910ms 670.6738 Ops/s 671.3645 Ops/s $\color{#d91a1a}-0.10\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.7248ms 0.7318ms 1.3665 KOps/s 1.4481 KOps/s $\textbf{\color{#d91a1a}-5.63\%}$
test_dqn_speed[False-None] 1.7253ms 1.4878ms 672.1516 Ops/s 673.9986 Ops/s $\color{#d91a1a}-0.27\%$
test_dqn_speed[False-backward] 2.2841ms 2.1133ms 473.1830 Ops/s 479.8514 Ops/s $\color{#d91a1a}-1.39\%$
test_dqn_speed[True-None] 0.9637ms 0.5473ms 1.8271 KOps/s 1.8320 KOps/s $\color{#d91a1a}-0.27\%$
test_dqn_speed[True-backward] 1.2268ms 1.1075ms 902.9011 Ops/s 901.8392 Ops/s $\color{#35bf28}+0.12\%$
test_dqn_speed[reduce-overhead-None] 0.9592ms 0.5537ms 1.8060 KOps/s 1.7457 KOps/s $\color{#35bf28}+3.45\%$
test_dqn_speed[reduce-overhead-backward] 1.1133ms 0.9770ms 1.0235 KOps/s 1.0060 KOps/s $\color{#35bf28}+1.74\%$
test_ddpg_speed[False-None] 3.1953ms 2.8025ms 356.8256 Ops/s 357.4705 Ops/s $\color{#d91a1a}-0.18\%$
test_ddpg_speed[False-backward] 4.4581ms 4.0578ms 246.4404 Ops/s 245.0733 Ops/s $\color{#35bf28}+0.56\%$
test_ddpg_speed[True-None] 1.4921ms 1.0982ms 910.5690 Ops/s 922.7233 Ops/s $\color{#d91a1a}-1.32\%$
test_ddpg_speed[True-backward] 2.3985ms 2.1862ms 457.4226 Ops/s 455.4564 Ops/s $\color{#35bf28}+0.43\%$
test_ddpg_speed[reduce-overhead-None] 1.5161ms 1.1091ms 901.6505 Ops/s 877.9653 Ops/s $\color{#35bf28}+2.70\%$
test_ddpg_speed[reduce-overhead-backward] 1.7221ms 1.6593ms 602.6505 Ops/s 603.8449 Ops/s $\color{#d91a1a}-0.20\%$
test_sac_speed[False-None] 8.6442ms 8.1063ms 123.3603 Ops/s 125.1646 Ops/s $\color{#d91a1a}-1.44\%$
test_sac_speed[False-backward] 11.6347ms 11.0010ms 90.9005 Ops/s 90.9949 Ops/s $\color{#d91a1a}-0.10\%$
test_sac_speed[True-None] 1.7354ms 1.5583ms 641.7266 Ops/s 644.4229 Ops/s $\color{#d91a1a}-0.42\%$
test_sac_speed[True-backward] 3.5388ms 3.3150ms 301.6557 Ops/s 306.3838 Ops/s $\color{#d91a1a}-1.54\%$
test_sac_speed[reduce-overhead-None] 23.0050ms 12.6918ms 78.7908 Ops/s 77.7825 Ops/s $\color{#35bf28}+1.30\%$
test_sac_speed[reduce-overhead-backward] 1.4866ms 1.3480ms 741.8398 Ops/s 741.7925 Ops/s $+0.01\%$
test_redq_speed[False-None] 8.5428ms 7.5157ms 133.0553 Ops/s 132.6889 Ops/s $\color{#35bf28}+0.28\%$
test_redq_speed[False-backward] 12.1139ms 11.3965ms 87.7466 Ops/s 87.4966 Ops/s $\color{#35bf28}+0.29\%$
test_redq_speed[True-None] 2.1748ms 2.0128ms 496.8257 Ops/s 497.8210 Ops/s $\color{#d91a1a}-0.20\%$
test_redq_speed[True-backward] 4.0110ms 3.6985ms 270.3819 Ops/s 257.7748 Ops/s $\color{#35bf28}+4.89\%$
test_redq_speed[reduce-overhead-None] 2.2964ms 2.0489ms 488.0741 Ops/s 480.1267 Ops/s $\color{#35bf28}+1.66\%$
test_redq_speed[reduce-overhead-backward] 4.2398ms 3.7230ms 268.6015 Ops/s 258.4097 Ops/s $\color{#35bf28}+3.94\%$
test_redq_deprec_speed[False-None] 9.5566ms 9.0038ms 111.0641 Ops/s 110.4663 Ops/s $\color{#35bf28}+0.54\%$
test_redq_deprec_speed[False-backward] 12.5307ms 12.0638ms 82.8928 Ops/s 80.9590 Ops/s $\color{#35bf28}+2.39\%$
test_redq_deprec_speed[True-None] 2.7559ms 2.3461ms 426.2388 Ops/s 423.5244 Ops/s $\color{#35bf28}+0.64\%$
test_redq_deprec_speed[True-backward] 4.6427ms 4.2600ms 234.7419 Ops/s 248.1948 Ops/s $\textbf{\color{#d91a1a}-5.42\%}$
test_redq_deprec_speed[reduce-overhead-None] 2.6444ms 2.4015ms 416.4000 Ops/s 406.4746 Ops/s $\color{#35bf28}+2.44\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.6977ms 4.1029ms 243.7320 Ops/s 249.6707 Ops/s $\color{#d91a1a}-2.38\%$
test_td3_speed[False-None] 8.0076ms 7.8923ms 126.7063 Ops/s 118.2441 Ops/s $\textbf{\color{#35bf28}+7.16\%}$
test_td3_speed[False-backward] 10.8319ms 10.3119ms 96.9749 Ops/s 96.3377 Ops/s $\color{#35bf28}+0.66\%$
test_td3_speed[True-None] 1.7118ms 1.6594ms 602.6334 Ops/s 616.9564 Ops/s $\color{#d91a1a}-2.32\%$
test_td3_speed[True-backward] 3.4691ms 3.3197ms 301.2299 Ops/s 318.3319 Ops/s $\textbf{\color{#d91a1a}-5.37\%}$
test_td3_speed[reduce-overhead-None] 50.5787ms 25.8660ms 38.6607 Ops/s 36.5452 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
test_td3_speed[reduce-overhead-backward] 1.4239ms 1.2997ms 769.4189 Ops/s 755.2590 Ops/s $\color{#35bf28}+1.87\%$
test_cql_speed[False-None] 16.9529ms 16.1434ms 61.9448 Ops/s 62.2391 Ops/s $\color{#d91a1a}-0.47\%$
test_cql_speed[False-backward] 21.9223ms 21.3538ms 46.8300 Ops/s 47.0894 Ops/s $\color{#d91a1a}-0.55\%$
test_cql_speed[True-None] 3.4841ms 2.9921ms 334.2171 Ops/s 335.6548 Ops/s $\color{#d91a1a}-0.43\%$
test_cql_speed[True-backward] 5.4426ms 5.0960ms 196.2319 Ops/s 196.9961 Ops/s $\color{#d91a1a}-0.39\%$
test_cql_speed[reduce-overhead-None] 22.0383ms 13.2423ms 75.5158 Ops/s 75.2869 Ops/s $\color{#35bf28}+0.30\%$
test_cql_speed[reduce-overhead-backward] 1.6791ms 1.5197ms 658.0435 Ops/s 626.2481 Ops/s $\textbf{\color{#35bf28}+5.08\%}$
test_a2c_speed[False-None] 3.3260ms 3.1529ms 317.1652 Ops/s 316.8877 Ops/s $\color{#35bf28}+0.09\%$
test_a2c_speed[False-backward] 6.7350ms 6.1103ms 163.6584 Ops/s 157.8783 Ops/s $\color{#35bf28}+3.66\%$
test_a2c_speed[True-None] 1.2082ms 1.0080ms 992.0299 Ops/s 976.1203 Ops/s $\color{#35bf28}+1.63\%$
test_a2c_speed[True-backward] 2.8113ms 2.6716ms 374.3086 Ops/s 375.7263 Ops/s $\color{#d91a1a}-0.38\%$
test_a2c_speed[reduce-overhead-None] 0.3897s 12.5754ms 79.5202 Ops/s 84.5101 Ops/s $\textbf{\color{#d91a1a}-5.90\%}$
test_a2c_speed[reduce-overhead-backward] 1.1757ms 1.0137ms 986.4524 Ops/s 999.6257 Ops/s $\color{#d91a1a}-1.32\%$
test_ppo_speed[False-None] 4.0240ms 3.6513ms 273.8760 Ops/s 277.4311 Ops/s $\color{#d91a1a}-1.28\%$
test_ppo_speed[False-backward] 7.3457ms 6.8520ms 145.9421 Ops/s 147.5730 Ops/s $\color{#d91a1a}-1.11\%$
test_ppo_speed[True-None] 1.3386ms 0.9569ms 1.0450 KOps/s 1.0060 KOps/s $\color{#35bf28}+3.87\%$
test_ppo_speed[True-backward] 2.7759ms 2.6341ms 379.6384 Ops/s 387.5036 Ops/s $\color{#d91a1a}-2.03\%$
test_ppo_speed[reduce-overhead-None] 0.6547ms 0.5016ms 1.9937 KOps/s 1.8974 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_ppo_speed[reduce-overhead-backward] 1.2346ms 1.0785ms 927.1964 Ops/s 1.0069 KOps/s $\textbf{\color{#d91a1a}-7.91\%}$
test_reinforce_speed[False-None] 2.3840ms 2.2162ms 451.2246 Ops/s 449.2097 Ops/s $\color{#35bf28}+0.45\%$
test_reinforce_speed[False-backward] 3.5763ms 3.4013ms 294.0047 Ops/s 307.8988 Ops/s $\color{#d91a1a}-4.51\%$
test_reinforce_speed[True-None] 1.0249ms 0.8407ms 1.1895 KOps/s 1.1942 KOps/s $\color{#d91a1a}-0.39\%$
test_reinforce_speed[True-backward] 2.7554ms 2.6155ms 382.3306 Ops/s 408.3260 Ops/s $\textbf{\color{#d91a1a}-6.37\%}$
test_reinforce_speed[reduce-overhead-None] 22.9823ms 11.9461ms 83.7093 Ops/s 85.1677 Ops/s $\color{#d91a1a}-1.71\%$
test_reinforce_speed[reduce-overhead-backward] 1.3234ms 1.1657ms 857.8357 Ops/s 830.9164 Ops/s $\color{#35bf28}+3.24\%$
test_iql_speed[False-None] 9.6058ms 9.1441ms 109.3596 Ops/s 109.7296 Ops/s $\color{#d91a1a}-0.34\%$
test_iql_speed[False-backward] 13.7414ms 13.1589ms 75.9940 Ops/s 75.9168 Ops/s $\color{#35bf28}+0.10\%$
test_iql_speed[True-None] 1.9986ms 1.7869ms 559.6337 Ops/s 544.5243 Ops/s $\color{#35bf28}+2.77\%$
test_iql_speed[True-backward] 4.4733ms 4.2887ms 233.1722 Ops/s 230.0780 Ops/s $\color{#35bf28}+1.34\%$
test_iql_speed[reduce-overhead-None] 20.6084ms 11.5967ms 86.2318 Ops/s 86.7272 Ops/s $\color{#d91a1a}-0.57\%$
test_iql_speed[reduce-overhead-backward] 1.6014ms 1.4494ms 689.9571 Ops/s 695.5448 Ops/s $\color{#d91a1a}-0.80\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8186ms 6.4281ms 155.5665 Ops/s 152.8610 Ops/s $\color{#35bf28}+1.77\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6732ms 0.3739ms 2.6747 KOps/s 3.4718 KOps/s $\textbf{\color{#d91a1a}-22.96\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6082ms 0.3070ms 3.2571 KOps/s 3.9999 KOps/s $\textbf{\color{#d91a1a}-18.57\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.6340ms 6.1963ms 161.3867 Ops/s 159.1460 Ops/s $\color{#35bf28}+1.41\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.2129ms 0.2559ms 3.9071 KOps/s 3.9075 KOps/s $\color{#d91a1a}-0.01\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4471ms 0.2338ms 4.2766 KOps/s 4.2689 KOps/s $\color{#35bf28}+0.18\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7603ms 1.5092ms 662.6179 Ops/s 680.4125 Ops/s $\color{#d91a1a}-2.62\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5170ms 1.2829ms 779.4630 Ops/s 820.9000 Ops/s $\textbf{\color{#d91a1a}-5.05\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6972ms 6.3726ms 156.9226 Ops/s 155.0086 Ops/s $\color{#35bf28}+1.23\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1634ms 0.4147ms 2.4112 KOps/s 2.4320 KOps/s $\color{#d91a1a}-0.85\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6685ms 0.3874ms 2.5812 KOps/s 2.4281 KOps/s $\textbf{\color{#35bf28}+6.30\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5329ms 6.2421ms 160.2033 Ops/s 160.4912 Ops/s $\color{#d91a1a}-0.18\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.7416ms 0.3279ms 3.0501 KOps/s 3.2692 KOps/s $\textbf{\color{#d91a1a}-6.70\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5898ms 0.2838ms 3.5230 KOps/s 3.5227 KOps/s $+0.01\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5374ms 6.1655ms 162.1918 Ops/s 161.7673 Ops/s $\color{#35bf28}+0.26\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.3121ms 0.2940ms 3.4017 KOps/s 3.0308 KOps/s $\textbf{\color{#35bf28}+12.24\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5572ms 0.2711ms 3.6885 KOps/s 3.0425 KOps/s $\textbf{\color{#35bf28}+21.23\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6519ms 6.3919ms 156.4472 Ops/s 156.5115 Ops/s $\color{#d91a1a}-0.04\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.7186ms 0.5433ms 1.8406 KOps/s 2.1985 KOps/s $\textbf{\color{#d91a1a}-16.28\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 9.6395ms 0.5351ms 1.8687 KOps/s 2.5013 KOps/s $\textbf{\color{#d91a1a}-25.29\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.0356ms 5.3414ms 187.2152 Ops/s 187.3948 Ops/s $\color{#d91a1a}-0.10\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 4.0688ms 1.9912ms 502.2169 Ops/s 440.4765 Ops/s $\textbf{\color{#35bf28}+14.02\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.9607ms 1.3209ms 757.0411 Ops/s 854.9558 Ops/s $\textbf{\color{#d91a1a}-11.45\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4775s 14.8548ms 67.3184 Ops/s 190.0230 Ops/s $\textbf{\color{#d91a1a}-64.57\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.0675ms 2.0633ms 484.6702 Ops/s 428.4997 Ops/s $\textbf{\color{#35bf28}+13.11\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 8.0536ms 1.2889ms 775.8254 Ops/s 871.8712 Ops/s $\textbf{\color{#d91a1a}-11.02\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.8221ms 5.6335ms 177.5084 Ops/s 33.2276 Ops/s $\textbf{\color{#35bf28}+434.22\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.3473ms 2.3023ms 434.3542 Ops/s 528.1629 Ops/s $\textbf{\color{#d91a1a}-17.76\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.2832ms 1.4881ms 672.0118 Ops/s 826.7317 Ops/s $\textbf{\color{#d91a1a}-18.71\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.4528ms 13.2629ms 75.3980 Ops/s 74.6428 Ops/s $\color{#35bf28}+1.01\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.1956ms 16.6891ms 59.9193 Ops/s 59.2048 Ops/s $\color{#35bf28}+1.21\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.8926ms 18.0843ms 55.2966 Ops/s 54.3031 Ops/s $\color{#35bf28}+1.83\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.6834ms 17.0369ms 58.6961 Ops/s 58.9903 Ops/s $\color{#d91a1a}-0.50\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.5477ms 17.7882ms 56.2169 Ops/s 55.2364 Ops/s $\color{#35bf28}+1.78\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.7815ms 18.2725ms 54.7269 Ops/s 55.3691 Ops/s $\color{#d91a1a}-1.16\%$
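
The percentage in the last column appears to be the relative difference between the two throughput columns. The sketch below reproduces it, assuming the first throughput value is this PR's result and the second is the baseline; the column headers are not visible in this part of the table, so that reading is inferred from the arithmetic. The helper names `to_ops_per_second` and `percent_change` are hypothetical and not part of the benchmark tooling. Rows mix `Ops/s` and `KOps/s`, so values are normalized first.

```python
# Scale factors for the throughput units that appear in the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def to_ops_per_second(value: float, unit: str) -> float:
    """Convert a throughput reading to plain operations per second."""
    return value * UNIT_SCALE[unit]


def percent_change(ops_pr: float, unit_pr: str,
                   ops_base: float, unit_base: str) -> float:
    """Relative throughput change of the PR run versus the baseline, in percent.

    Assumption: the first throughput column is the PR run and the second
    is the baseline. Positive means the PR run was faster.
    """
    pr = to_ops_per_second(ops_pr, unit_pr)
    base = to_ops_per_second(ops_base, unit_base)
    return (pr - base) / base * 100.0


# test_td3_speed[True-None]: the table reports -2.32%
td3 = percent_change(602.6334, "Ops/s", 616.9564, "Ops/s")

# test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400]:
# the table reports +434.22%
rb_populate = percent_change(177.5084, "Ops/s", 33.2276, "Ops/s")

# test_ppo_speed[reduce-overhead-backward] mixes Ops/s and KOps/s;
# the table reports -7.91%
ppo = percent_change(927.1964, "Ops/s", 1.0069, "KOps/s")
```

Because the table shows rounded throughput values, a recomputed percentage can differ from the reported one in the last digit, most visibly on rows expressed in `KOps/s`.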
