Skip to content

Conversation

@vmoens
Copy link
Collaborator

@vmoens vmoens commented Dec 12, 2025

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 12, 2025
ghstack-source-id: ee09d7f
Pull-Request: #3251
@pytorch-bot
Copy link

pytorch-bot bot commented Dec 12, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3251

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 12 New Failures, 1 Cancelled Job, 9 Unrelated Failures

As of commit 8917b73 with merge base 3cd740a (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 12, 2025
@vmoens vmoens added the ciflow/binaries/all Build all binaries label Dec 12, 2025
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 12, 2025
ghstack-source-id: 3ec1686
Pull-Request: #3251
@vmoens vmoens mentioned this pull request Dec 12, 2025
[ghstack-poisoned]
@github-actions
Copy link

github-actions bot commented Dec 12, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 154. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_tensor_to_bytestream_speed[pickle] 80.4944μs 79.2761μs 12.6141 KOps/s 12.5046 KOps/s $\color{#35bf28}+0.88\%$
test_tensor_to_bytestream_speed[torch.save] 0.1428ms 0.1405ms 7.1157 KOps/s 7.2047 KOps/s $\color{#d91a1a}-1.24\%$
test_tensor_to_bytestream_speed[untyped_storage] 0.1140s 0.1137s 8.7915 Ops/s 8.6109 Ops/s $\color{#35bf28}+2.10\%$
test_tensor_to_bytestream_speed[numpy] 2.8188μs 2.8124μs 355.5707 KOps/s 366.2699 KOps/s $\color{#d91a1a}-2.92\%$
test_tensor_to_bytestream_speed[safetensors] 40.2948μs 40.0564μs 24.9648 KOps/s 26.8624 KOps/s $\textbf{\color{#d91a1a}-7.06\%}$
test_simple 0.5409s 0.5397s 1.8529 Ops/s 1.7635 Ops/s $\textbf{\color{#35bf28}+5.07\%}$
test_transformed 1.2252s 1.1301s 0.8849 Ops/s 0.8816 Ops/s $\color{#35bf28}+0.37\%$
test_serial 1.6440s 1.6423s 0.6089 Ops/s 0.5954 Ops/s $\color{#35bf28}+2.27\%$
test_parallel 1.1546s 1.0644s 0.9395 Ops/s 0.8859 Ops/s $\textbf{\color{#35bf28}+6.05\%}$
test_step_mdp_speed[True-True-True-True-True] 0.2495ms 43.7720μs 22.8456 KOps/s 22.7676 KOps/s $\color{#35bf28}+0.34\%$
test_step_mdp_speed[True-True-True-True-False] 0.1983ms 24.4763μs 40.8559 KOps/s 40.9232 KOps/s $\color{#d91a1a}-0.16\%$
test_step_mdp_speed[True-True-True-False-True] 59.7310μs 24.3590μs 41.0526 KOps/s 40.7033 KOps/s $\color{#35bf28}+0.86\%$
test_step_mdp_speed[True-True-True-False-False] 47.0010μs 13.4934μs 74.1102 KOps/s 74.0166 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[True-True-False-True-True] 77.4010μs 46.0095μs 21.7346 KOps/s 21.4660 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[True-True-False-True-False] 92.5620μs 26.6030μs 37.5898 KOps/s 37.0378 KOps/s $\color{#35bf28}+1.49\%$
test_step_mdp_speed[True-True-False-False-True] 56.8610μs 26.7524μs 37.3798 KOps/s 36.8141 KOps/s $\color{#35bf28}+1.54\%$
test_step_mdp_speed[True-True-False-False-False] 45.6610μs 16.0512μs 62.3007 KOps/s 61.9751 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[True-False-True-True-True] 94.0120μs 48.3569μs 20.6796 KOps/s 20.3746 KOps/s $\color{#35bf28}+1.50\%$
test_step_mdp_speed[True-False-True-True-False] 80.4210μs 29.1453μs 34.3109 KOps/s 34.2098 KOps/s $\color{#35bf28}+0.30\%$
test_step_mdp_speed[True-False-True-False-True] 86.7720μs 26.9854μs 37.0570 KOps/s 36.9188 KOps/s $\color{#35bf28}+0.37\%$
test_step_mdp_speed[True-False-True-False-False] 43.8810μs 16.1404μs 61.9563 KOps/s 63.2560 KOps/s $\color{#d91a1a}-2.05\%$
test_step_mdp_speed[True-False-False-True-True] 0.1086ms 51.1039μs 19.5680 KOps/s 19.5650 KOps/s $\color{#35bf28}+0.02\%$
test_step_mdp_speed[True-False-False-True-False] 58.7110μs 31.9192μs 31.3291 KOps/s 31.5236 KOps/s $\color{#d91a1a}-0.62\%$
test_step_mdp_speed[True-False-False-False-True] 58.5410μs 28.9316μs 34.5643 KOps/s 34.3496 KOps/s $\color{#35bf28}+0.62\%$
test_step_mdp_speed[True-False-False-False-False] 52.6010μs 18.6019μs 53.7581 KOps/s 53.9828 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[False-True-True-True-True] 81.7410μs 48.5812μs 20.5841 KOps/s 20.1182 KOps/s $\color{#35bf28}+2.32\%$
test_step_mdp_speed[False-True-True-True-False] 67.8020μs 30.0718μs 33.2538 KOps/s 33.2424 KOps/s $\color{#35bf28}+0.03\%$
test_step_mdp_speed[False-True-True-False-True] 2.4344ms 30.9301μs 32.3310 KOps/s 32.1631 KOps/s $\color{#35bf28}+0.52\%$
test_step_mdp_speed[False-True-True-False-False] 51.7210μs 17.5904μs 56.8493 KOps/s 56.5703 KOps/s $\color{#35bf28}+0.49\%$
test_step_mdp_speed[False-True-False-True-True] 84.0620μs 52.0825μs 19.2003 KOps/s 19.2721 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[False-True-False-True-False] 0.1078ms 31.7436μs 31.5024 KOps/s 31.1888 KOps/s $\color{#35bf28}+1.01\%$
test_step_mdp_speed[False-True-False-False-True] 63.6120μs 32.2903μs 30.9691 KOps/s 29.7533 KOps/s $\color{#35bf28}+4.09\%$
test_step_mdp_speed[False-True-False-False-False] 49.7210μs 20.0323μs 49.9194 KOps/s 50.4373 KOps/s $\color{#d91a1a}-1.03\%$
test_step_mdp_speed[False-False-True-True-True] 89.6220μs 53.5908μs 18.6599 KOps/s 18.5190 KOps/s $\color{#35bf28}+0.76\%$
test_step_mdp_speed[False-False-True-True-False] 62.6420μs 34.2817μs 29.1701 KOps/s 29.3349 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[False-False-True-False-True] 63.1010μs 33.1656μs 30.1517 KOps/s 30.3097 KOps/s $\color{#d91a1a}-0.52\%$
test_step_mdp_speed[False-False-True-False-False] 89.6610μs 20.0580μs 49.8553 KOps/s 49.0475 KOps/s $\color{#35bf28}+1.65\%$
test_step_mdp_speed[False-False-False-True-True] 87.5820μs 56.0676μs 17.8356 KOps/s 17.8469 KOps/s $\color{#d91a1a}-0.06\%$
test_step_mdp_speed[False-False-False-True-False] 60.8010μs 37.1400μs 26.9251 KOps/s 26.9576 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-False-False-False-True] 79.6220μs 34.8795μs 28.6701 KOps/s 28.5201 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[False-False-False-False-False] 53.5410μs 22.6558μs 44.1388 KOps/s 45.0460 KOps/s $\color{#d91a1a}-2.01\%$
test_values[generalized_advantage_estimate-True-True] 10.8269ms 10.4158ms 96.0082 Ops/s 96.4538 Ops/s $\color{#d91a1a}-0.46\%$
test_values[vec_generalized_advantage_estimate-True-True] 21.1269ms 18.4337ms 54.2485 Ops/s 56.4341 Ops/s $\color{#d91a1a}-3.87\%$
test_values[td0_return_estimate-False-False] 0.2227ms 0.1281ms 7.8051 KOps/s 7.5349 KOps/s $\color{#35bf28}+3.59\%$
test_values[td1_return_estimate-False-False] 28.2724ms 27.9158ms 35.8220 Ops/s 35.4522 Ops/s $\color{#35bf28}+1.04\%$
test_values[vec_td1_return_estimate-False-False] 18.6434ms 17.9714ms 55.6439 Ops/s 56.0801 Ops/s $\color{#d91a1a}-0.78\%$
test_values[td_lambda_return_estimate-True-False] 42.9310ms 41.4912ms 24.1015 Ops/s 24.4384 Ops/s $\color{#d91a1a}-1.38\%$
test_values[vec_td_lambda_return_estimate-True-False] 18.3737ms 17.9388ms 55.7450 Ops/s 56.2652 Ops/s $\color{#d91a1a}-0.92\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 9.3808ms 9.0871ms 110.0460 Ops/s 110.8023 Ops/s $\color{#d91a1a}-0.68\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.9508ms 1.5455ms 647.0417 Ops/s 676.6890 Ops/s $\color{#d91a1a}-4.38\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5200ms 0.4329ms 2.3098 KOps/s 2.3879 KOps/s $\color{#d91a1a}-3.27\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 35.6685ms 35.1917ms 28.4158 Ops/s 29.0280 Ops/s $\color{#d91a1a}-2.11\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 2.1292ms 1.7725ms 564.1647 Ops/s 574.3723 Ops/s $\color{#d91a1a}-1.78\%$
test_dqn_speed[False-None] 6.4686ms 1.3985ms 715.0432 Ops/s 699.4778 Ops/s $\color{#35bf28}+2.23\%$
test_dqn_speed[False-backward] 2.0090ms 1.9278ms 518.7216 Ops/s 517.1157 Ops/s $\color{#35bf28}+0.31\%$
test_dqn_speed[True-None] 0.8162ms 0.5132ms 1.9484 KOps/s 1.9236 KOps/s $\color{#35bf28}+1.29\%$
test_dqn_speed[True-backward] 1.2554ms 0.9897ms 1.0104 KOps/s 843.5510 Ops/s $\textbf{\color{#35bf28}+19.78\%}$
test_dqn_speed[reduce-overhead-None] 0.6639ms 0.5085ms 1.9664 KOps/s 1.9313 KOps/s $\color{#35bf28}+1.82\%$
test_dqn_speed[reduce-overhead-backward] 0.9951ms 0.9498ms 1.0529 KOps/s 1.0405 KOps/s $\color{#35bf28}+1.20\%$
test_ddpg_speed[False-None] 3.5042ms 2.8277ms 353.6439 Ops/s 352.1608 Ops/s $\color{#35bf28}+0.42\%$
test_ddpg_speed[False-backward] 4.1466ms 4.0653ms 245.9873 Ops/s 245.7864 Ops/s $\color{#35bf28}+0.08\%$
test_ddpg_speed[True-None] 1.6512ms 1.3772ms 726.1295 Ops/s 728.1848 Ops/s $\color{#d91a1a}-0.28\%$
test_ddpg_speed[True-backward] 2.5368ms 2.3672ms 422.4428 Ops/s 416.5273 Ops/s $\color{#35bf28}+1.42\%$
test_ddpg_speed[reduce-overhead-None] 1.7555ms 1.3646ms 732.8035 Ops/s 729.3395 Ops/s $\color{#35bf28}+0.47\%$
test_ddpg_speed[reduce-overhead-backward] 2.4090ms 2.3362ms 428.0413 Ops/s 361.7358 Ops/s $\textbf{\color{#35bf28}+18.33\%}$
test_sac_speed[False-None] 9.1405ms 7.9214ms 126.2400 Ops/s 124.3326 Ops/s $\color{#35bf28}+1.53\%$
test_sac_speed[False-backward] 11.5379ms 11.2046ms 89.2487 Ops/s 89.5413 Ops/s $\color{#d91a1a}-0.33\%$
test_sac_speed[True-None] 2.4640ms 2.1313ms 469.1990 Ops/s 456.7139 Ops/s $\color{#35bf28}+2.73\%$
test_sac_speed[True-backward] 4.2137ms 4.0749ms 245.4053 Ops/s 246.0143 Ops/s $\color{#d91a1a}-0.25\%$
test_sac_speed[reduce-overhead-None] 2.3421ms 2.1161ms 472.5700 Ops/s 465.1586 Ops/s $\color{#35bf28}+1.59\%$
test_sac_speed[reduce-overhead-backward] 4.1819ms 4.0317ms 248.0321 Ops/s 235.0857 Ops/s $\textbf{\color{#35bf28}+5.51\%}$
test_redq_speed[False-None] 14.8222ms 10.5297ms 94.9693 Ops/s 94.3912 Ops/s $\color{#35bf28}+0.61\%$
test_redq_speed[False-backward] 18.6200ms 18.1118ms 55.2127 Ops/s 54.9343 Ops/s $\color{#35bf28}+0.51\%$
test_redq_speed[True-None] 4.5801ms 4.3111ms 231.9587 Ops/s 230.0550 Ops/s $\color{#35bf28}+0.83\%$
test_redq_speed[True-backward] 9.9263ms 9.6945ms 103.1513 Ops/s 104.9363 Ops/s $\color{#d91a1a}-1.70\%$
test_redq_speed[reduce-overhead-None] 4.7442ms 4.3672ms 228.9800 Ops/s 233.5624 Ops/s $\color{#d91a1a}-1.96\%$
test_redq_speed[reduce-overhead-backward] 10.2575ms 9.9385ms 100.6188 Ops/s 101.1269 Ops/s $\color{#d91a1a}-0.50\%$
test_redq_deprec_speed[False-None] 13.7303ms 11.1753ms 89.4827 Ops/s 88.9627 Ops/s $\color{#35bf28}+0.58\%$
test_redq_deprec_speed[False-backward] 25.0567ms 16.3232ms 61.2625 Ops/s 62.5843 Ops/s $\color{#d91a1a}-2.11\%$
test_redq_deprec_speed[True-None] 4.2051ms 3.7010ms 270.1965 Ops/s 269.8403 Ops/s $\color{#35bf28}+0.13\%$
test_redq_deprec_speed[True-backward] 8.1283ms 7.8385ms 127.5752 Ops/s 118.4545 Ops/s $\textbf{\color{#35bf28}+7.70\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.0399ms 3.5875ms 278.7486 Ops/s 272.7781 Ops/s $\color{#35bf28}+2.19\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.2148ms 7.7450ms 129.1154 Ops/s 128.9328 Ops/s $\color{#35bf28}+0.14\%$
test_td3_speed[False-None] 8.3339ms 7.9791ms 125.3278 Ops/s 124.5606 Ops/s $\color{#35bf28}+0.62\%$
test_td3_speed[False-backward] 11.3591ms 10.8710ms 91.9875 Ops/s 91.7976 Ops/s $\color{#35bf28}+0.21\%$
test_td3_speed[True-None] 1.8469ms 1.8077ms 553.2000 Ops/s 537.7267 Ops/s $\color{#35bf28}+2.88\%$
test_td3_speed[True-backward] 3.8288ms 3.6837ms 271.4630 Ops/s 267.2954 Ops/s $\color{#35bf28}+1.56\%$
test_td3_speed[reduce-overhead-None] 1.8511ms 1.7917ms 558.1436 Ops/s 542.4356 Ops/s $\color{#35bf28}+2.90\%$
test_td3_speed[reduce-overhead-backward] 3.9969ms 3.6914ms 270.9032 Ops/s 264.6066 Ops/s $\color{#35bf28}+2.38\%$
test_cql_speed[False-None] 32.6679ms 27.1867ms 36.7827 Ops/s 38.1778 Ops/s $\color{#d91a1a}-3.65\%$
test_cql_speed[False-backward] 38.3727ms 35.9062ms 27.8503 Ops/s 27.8787 Ops/s $\color{#d91a1a}-0.10\%$
test_cql_speed[True-None] 12.8823ms 12.4377ms 80.4010 Ops/s 79.7375 Ops/s $\color{#35bf28}+0.83\%$
test_cql_speed[True-backward] 18.9020ms 18.3962ms 54.3591 Ops/s 54.3941 Ops/s $\color{#d91a1a}-0.06\%$
test_cql_speed[reduce-overhead-None] 12.8212ms 12.4638ms 80.2326 Ops/s 79.6529 Ops/s $\color{#35bf28}+0.73\%$
test_cql_speed[reduce-overhead-backward] 18.9488ms 18.4585ms 54.1756 Ops/s 55.0173 Ops/s $\color{#d91a1a}-1.53\%$
test_a2c_speed[False-None] 5.8779ms 5.4518ms 183.4272 Ops/s 180.7380 Ops/s $\color{#35bf28}+1.49\%$
test_a2c_speed[False-backward] 12.7951ms 11.9460ms 83.7104 Ops/s 83.0319 Ops/s $\color{#35bf28}+0.82\%$
test_a2c_speed[True-None] 4.0203ms 3.7745ms 264.9330 Ops/s 265.7382 Ops/s $\color{#d91a1a}-0.30\%$
test_a2c_speed[True-backward] 8.9315ms 8.6744ms 115.2811 Ops/s 114.0093 Ops/s $\color{#35bf28}+1.12\%$
test_a2c_speed[reduce-overhead-None] 4.0059ms 3.7231ms 268.5926 Ops/s 267.9233 Ops/s $\color{#35bf28}+0.25\%$
test_a2c_speed[reduce-overhead-backward] 9.7329ms 8.7734ms 113.9812 Ops/s 103.5463 Ops/s $\textbf{\color{#35bf28}+10.08\%}$
test_ppo_speed[False-None] 6.2111ms 5.8939ms 169.6682 Ops/s 167.5254 Ops/s $\color{#35bf28}+1.28\%$
test_ppo_speed[False-backward] 12.8657ms 12.6384ms 79.1236 Ops/s 78.6542 Ops/s $\color{#35bf28}+0.60\%$
test_ppo_speed[True-None] 3.8177ms 3.6157ms 276.5727 Ops/s 274.1522 Ops/s $\color{#35bf28}+0.88\%$
test_ppo_speed[True-backward] 8.8128ms 8.4644ms 118.1412 Ops/s 117.6286 Ops/s $\color{#35bf28}+0.44\%$
test_ppo_speed[reduce-overhead-None] 3.8076ms 3.6009ms 277.7089 Ops/s 275.4882 Ops/s $\color{#35bf28}+0.81\%$
test_ppo_speed[reduce-overhead-backward] 9.1185ms 8.7618ms 114.1315 Ops/s 112.6754 Ops/s $\color{#35bf28}+1.29\%$
test_reinforce_speed[False-None] 7.0744ms 4.6204ms 216.4318 Ops/s 217.1133 Ops/s $\color{#d91a1a}-0.31\%$
test_reinforce_speed[False-backward] 7.6944ms 7.4497ms 134.2340 Ops/s 133.2496 Ops/s $\color{#35bf28}+0.74\%$
test_reinforce_speed[True-None] 3.2670ms 2.8679ms 348.6917 Ops/s 328.4490 Ops/s $\textbf{\color{#35bf28}+6.16\%}$
test_reinforce_speed[True-backward] 8.0654ms 7.8525ms 127.3481 Ops/s 126.0737 Ops/s $\color{#35bf28}+1.01\%$
test_reinforce_speed[reduce-overhead-None] 3.0663ms 2.8640ms 349.1660 Ops/s 318.4093 Ops/s $\textbf{\color{#35bf28}+9.66\%}$
test_reinforce_speed[reduce-overhead-backward] 8.1656ms 7.9420ms 125.9123 Ops/s 124.0272 Ops/s $\color{#35bf28}+1.52\%$
test_iql_speed[False-None] 24.0473ms 20.1647ms 49.5917 Ops/s 49.5802 Ops/s $\color{#35bf28}+0.02\%$
test_iql_speed[False-backward] 32.4045ms 30.8730ms 32.3907 Ops/s 32.3312 Ops/s $\color{#35bf28}+0.18\%$
test_iql_speed[True-None] 9.8858ms 8.5918ms 116.3896 Ops/s 115.4724 Ops/s $\color{#35bf28}+0.79\%$
test_iql_speed[True-backward] 17.3747ms 16.9688ms 58.9316 Ops/s 58.2899 Ops/s $\color{#35bf28}+1.10\%$
test_iql_speed[reduce-overhead-None] 8.9668ms 8.6246ms 115.9472 Ops/s 113.0956 Ops/s $\color{#35bf28}+2.52\%$
test_iql_speed[reduce-overhead-backward] 18.3749ms 17.4867ms 57.1864 Ops/s 54.0103 Ops/s $\textbf{\color{#35bf28}+5.88\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.1221ms 5.8677ms 170.4250 Ops/s 174.1238 Ops/s $\color{#d91a1a}-2.12\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.0226ms 0.3501ms 2.8561 KOps/s 3.5183 KOps/s $\textbf{\color{#d91a1a}-18.82\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6244ms 0.3269ms 3.0594 KOps/s 3.7621 KOps/s $\textbf{\color{#d91a1a}-18.68\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.7736ms 5.5179ms 181.2293 Ops/s 177.9313 Ops/s $\color{#35bf28}+1.85\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.4107ms 0.3765ms 2.6563 KOps/s 2.9950 KOps/s $\textbf{\color{#d91a1a}-11.31\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5298ms 0.3363ms 2.9738 KOps/s 3.7904 KOps/s $\textbf{\color{#d91a1a}-21.54\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.8081ms 1.4839ms 673.9054 Ops/s 788.0471 Ops/s $\textbf{\color{#d91a1a}-14.48\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6405ms 1.4041ms 712.1771 Ops/s 839.4670 Ops/s $\textbf{\color{#d91a1a}-15.16\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.9733ms 5.7180ms 174.8865 Ops/s 172.6253 Ops/s $\color{#35bf28}+1.31\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1002ms 0.5138ms 1.9461 KOps/s 2.3123 KOps/s $\textbf{\color{#d91a1a}-15.84\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7099ms 0.4958ms 2.0171 KOps/s 2.3957 KOps/s $\textbf{\color{#d91a1a}-15.80\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.8148ms 5.6992ms 175.4624 Ops/s 177.3835 Ops/s $\color{#d91a1a}-1.08\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9137ms 0.2872ms 3.4815 KOps/s 3.5469 KOps/s $\color{#d91a1a}-1.84\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4754ms 0.2701ms 3.7026 KOps/s 3.7585 KOps/s $\color{#d91a1a}-1.49\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.9288ms 5.6527ms 176.9080 Ops/s 178.3293 Ops/s $\color{#d91a1a}-0.80\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8479ms 0.3164ms 3.1606 KOps/s 2.9834 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5367ms 0.3363ms 2.9735 KOps/s 3.5982 KOps/s $\textbf{\color{#d91a1a}-17.36\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.9836ms 5.8360ms 171.3511 Ops/s 173.9258 Ops/s $\color{#d91a1a}-1.48\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.5862ms 0.5228ms 1.9127 KOps/s 2.3096 KOps/s $\textbf{\color{#d91a1a}-17.18\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6993ms 0.4935ms 2.0264 KOps/s 2.0253 KOps/s $\color{#35bf28}+0.05\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.5439s 15.8455ms 63.1095 Ops/s 58.9931 Ops/s $\textbf{\color{#35bf28}+6.98\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.3890ms 1.9699ms 507.6518 Ops/s 575.2421 Ops/s $\textbf{\color{#d91a1a}-11.75\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 2.2684ms 1.0067ms 993.3208 Ops/s 969.8537 Ops/s $\color{#35bf28}+2.42\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.0333ms 5.1152ms 195.4940 Ops/s 197.9655 Ops/s $\color{#d91a1a}-1.25\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.6035ms 2.0567ms 486.2124 Ops/s 490.9288 Ops/s $\color{#d91a1a}-0.96\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.2487ms 1.2135ms 824.0570 Ops/s 784.4724 Ops/s $\textbf{\color{#35bf28}+5.05\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.7004ms 5.2525ms 190.3869 Ops/s 58.1420 Ops/s $\textbf{\color{#35bf28}+227.45\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 12.1445ms 2.1157ms 472.6652 Ops/s 518.9192 Ops/s $\textbf{\color{#d91a1a}-8.91\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.6714ms 1.3201ms 757.4940 Ops/s 714.5837 Ops/s $\textbf{\color{#35bf28}+6.00\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 34.8755ms 32.9212ms 30.3756 Ops/s 30.3468 Ops/s $\color{#35bf28}+0.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 19.7296ms 17.5002ms 57.1422 Ops/s 57.0286 Ops/s $\color{#35bf28}+0.20\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 36.6467ms 34.2982ms 29.1561 Ops/s 29.1152 Ops/s $\color{#35bf28}+0.14\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 19.2274ms 17.7927ms 56.2027 Ops/s 56.6931 Ops/s $\color{#d91a1a}-0.86\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 41.1704ms 37.3773ms 26.7542 Ops/s 27.8527 Ops/s $\color{#d91a1a}-3.94\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 21.8086ms 20.2097ms 49.4811 Ops/s 51.9155 Ops/s $\color{#d91a1a}-4.69\%$

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/binaries/all Build all binaries CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants