rl
rl copied to clipboard
[CI] Fix windows upload wheels
:link: Helpful Links
:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2507
- :page_facing_up: Preview Python docs built from this PR
Note: Links to docs will display an error until the docs builds have been completed.
:x: 1 New Failure, 6 Unrelated Failures
As of commit 6fa1826b28d996072ed68c4bf211d51419a85ed9 with merge base 9f6c21f43642f032c7b903ed537e5a3a8749cade ():
NEW FAILURE - The following job has failed:
- Habitat Tests on Linux / tests (3.9, 12.1) / linux-job (gh)
RuntimeError: Command docker exec -t 13dfbea8aad118aac72dedb8bc726190e2b3e078789bbe0f91f25164170c224c /exec failed with exit code 134
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
- Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cpu (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow - Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda11_8 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow - Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_1 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow - Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_4 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow - Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job (gh) (trunk failure)
test/test_rb.py::TestEnsemble::test_rb[SamplerWithoutReplacement-48-None-None-Tensor-ListStorage] - Unit-tests on Windows / unittests-cpu / windows-job (gh) (trunk failure)
test/test_collector.py::TestCompile::test_compiled_policy[device0-compile_policy2-MultiSyncDataCollector]
This comment was automatically generated by Dr. CI and updates every 15 minutes.
$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests
Total Benchmarks: 143. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}8$.
Expand to view detailed results
| Name | Max | Mean | Ops | Ops on Repo HEAD |
Change |
|---|---|---|---|---|---|
| test_simple | 0.4086s | 0.4068s | 2.4584 Ops/s | 2.3473 Ops/s | $\color{#35bf28}+4.74\%$ |
| test_transformed | 0.6734s | 0.6002s | 1.6661 Ops/s | 1.7287 Ops/s | $\color{#d91a1a}-3.62\%$ |
| test_serial | 1.4099s | 1.3349s | 0.7491 Ops/s | 0.7546 Ops/s | $\color{#d91a1a}-0.73\%$ |
| test_parallel | 1.3791s | 1.3079s | 0.7646 Ops/s | 0.7580 Ops/s | $\color{#35bf28}+0.87\%$ |
| test_step_mdp_speed[True-True-True-True-True] | 0.1962ms | 28.3987μs | 35.2128 KOps/s | 35.1922 KOps/s | $\color{#35bf28}+0.06\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 50.7960μs | 16.9676μs | 58.9358 KOps/s | 58.6693 KOps/s | $\color{#35bf28}+0.45\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 57.0670μs | 15.8911μs | 62.9283 KOps/s | 60.9121 KOps/s | $\color{#35bf28}+3.31\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 46.4770μs | 9.3976μs | 106.4096 KOps/s | 106.4997 KOps/s | $\color{#d91a1a}-0.08\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 77.3850μs | 30.8896μs | 32.3734 KOps/s | 32.5838 KOps/s | $\color{#d91a1a}-0.65\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 0.6587ms | 19.3915μs | 51.5689 KOps/s | 52.9332 KOps/s | $\color{#d91a1a}-2.58\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 71.2030μs | 17.9561μs | 55.6913 KOps/s | 55.2094 KOps/s | $\color{#35bf28}+0.87\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 40.4660μs | 11.5277μs | 86.7476 KOps/s | 87.7461 KOps/s | $\color{#d91a1a}-1.14\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 73.6390μs | 33.1411μs | 30.1740 KOps/s | 29.9278 KOps/s | $\color{#35bf28}+0.82\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 70.7930μs | 21.4536μs | 46.6122 KOps/s | 47.6495 KOps/s | $\color{#d91a1a}-2.18\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 46.8880μs | 17.9834μs | 55.6067 KOps/s | 55.6254 KOps/s | $\color{#d91a1a}-0.03\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 59.8630μs | 11.5369μs | 86.6788 KOps/s | 88.1797 KOps/s | $\color{#d91a1a}-1.70\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 93.6560μs | 35.1049μs | 28.4860 KOps/s | 28.6397 KOps/s | $\color{#d91a1a}-0.54\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 70.0610μs | 23.2992μs | 42.9199 KOps/s | 43.5498 KOps/s | $\color{#d91a1a}-1.45\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 50.5250μs | 19.9637μs | 50.0909 KOps/s | 50.5568 KOps/s | $\color{#d91a1a}-0.92\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 73.9890μs | 13.4612μs | 74.2875 KOps/s | 75.5684 KOps/s | $\color{#d91a1a}-1.70\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 85.5210μs | 32.7550μs | 30.5297 KOps/s | 30.3331 KOps/s | $\color{#35bf28}+0.65\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 56.3550μs | 21.3561μs | 46.8251 KOps/s | 47.4937 KOps/s | $\color{#d91a1a}-1.41\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 57.6580μs | 21.0652μs | 47.4717 KOps/s | 47.3522 KOps/s | $\color{#35bf28}+0.25\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 38.0410μs | 13.0883μs | 76.4039 KOps/s | 76.5521 KOps/s | $\color{#d91a1a}-0.19\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 72.7060μs | 34.5226μs | 28.9666 KOps/s | 28.6696 KOps/s | $\color{#35bf28}+1.04\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 74.9010μs | 23.0308μs | 43.4200 KOps/s | 43.4091 KOps/s | $\color{#35bf28}+0.03\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 2.7200ms | 23.1278μs | 43.2380 KOps/s | 42.4525 KOps/s | $\color{#35bf28}+1.85\%$ |
| test_step_mdp_speed[False-True-False-False-False] | 0.6252ms | 15.1840μs | 65.8588 KOps/s | 66.2725 KOps/s | $\color{#d91a1a}-0.62\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 71.4040μs | 36.7892μs | 27.1819 KOps/s | 27.3464 KOps/s | $\color{#d91a1a}-0.60\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 79.3490μs | 25.4048μs | 39.3627 KOps/s | 40.2899 KOps/s | $\color{#d91a1a}-2.30\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 92.6910μs | 23.3117μs | 42.8970 KOps/s | 43.4924 KOps/s | $\color{#d91a1a}-1.37\%$ |
| test_step_mdp_speed[False-False-True-False-False] | 58.8000μs | 14.9754μs | 66.7760 KOps/s | 66.6057 KOps/s | $\color{#35bf28}+0.26\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 90.7080μs | 38.5561μs | 25.9363 KOps/s | 25.7638 KOps/s | $\color{#35bf28}+0.67\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 70.0540μs | 27.0580μs | 36.9577 KOps/s | 37.2359 KOps/s | $\color{#d91a1a}-0.75\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 69.0300μs | 25.0643μs | 39.8973 KOps/s | 40.0891 KOps/s | $\color{#d91a1a}-0.48\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 65.2930μs | 17.0576μs | 58.6250 KOps/s | 58.4799 KOps/s | $\color{#35bf28}+0.25\%$ |
| test_values[generalized_advantage_estimate-True-True] | 16.0097ms | 10.1695ms | 98.3329 Ops/s | 105.1625 Ops/s | $\textbf{\color{#d91a1a}-6.49\%}$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 37.4947ms | 35.3665ms | 28.2754 Ops/s | 29.9743 Ops/s | $\textbf{\color{#d91a1a}-5.67\%}$ |
| test_values[td0_return_estimate-False-False] | 0.2257ms | 0.1692ms | 5.9088 KOps/s | 5.9074 KOps/s | $\color{#35bf28}+0.02\%$ |
| test_values[td1_return_estimate-False-False] | 24.5747ms | 24.0445ms | 41.5896 Ops/s | 41.4998 Ops/s | $\color{#35bf28}+0.22\%$ |
| test_values[vec_td1_return_estimate-False-False] | 37.7296ms | 35.5746ms | 28.1099 Ops/s | 29.7777 Ops/s | $\textbf{\color{#d91a1a}-5.60\%}$ |
| test_values[td_lambda_return_estimate-True-False] | 37.8334ms | 34.8682ms | 28.6795 Ops/s | 28.6731 Ops/s | $\color{#35bf28}+0.02\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 37.4116ms | 35.5488ms | 28.1304 Ops/s | 30.0475 Ops/s | $\textbf{\color{#d91a1a}-6.38\%}$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 12.2946ms | 8.4631ms | 118.1597 Ops/s | 120.1198 Ops/s | $\color{#d91a1a}-1.63\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.7129ms | 2.1719ms | 460.4340 Ops/s | 460.3513 Ops/s | $\color{#35bf28}+0.02\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.5975ms | 0.3563ms | 2.8066 KOps/s | 2.8288 KOps/s | $\color{#d91a1a}-0.78\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 45.5381ms | 44.5985ms | 22.4223 Ops/s | 22.7818 Ops/s | $\color{#d91a1a}-1.58\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 3.8702ms | 3.0352ms | 329.4684 Ops/s | 329.8689 Ops/s | $\color{#d91a1a}-0.12\%$ |
| test_dqn_speed[False-None] | 5.8767ms | 1.3428ms | 744.7258 Ops/s | 752.5098 Ops/s | $\color{#d91a1a}-1.03\%$ |
| test_dqn_speed[False-backward] | 1.8798ms | 1.8022ms | 554.8816 Ops/s | 554.6386 Ops/s | $\color{#35bf28}+0.04\%$ |
| test_dqn_speed[True-None] | 0.5725ms | 0.4564ms | 2.1909 KOps/s | 2.1541 KOps/s | $\color{#35bf28}+1.71\%$ |
| test_dqn_speed[True-backward] | 0.9375ms | 0.8776ms | 1.1394 KOps/s | 778.1648 Ops/s | $\textbf{\color{#35bf28}+46.43\%}$ |
| test_dqn_speed[reduce-overhead-None] | 0.7925ms | 0.4607ms | 2.1704 KOps/s | 2.1219 KOps/s | $\color{#35bf28}+2.29\%$ |
| test_dqn_speed[reduce-overhead-backward] | 0.9591ms | 0.8761ms | 1.1414 KOps/s | 1.1405 KOps/s | $\color{#35bf28}+0.08\%$ |
| test_ddpg_speed[False-None] | 3.4748ms | 2.7979ms | 357.4160 Ops/s | 357.2395 Ops/s | $\color{#35bf28}+0.05\%$ |
| test_ddpg_speed[False-backward] | 4.0745ms | 3.9201ms | 255.0967 Ops/s | 254.8861 Ops/s | $\color{#35bf28}+0.08\%$ |
| test_ddpg_speed[True-None] | 1.1814ms | 1.0022ms | 997.7644 Ops/s | 989.2806 Ops/s | $\color{#35bf28}+0.86\%$ |
| test_ddpg_speed[True-backward] | 1.9520ms | 1.8771ms | 532.7275 Ops/s | 530.1423 Ops/s | $\color{#35bf28}+0.49\%$ |
| test_ddpg_speed[reduce-overhead-None] | 1.5098ms | 1.0021ms | 997.8753 Ops/s | 1.0003 KOps/s | $\color{#d91a1a}-0.24\%$ |
| test_ddpg_speed[reduce-overhead-backward] | 2.0832ms | 1.8955ms | 527.5742 Ops/s | 522.9666 Ops/s | $\color{#35bf28}+0.88\%$ |
| test_sac_speed[False-None] | 9.0939ms | 7.9098ms | 126.4256 Ops/s | 126.8898 Ops/s | $\color{#d91a1a}-0.37\%$ |
| test_sac_speed[False-backward] | 10.8971ms | 10.5838ms | 94.4842 Ops/s | 94.6243 Ops/s | $\color{#d91a1a}-0.15\%$ |
| test_sac_speed[True-None] | 2.4482ms | 1.8572ms | 538.4420 Ops/s | 540.8475 Ops/s | $\color{#d91a1a}-0.44\%$ |
| test_sac_speed[True-backward] | 4.3634ms | 3.5730ms | 279.8743 Ops/s | 283.5629 Ops/s | $\color{#d91a1a}-1.30\%$ |
| test_sac_speed[reduce-overhead-None] | 3.3892ms | 1.8516ms | 540.0670 Ops/s | 539.3981 Ops/s | $\color{#35bf28}+0.12\%$ |
| test_sac_speed[reduce-overhead-backward] | 4.3826ms | 3.5567ms | 281.1593 Ops/s | 277.8429 Ops/s | $\color{#35bf28}+1.19\%$ |
| test_redq_speed[False-None] | 19.6479ms | 13.4875ms | 74.1426 Ops/s | 80.9101 Ops/s | $\textbf{\color{#d91a1a}-8.36\%}$ |
| test_redq_speed[False-backward] | 25.7401ms | 22.1161ms | 45.2159 Ops/s | 46.1665 Ops/s | $\color{#d91a1a}-2.06\%$ |
| test_redq_speed[True-None] | 6.3622ms | 4.5580ms | 219.3940 Ops/s | 223.2604 Ops/s | $\color{#d91a1a}-1.73\%$ |
| test_redq_speed[True-backward] | 13.0342ms | 11.9037ms | 84.0076 Ops/s | 82.7807 Ops/s | $\color{#35bf28}+1.48\%$ |
| test_redq_speed[reduce-overhead-None] | 5.3758ms | 4.5792ms | 218.3797 Ops/s | 215.7432 Ops/s | $\color{#35bf28}+1.22\%$ |
| test_redq_speed[reduce-overhead-backward] | 13.0371ms | 12.1263ms | 82.4656 Ops/s | 83.5908 Ops/s | $\color{#d91a1a}-1.35\%$ |
| test_redq_deprec_speed[False-None] | 14.5298ms | 12.5726ms | 79.5379 Ops/s | 78.9328 Ops/s | $\color{#35bf28}+0.77\%$ |
| test_redq_deprec_speed[False-backward] | 19.4491ms | 18.1850ms | 54.9904 Ops/s | 54.5958 Ops/s | $\color{#35bf28}+0.72\%$ |
| test_redq_deprec_speed[True-None] | 4.6479ms | 3.5691ms | 280.1803 Ops/s | 279.8502 Ops/s | $\color{#35bf28}+0.12\%$ |
| test_redq_deprec_speed[True-backward] | 8.6290ms | 7.9837ms | 125.2555 Ops/s | 124.3933 Ops/s | $\color{#35bf28}+0.69\%$ |
| test_redq_deprec_speed[reduce-overhead-None] | 4.0697ms | 3.5762ms | 279.6291 Ops/s | 280.7713 Ops/s | $\color{#d91a1a}-0.41\%$ |
| test_redq_deprec_speed[reduce-overhead-backward] | 9.5726ms | 8.0994ms | 123.4656 Ops/s | 124.2293 Ops/s | $\color{#d91a1a}-0.61\%$ |
| test_td3_speed[False-None] | 8.0121ms | 7.7984ms | 128.2313 Ops/s | 128.6548 Ops/s | $\color{#d91a1a}-0.33\%$ |
| test_td3_speed[False-backward] | 12.3870ms | 10.2561ms | 97.5027 Ops/s | 97.7586 Ops/s | $\color{#d91a1a}-0.26\%$ |
| test_td3_speed[True-None] | 1.8925ms | 1.7515ms | 570.9379 Ops/s | 565.9981 Ops/s | $\color{#35bf28}+0.87\%$ |
| test_td3_speed[True-backward] | 3.4535ms | 3.3638ms | 297.2826 Ops/s | 279.7976 Ops/s | $\textbf{\color{#35bf28}+6.25\%}$ |
| test_td3_speed[reduce-overhead-None] | 1.9308ms | 1.7499ms | 571.4690 Ops/s | 563.7591 Ops/s | $\color{#35bf28}+1.37\%$ |
| test_td3_speed[reduce-overhead-backward] | 4.3795ms | 3.4013ms | 294.0061 Ops/s | 297.8467 Ops/s | $\color{#d91a1a}-1.29\%$ |
| test_cql_speed[False-None] | 37.2475ms | 35.3941ms | 28.2533 Ops/s | 27.8802 Ops/s | $\color{#35bf28}+1.34\%$ |
| test_cql_speed[False-backward] | 48.3532ms | 45.8453ms | 21.8125 Ops/s | 22.0298 Ops/s | $\color{#d91a1a}-0.99\%$ |
| test_cql_speed[True-None] | 16.6773ms | 15.5855ms | 64.1621 Ops/s | 64.2063 Ops/s | $\color{#d91a1a}-0.07\%$ |
| test_cql_speed[True-backward] | 23.4270ms | 22.1499ms | 45.1470 Ops/s | 44.3410 Ops/s | $\color{#35bf28}+1.82\%$ |
| test_cql_speed[reduce-overhead-None] | 16.5244ms | 15.5545ms | 64.2901 Ops/s | 63.4469 Ops/s | $\color{#35bf28}+1.33\%$ |
| test_cql_speed[reduce-overhead-backward] | 23.6545ms | 22.3495ms | 44.7437 Ops/s | 46.0217 Ops/s | $\color{#d91a1a}-2.78\%$ |
| test_a2c_speed[False-None] | 7.9681ms | 7.1051ms | 140.7445 Ops/s | 140.0349 Ops/s | $\color{#35bf28}+0.51\%$ |
| test_a2c_speed[False-backward] | 16.0000ms | 14.1197ms | 70.8229 Ops/s | 70.8427 Ops/s | $\color{#d91a1a}-0.03\%$ |
| test_a2c_speed[True-None] | 3.9489ms | 3.3301ms | 300.2883 Ops/s | 300.5295 Ops/s | $\color{#d91a1a}-0.08\%$ |
| test_a2c_speed[True-backward] | 10.1238ms | 9.7954ms | 102.0883 Ops/s | 103.4568 Ops/s | $\color{#d91a1a}-1.32\%$ |
| test_a2c_speed[reduce-overhead-None] | 3.7528ms | 3.3428ms | 299.1465 Ops/s | 299.8286 Ops/s | $\color{#d91a1a}-0.23\%$ |
| test_a2c_speed[reduce-overhead-backward] | 10.2009ms | 9.7223ms | 102.8561 Ops/s | 103.6082 Ops/s | $\color{#d91a1a}-0.73\%$ |
| test_ppo_speed[False-None] | 9.2189ms | 7.3568ms | 135.9290 Ops/s | 136.0712 Ops/s | $\color{#d91a1a}-0.10\%$ |
| test_ppo_speed[False-backward] | 15.9364ms | 14.4165ms | 69.3650 Ops/s | 70.0189 Ops/s | $\color{#d91a1a}-0.93\%$ |
| test_ppo_speed[True-None] | 4.4913ms | 3.7471ms | 266.8726 Ops/s | 269.9324 Ops/s | $\color{#d91a1a}-1.13\%$ |
| test_ppo_speed[True-backward] | 10.7637ms | 9.6115ms | 104.0425 Ops/s | 104.7276 Ops/s | $\color{#d91a1a}-0.65\%$ |
| test_ppo_speed[reduce-overhead-None] | 4.0400ms | 3.7203ms | 268.7930 Ops/s | 268.9217 Ops/s | $\color{#d91a1a}-0.05\%$ |
| test_ppo_speed[reduce-overhead-backward] | 10.4685ms | 9.5993ms | 104.1744 Ops/s | 104.6026 Ops/s | $\color{#d91a1a}-0.41\%$ |
| test_reinforce_speed[False-None] | 8.6289ms | 6.5082ms | 153.6533 Ops/s | 155.1621 Ops/s | $\color{#d91a1a}-0.97\%$ |
| test_reinforce_speed[False-backward] | 10.5220ms | 9.6801ms | 103.3044 Ops/s | 103.5567 Ops/s | $\color{#d91a1a}-0.24\%$ |
| test_reinforce_speed[True-None] | 3.6903ms | 2.6678ms | 374.8442 Ops/s | 378.3473 Ops/s | $\color{#d91a1a}-0.93\%$ |
| test_reinforce_speed[True-backward] | 9.8522ms | 8.5533ms | 116.9140 Ops/s | 117.4864 Ops/s | $\color{#d91a1a}-0.49\%$ |
| test_reinforce_speed[reduce-overhead-None] | 3.2443ms | 2.6499ms | 377.3680 Ops/s | 379.9329 Ops/s | $\color{#d91a1a}-0.68\%$ |
| test_reinforce_speed[reduce-overhead-backward] | 9.2752ms | 8.5615ms | 116.8020 Ops/s | 117.3000 Ops/s | $\color{#d91a1a}-0.42\%$ |
| test_iql_speed[False-None] | 33.2904ms | 31.8415ms | 31.4056 Ops/s | 31.0827 Ops/s | $\color{#35bf28}+1.04\%$ |
| test_iql_speed[False-backward] | 46.4789ms | 44.7947ms | 22.3241 Ops/s | 22.0272 Ops/s | $\color{#35bf28}+1.35\%$ |
| test_iql_speed[True-None] | 12.6952ms | 10.6741ms | 93.6847 Ops/s | 95.3040 Ops/s | $\color{#d91a1a}-1.70\%$ |
| test_iql_speed[True-backward] | 22.8179ms | 21.6590ms | 46.1701 Ops/s | 46.7935 Ops/s | $\color{#d91a1a}-1.33\%$ |
| test_iql_speed[reduce-overhead-None] | 11.9125ms | 10.6126ms | 94.2277 Ops/s | 96.0751 Ops/s | $\color{#d91a1a}-1.92\%$ |
| test_iql_speed[reduce-overhead-backward] | 22.6923ms | 21.6502ms | 46.1889 Ops/s | 45.7583 Ops/s | $\color{#35bf28}+0.94\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.2540ms | 4.7664ms | 209.8030 Ops/s | 211.8363 Ops/s | $\color{#d91a1a}-0.96\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 2.7707ms | 0.4779ms | 2.0924 KOps/s | 2.1005 KOps/s | $\color{#d91a1a}-0.38\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6896ms | 0.4539ms | 2.2030 KOps/s | 2.2086 KOps/s | $\color{#d91a1a}-0.25\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 7.0441ms | 4.6518ms | 214.9713 Ops/s | 217.8090 Ops/s | $\color{#d91a1a}-1.30\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.7696ms | 0.4709ms | 2.1237 KOps/s | 2.1430 KOps/s | $\color{#d91a1a}-0.90\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6836ms | 0.4488ms | 2.2284 KOps/s | 2.2348 KOps/s | $\color{#d91a1a}-0.29\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.2958ms | 1.5863ms | 630.3905 Ops/s | 635.0875 Ops/s | $\color{#d91a1a}-0.74\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1736ms | 1.5356ms | 651.1988 Ops/s | 653.9704 Ops/s | $\color{#d91a1a}-0.42\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.9988ms | 4.7992ms | 208.3681 Ops/s | 207.0425 Ops/s | $\color{#35bf28}+0.64\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.1636ms | 0.6146ms | 1.6269 KOps/s | 1.6207 KOps/s | $\color{#35bf28}+0.39\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9420ms | 0.5881ms | 1.7003 KOps/s | 1.6986 KOps/s | $\color{#35bf28}+0.10\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.9042ms | 4.6897ms | 213.2328 Ops/s | 209.7563 Ops/s | $\color{#35bf28}+1.66\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.6558ms | 0.4761ms | 2.1005 KOps/s | 2.0537 KOps/s | $\color{#35bf28}+2.28\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 7.3810ms | 0.4655ms | 2.1482 KOps/s | 2.1591 KOps/s | $\color{#d91a1a}-0.51\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.0367ms | 4.6395ms | 215.5412 Ops/s | 213.5465 Ops/s | $\color{#35bf28}+0.93\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.1719ms | 0.4757ms | 2.1021 KOps/s | 2.1348 KOps/s | $\color{#d91a1a}-1.53\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6585ms | 0.4522ms | 2.2114 KOps/s | 2.1767 KOps/s | $\color{#35bf28}+1.59\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.1134ms | 4.8084ms | 207.9706 Ops/s | 208.5532 Ops/s | $\color{#d91a1a}-0.28\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.5836ms | 0.6142ms | 1.6281 KOps/s | 1.6482 KOps/s | $\color{#d91a1a}-1.22\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7979ms | 0.5886ms | 1.6990 KOps/s | 1.7048 KOps/s | $\color{#d91a1a}-0.34\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 8.4490ms | 4.2677ms | 234.3190 Ops/s | 250.4588 Ops/s | $\textbf{\color{#d91a1a}-6.44\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 8.0693ms | 2.2876ms | 437.1366 Ops/s | 461.8849 Ops/s | $\textbf{\color{#d91a1a}-5.36\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 3.8540ms | 1.2292ms | 813.5612 Ops/s | 752.3575 Ops/s | $\textbf{\color{#35bf28}+8.13\%}$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.3638s | 11.4093ms | 87.6474 Ops/s | 37.6370 Ops/s | $\textbf{\color{#35bf28}+132.88\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 4.7984ms | 2.2526ms | 443.9331 Ops/s | 423.8300 Ops/s | $\color{#35bf28}+4.74\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 7.3742ms | 1.3047ms | 766.4444 Ops/s | 713.0531 Ops/s | $\textbf{\color{#35bf28}+7.49\%}$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 5.9296ms | 4.3854ms | 228.0310 Ops/s | 212.0353 Ops/s | $\textbf{\color{#35bf28}+7.54\%}$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 6.0253ms | 2.4186ms | 413.4634 Ops/s | 416.5488 Ops/s | $\color{#d91a1a}-0.74\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 6.6227ms | 1.4794ms | 675.9696 Ops/s | 715.9084 Ops/s | $\textbf{\color{#d91a1a}-5.58\%}$ |
$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests
Total Benchmarks: 143. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}7$.
Expand to view detailed results
| Name | Max | Mean | Ops | Ops on Repo HEAD |
Change |
|---|---|---|---|---|---|
| test_simple | 0.7581s | 0.7491s | 1.3350 Ops/s | 1.3560 Ops/s | $\color{#d91a1a}-1.55\%$ |
| test_transformed | 1.0771s | 0.9994s | 1.0006 Ops/s | 1.0222 Ops/s | $\color{#d91a1a}-2.11\%$ |
| test_serial | 2.2420s | 2.1618s | 0.4626 Ops/s | 0.4649 Ops/s | $\color{#d91a1a}-0.50\%$ |
| test_parallel | 2.0658s | 1.9880s | 0.5030 Ops/s | 0.4999 Ops/s | $\color{#35bf28}+0.63\%$ |
| test_step_mdp_speed[True-True-True-True-True] | 0.1816ms | 38.6488μs | 25.8740 KOps/s | 25.8450 KOps/s | $\color{#35bf28}+0.11\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 51.3410μs | 23.1125μs | 43.2666 KOps/s | 43.6861 KOps/s | $\color{#d91a1a}-0.96\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 61.5010μs | 20.5985μs | 48.5473 KOps/s | 48.8137 KOps/s | $\color{#d91a1a}-0.55\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 53.6910μs | 12.2641μs | 81.5390 KOps/s | 81.7009 KOps/s | $\color{#d91a1a}-0.20\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 74.1020μs | 41.2427μs | 24.2467 KOps/s | 24.1384 KOps/s | $\color{#35bf28}+0.45\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 55.5710μs | 25.2523μs | 39.6004 KOps/s | 39.3410 KOps/s | $\color{#35bf28}+0.66\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 66.2920μs | 24.0214μs | 41.6296 KOps/s | 42.7470 KOps/s | $\color{#d91a1a}-2.61\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 39.4110μs | 15.0623μs | 66.3910 KOps/s | 66.8289 KOps/s | $\color{#d91a1a}-0.66\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 79.9510μs | 44.5240μs | 22.4598 KOps/s | 22.2990 KOps/s | $\color{#35bf28}+0.72\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 60.6510μs | 28.2193μs | 35.4368 KOps/s | 35.3574 KOps/s | $\color{#35bf28}+0.22\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 52.9010μs | 23.6491μs | 42.2849 KOps/s | 42.7734 KOps/s | $\color{#d91a1a}-1.14\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 38.9510μs | 15.0504μs | 66.4433 KOps/s | 66.6902 KOps/s | $\color{#d91a1a}-0.37\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 78.6010μs | 47.0955μs | 21.2334 KOps/s | 21.1599 KOps/s | $\color{#35bf28}+0.35\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 56.6910μs | 30.8443μs | 32.4210 KOps/s | 32.4929 KOps/s | $\color{#d91a1a}-0.22\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 54.4710μs | 25.9583μs | 38.5233 KOps/s | 37.7694 KOps/s | $\color{#35bf28}+2.00\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 41.7810μs | 17.6685μs | 56.5979 KOps/s | 55.5332 KOps/s | $\color{#35bf28}+1.92\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 74.1620μs | 44.8922μs | 22.2756 KOps/s | 22.1920 KOps/s | $\color{#35bf28}+0.38\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 55.4010μs | 28.3619μs | 35.2585 KOps/s | 34.9611 KOps/s | $\color{#35bf28}+0.85\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 73.1310μs | 28.5129μs | 35.0718 KOps/s | 34.5574 KOps/s | $\color{#35bf28}+1.49\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 44.3000μs | 17.3965μs | 57.4829 KOps/s | 56.3942 KOps/s | $\color{#35bf28}+1.93\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 84.9320μs | 46.7347μs | 21.3974 KOps/s | 21.1213 KOps/s | $\color{#35bf28}+1.31\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 61.6210μs | 31.3208μs | 31.9277 KOps/s | 32.9057 KOps/s | $\color{#d91a1a}-2.97\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 3.1534ms | 31.9554μs | 31.2936 KOps/s | 31.3926 KOps/s | $\color{#d91a1a}-0.32\%$ |
| test_step_mdp_speed[False-True-False-False-False] | 52.2710μs | 20.6018μs | 48.5393 KOps/s | 48.8994 KOps/s | $\color{#d91a1a}-0.74\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 78.4010μs | 50.1058μs | 19.9578 KOps/s | 20.0903 KOps/s | $\color{#d91a1a}-0.66\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 63.9810μs | 34.0419μs | 29.3756 KOps/s | 30.0306 KOps/s | $\color{#d91a1a}-2.18\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 65.6020μs | 30.5990μs | 32.6808 KOps/s | 32.1186 KOps/s | $\color{#35bf28}+1.75\%$ |
| test_step_mdp_speed[False-False-True-False-False] | 46.4410μs | 20.1711μs | 49.5759 KOps/s | 50.2719 KOps/s | $\color{#d91a1a}-1.38\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 82.9120μs | 51.8316μs | 19.2932 KOps/s | 19.2553 KOps/s | $\color{#35bf28}+0.20\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 64.8310μs | 36.4992μs | 27.3979 KOps/s | 28.0060 KOps/s | $\color{#d91a1a}-2.17\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 62.6620μs | 34.1750μs | 29.2611 KOps/s | 29.7389 KOps/s | $\color{#d91a1a}-1.61\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 56.3910μs | 23.1392μs | 43.2166 KOps/s | 44.8311 KOps/s | $\color{#d91a1a}-3.60\%$ |
| test_values[generalized_advantage_estimate-True-True] | 25.3762ms | 25.0450ms | 39.9281 Ops/s | 40.6874 Ops/s | $\color{#d91a1a}-1.87\%$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 99.0969ms | 2.8763ms | 347.6733 Ops/s | 313.1596 Ops/s | $\textbf{\color{#35bf28}+11.02\%}$ |
| test_values[td0_return_estimate-False-False] | 86.7210μs | 66.4906μs | 15.0397 KOps/s | 15.2072 KOps/s | $\color{#d91a1a}-1.10\%$ |
| test_values[td1_return_estimate-False-False] | 56.0590ms | 55.7568ms | 17.9350 Ops/s | 18.2293 Ops/s | $\color{#d91a1a}-1.61\%$ |
| test_values[vec_td1_return_estimate-False-False] | 1.2939ms | 1.0787ms | 927.0087 Ops/s | 932.9208 Ops/s | $\color{#d91a1a}-0.63\%$ |
| test_values[td_lambda_return_estimate-True-False] | 90.2283ms | 88.7109ms | 11.2726 Ops/s | 11.5208 Ops/s | $\color{#d91a1a}-2.15\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 1.2681ms | 1.0758ms | 929.5457 Ops/s | 935.5877 Ops/s | $\color{#d91a1a}-0.65\%$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 24.8769ms | 24.6868ms | 40.5075 Ops/s | 40.9577 Ops/s | $\color{#d91a1a}-1.10\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 1.0345ms | 0.7616ms | 1.3130 KOps/s | 1.3441 KOps/s | $\color{#d91a1a}-2.31\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7828ms | 0.6672ms | 1.4987 KOps/s | 1.5092 KOps/s | $\color{#d91a1a}-0.69\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5217ms | 1.4785ms | 676.3387 Ops/s | 680.4539 Ops/s | $\color{#d91a1a}-0.60\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7197ms | 0.6823ms | 1.4656 KOps/s | 1.4743 KOps/s | $\color{#d91a1a}-0.59\%$ |
| test_dqn_speed[False-None] | 6.5919ms | 1.3304ms | 751.6428 Ops/s | 735.7617 Ops/s | $\color{#35bf28}+2.16\%$ |
| test_dqn_speed[False-backward] | 1.9413ms | 1.8620ms | 537.0554 Ops/s | 533.7860 Ops/s | $\color{#35bf28}+0.61\%$ |
| test_dqn_speed[True-None] | 0.9453ms | 0.5602ms | 1.7851 KOps/s | 1.7259 KOps/s | $\color{#35bf28}+3.43\%$ |
| test_dqn_speed[True-backward] | 1.0413ms | 1.0063ms | 993.7570 Ops/s | 967.4076 Ops/s | $\color{#35bf28}+2.72\%$ |
| test_dqn_speed[reduce-overhead-None] | 0.6853ms | 0.5757ms | 1.7369 KOps/s | 1.7390 KOps/s | $\color{#d91a1a}-0.12\%$ |
| test_dqn_speed[reduce-overhead-backward] | 1.0468ms | 1.0092ms | 990.9143 Ops/s | 981.5254 Ops/s | $\color{#35bf28}+0.96\%$ |
| test_ddpg_speed[False-None] | 3.0524ms | 2.6849ms | 372.4566 Ops/s | 365.0872 Ops/s | $\color{#35bf28}+2.02\%$ |
| test_ddpg_speed[False-backward] | 4.2864ms | 3.9538ms | 252.9217 Ops/s | 253.0487 Ops/s | $\color{#d91a1a}-0.05\%$ |
| test_ddpg_speed[True-None] | 1.4910ms | 1.2444ms | 803.5821 Ops/s | 745.3896 Ops/s | $\textbf{\color{#35bf28}+7.81\%}$ |
| test_ddpg_speed[True-backward] | 2.2972ms | 2.2302ms | 448.3934 Ops/s | 366.4170 Ops/s | $\textbf{\color{#35bf28}+22.37\%}$ |
| test_ddpg_speed[reduce-overhead-None] | 1.5070ms | 1.2547ms | 797.0331 Ops/s | 777.8032 Ops/s | $\color{#35bf28}+2.47\%$ |
| test_ddpg_speed[reduce-overhead-backward] | 2.3240ms | 2.2585ms | 442.7622 Ops/s | 451.4353 Ops/s | $\color{#d91a1a}-1.92\%$ |
| test_sac_speed[False-None] | 7.9557ms | 7.6271ms | 131.1112 Ops/s | 128.1762 Ops/s | $\color{#35bf28}+2.29\%$ |
| test_sac_speed[False-backward] | 11.2461ms | 10.8244ms | 92.3837 Ops/s | 91.1922 Ops/s | $\color{#35bf28}+1.31\%$ |
| test_sac_speed[True-None] | 2.4064ms | 2.0309ms | 492.3974 Ops/s | 488.8467 Ops/s | $\color{#35bf28}+0.73\%$ |
| test_sac_speed[True-backward] | 4.0609ms | 3.9580ms | 252.6529 Ops/s | 231.6162 Ops/s | $\textbf{\color{#35bf28}+9.08\%}$ |
| test_sac_speed[reduce-overhead-None] | 2.6054ms | 2.0647ms | 484.3313 Ops/s | 490.3121 Ops/s | $\color{#d91a1a}-1.22\%$ |
| test_sac_speed[reduce-overhead-backward] | 4.2222ms | 3.9584ms | 252.6262 Ops/s | 253.1817 Ops/s | $\color{#d91a1a}-0.22\%$ |
| test_redq_speed[False-None] | 11.5962ms | 9.9603ms | 100.3987 Ops/s | 95.8881 Ops/s | $\color{#35bf28}+4.70\%$ |
| test_redq_speed[False-backward] | 18.3133ms | 17.2029ms | 58.1296 Ops/s | 55.9483 Ops/s | $\color{#35bf28}+3.90\%$ |
| test_redq_speed[True-None] | 3.8186ms | 3.5593ms | 280.9568 Ops/s | 276.5976 Ops/s | $\color{#35bf28}+1.58\%$ |
| test_redq_speed[True-backward] | 8.9828ms | 8.5258ms | 117.2908 Ops/s | 107.8923 Ops/s | $\textbf{\color{#35bf28}+8.71\%}$ |
| test_redq_speed[reduce-overhead-None] | 3.7241ms | 3.4963ms | 286.0180 Ops/s | 279.3345 Ops/s | $\color{#35bf28}+2.39\%$ |
| test_redq_speed[reduce-overhead-backward] | 8.8998ms | 8.4554ms | 118.2676 Ops/s | 116.5038 Ops/s | $\color{#35bf28}+1.51\%$ |
| test_redq_deprec_speed[False-None] | 10.9746ms | 10.5430ms | 94.8492 Ops/s | 93.2207 Ops/s | $\color{#35bf28}+1.75\%$ |
| test_redq_deprec_speed[False-backward] | 15.6918ms | 15.2886ms | 65.4082 Ops/s | 64.7408 Ops/s | $\color{#35bf28}+1.03\%$ |
| test_redq_deprec_speed[True-None] | 3.3768ms | 3.1840ms | 314.0697 Ops/s | 305.6809 Ops/s | $\color{#35bf28}+2.74\%$ |
| test_redq_deprec_speed[True-backward] | 7.2117ms | 7.0286ms | 142.2758 Ops/s | 140.0384 Ops/s | $\color{#35bf28}+1.60\%$ |
| test_redq_deprec_speed[reduce-overhead-None] | 3.3360ms | 3.1733ms | 315.1301 Ops/s | 304.3459 Ops/s | $\color{#35bf28}+3.54\%$ |
| test_redq_deprec_speed[reduce-overhead-backward] | 7.2602ms | 7.0589ms | 141.6644 Ops/s | 139.1514 Ops/s | $\color{#35bf28}+1.81\%$ |
| test_td3_speed[False-None] | 7.9238ms | 7.5721ms | 132.0637 Ops/s | 130.0925 Ops/s | $\color{#35bf28}+1.52\%$ |
| test_td3_speed[False-backward] | 10.6332ms | 10.3538ms | 96.5826 Ops/s | 94.9284 Ops/s | $\color{#35bf28}+1.74\%$ |
| test_td3_speed[True-None] | 1.9350ms | 1.9078ms | 524.1770 Ops/s | 523.1381 Ops/s | $\color{#35bf28}+0.20\%$ |
| test_td3_speed[True-backward] | 3.8169ms | 3.6927ms | 270.8064 Ops/s | 263.8615 Ops/s | $\color{#35bf28}+2.63\%$ |
| test_td3_speed[reduce-overhead-None] | 1.9339ms | 1.8975ms | 527.0005 Ops/s | 521.4170 Ops/s | $\color{#35bf28}+1.07\%$ |
| test_td3_speed[reduce-overhead-backward] | 3.7798ms | 3.6735ms | 272.2172 Ops/s | 269.0432 Ops/s | $\color{#35bf28}+1.18\%$ |
| test_cql_speed[False-None] | 29.3625ms | 25.3645ms | 39.4251 Ops/s | 39.7153 Ops/s | $\color{#d91a1a}-0.73\%$ |
| test_cql_speed[False-backward] | 39.0133ms | 34.8534ms | 28.6916 Ops/s | 29.2528 Ops/s | $\color{#d91a1a}-1.92\%$ |
| test_cql_speed[True-None] | 11.2065ms | 10.8208ms | 92.4147 Ops/s | 92.5663 Ops/s | $\color{#d91a1a}-0.16\%$ |
| test_cql_speed[True-backward] | 16.8731ms | 16.6028ms | 60.2309 Ops/s | 60.7399 Ops/s | $\color{#d91a1a}-0.84\%$ |
| test_cql_speed[reduce-overhead-None] | 11.1487ms | 10.8843ms | 91.8753 Ops/s | 92.2892 Ops/s | $\color{#d91a1a}-0.45\%$ |
| test_cql_speed[reduce-overhead-backward] | 16.9206ms | 16.5642ms | 60.3712 Ops/s | 61.0470 Ops/s | $\color{#d91a1a}-1.11\%$ |
| test_a2c_speed[False-None] | 5.6113ms | 5.3561ms | 186.7033 Ops/s | 185.5709 Ops/s | $\color{#35bf28}+0.61\%$ |
| test_a2c_speed[False-backward] | 11.9419ms | 11.6439ms | 85.8821 Ops/s | 84.5695 Ops/s | $\color{#35bf28}+1.55\%$ |
| test_a2c_speed[True-None] | 3.3196ms | 3.0191ms | 331.2294 Ops/s | 331.3734 Ops/s | $\color{#d91a1a}-0.04\%$ |
| test_a2c_speed[True-backward] | 8.8132ms | 8.5316ms | 117.2119 Ops/s | 108.1328 Ops/s | $\textbf{\color{#35bf28}+8.40\%}$ |
| test_a2c_speed[reduce-overhead-None] | 3.1806ms | 3.0286ms | 330.1853 Ops/s | 327.1372 Ops/s | $\color{#35bf28}+0.93\%$ |
| test_a2c_speed[reduce-overhead-backward] | 8.7657ms | 8.4991ms | 117.6601 Ops/s | 118.7880 Ops/s | $\color{#d91a1a}-0.95\%$ |
| test_ppo_speed[False-None] | 5.8196ms | 5.5395ms | 180.5221 Ops/s | 174.2414 Ops/s | $\color{#35bf28}+3.60\%$ |
| test_ppo_speed[False-backward] | 12.3878ms | 12.0459ms | 83.0157 Ops/s | 81.2422 Ops/s | $\color{#35bf28}+2.18\%$ |
| test_ppo_speed[True-None] | 3.6209ms | 3.4550ms | 289.4321 Ops/s | 285.7002 Ops/s | $\color{#35bf28}+1.31\%$ |
| test_ppo_speed[True-backward] | 8.3636ms | 8.1667ms | 122.4484 Ops/s | 122.2084 Ops/s | $\color{#35bf28}+0.20\%$ |
| test_ppo_speed[reduce-overhead-None] | 3.6146ms | 3.4481ms | 290.0151 Ops/s | 286.1752 Ops/s | $\color{#35bf28}+1.34\%$ |
| test_ppo_speed[reduce-overhead-backward] | 8.4948ms | 8.2426ms | 121.3213 Ops/s | 121.1171 Ops/s | $\color{#35bf28}+0.17\%$ |
| test_reinforce_speed[False-None] | 4.9882ms | 4.4380ms | 225.3288 Ops/s | 220.3451 Ops/s | $\color{#35bf28}+2.26\%$ |
| test_reinforce_speed[False-backward] | 7.8178ms | 7.2839ms | 137.2891 Ops/s | 138.1718 Ops/s | $\color{#d91a1a}-0.64\%$ |
| test_reinforce_speed[True-None] | 2.3515ms | 2.2153ms | 451.4091 Ops/s | 448.9679 Ops/s | $\color{#35bf28}+0.54\%$ |
| test_reinforce_speed[True-backward] | 7.4281ms | 7.0727ms | 141.3883 Ops/s | 118.1327 Ops/s | $\textbf{\color{#35bf28}+19.69\%}$ |
| test_reinforce_speed[reduce-overhead-None] | 2.4498ms | 2.2278ms | 448.8765 Ops/s | 445.3594 Ops/s | $\color{#35bf28}+0.79\%$ |
| test_reinforce_speed[reduce-overhead-backward] | 7.4984ms | 7.1333ms | 140.1884 Ops/s | 141.7125 Ops/s | $\color{#d91a1a}-1.08\%$ |
| test_iql_speed[False-None] | 20.8861ms | 19.4845ms | 51.3228 Ops/s | 50.4789 Ops/s | $\color{#35bf28}+1.67\%$ |
| test_iql_speed[False-backward] | 30.5193ms | 29.7255ms | 33.6411 Ops/s | 33.3283 Ops/s | $\color{#35bf28}+0.94\%$ |
| test_iql_speed[True-None] | 7.1689ms | 6.7265ms | 148.6649 Ops/s | 148.4977 Ops/s | $\color{#35bf28}+0.11\%$ |
| test_iql_speed[True-backward] | 15.9689ms | 15.3784ms | 65.0262 Ops/s | 63.7760 Ops/s | $\color{#35bf28}+1.96\%$ |
| test_iql_speed[reduce-overhead-None] | 7.0541ms | 6.7490ms | 148.1710 Ops/s | 149.5718 Ops/s | $\color{#d91a1a}-0.94\%$ |
| test_iql_speed[reduce-overhead-backward] | 15.7876ms | 15.2892ms | 65.4057 Ops/s | 64.9228 Ops/s | $\color{#35bf28}+0.74\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.4348ms | 6.2151ms | 160.8983 Ops/s | 159.6188 Ops/s | $\color{#35bf28}+0.80\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.2952s | 0.4484ms | 2.2300 KOps/s | 2.8197 KOps/s | $\textbf{\color{#d91a1a}-20.91\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.4203ms | 0.2197ms | 4.5507 KOps/s | 3.4479 KOps/s | $\textbf{\color{#35bf28}+31.98\%}$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.4889ms | 6.2318ms | 160.4674 Ops/s | 163.5400 Ops/s | $\color{#d91a1a}-1.88\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.0946ms | 0.2775ms | 3.6040 KOps/s | 3.0663 KOps/s | $\textbf{\color{#35bf28}+17.53\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.5091ms | 0.2525ms | 3.9610 KOps/s | 3.2433 KOps/s | $\textbf{\color{#35bf28}+22.13\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.4525ms | 1.2372ms | 808.3069 Ops/s | 716.7932 Ops/s | $\textbf{\color{#35bf28}+12.77\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.5577ms | 1.2535ms | 797.7779 Ops/s | 738.5569 Ops/s | $\textbf{\color{#35bf28}+8.02\%}$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.5060ms | 6.4005ms | 156.2373 Ops/s | 159.6125 Ops/s | $\color{#d91a1a}-2.11\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.9189ms | 0.4260ms | 2.3472 KOps/s | 2.1338 KOps/s | $\textbf{\color{#35bf28}+10.00\%}$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.6481ms | 0.4002ms | 2.4990 KOps/s | 2.2122 KOps/s | $\textbf{\color{#35bf28}+12.97\%}$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.3648ms | 6.2639ms | 159.6449 Ops/s | 163.5341 Ops/s | $\color{#d91a1a}-2.38\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.9291ms | 0.3409ms | 2.9336 KOps/s | 4.1246 KOps/s | $\textbf{\color{#d91a1a}-28.88\%}$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.5639ms | 0.2934ms | 3.4077 KOps/s | 4.5859 KOps/s | $\textbf{\color{#d91a1a}-25.69\%}$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 9.8467ms | 6.2093ms | 161.0479 Ops/s | 162.5700 Ops/s | $\color{#d91a1a}-0.94\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.8068ms | 0.2716ms | 3.6813 KOps/s | 4.2029 KOps/s | $\textbf{\color{#d91a1a}-12.41\%}$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.4573ms | 0.2156ms | 4.6390 KOps/s | 4.6432 KOps/s | $\color{#d91a1a}-0.09\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.8102ms | 6.4341ms | 155.4227 Ops/s | 154.2962 Ops/s | $\color{#35bf28}+0.73\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.4025ms | 0.4144ms | 2.4129 KOps/s | 2.5751 KOps/s | $\textbf{\color{#d91a1a}-6.30\%}$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7857ms | 0.3954ms | 2.5289 KOps/s | 2.7201 KOps/s | $\textbf{\color{#d91a1a}-7.03\%}$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 6.9799ms | 5.3088ms | 188.3671 Ops/s | 184.2846 Ops/s | $\color{#35bf28}+2.22\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 7.2499ms | 1.9913ms | 502.1869 Ops/s | 453.1663 Ops/s | $\textbf{\color{#35bf28}+10.82\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 8.4390ms | 1.2620ms | 792.4026 Ops/s | 799.8761 Ops/s | $\color{#d91a1a}-0.93\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.4078s | 13.4513ms | 74.3422 Ops/s | 180.2077 Ops/s | $\textbf{\color{#d91a1a}-58.75\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 9.5270ms | 2.0706ms | 482.9571 Ops/s | 479.7881 Ops/s | $\color{#35bf28}+0.66\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 7.0976ms | 1.1989ms | 834.0707 Ops/s | 816.7118 Ops/s | $\color{#35bf28}+2.13\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 7.2180ms | 5.5333ms | 180.7235 Ops/s | 176.9021 Ops/s | $\color{#35bf28}+2.16\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 9.8762ms | 2.1755ms | 459.6738 Ops/s | 404.5056 Ops/s | $\textbf{\color{#35bf28}+13.64\%}$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 6.9109ms | 1.3736ms | 727.9931 Ops/s | 730.5017 Ops/s | $\color{#d91a1a}-0.34\%$ |