cccl icon indicating copy to clipboard operation
cccl copied to clipboard

Implement `cuda::mul_overflow`

Open davebayer opened this issue 9 months ago • 8 comments

This PR introduces the initial cuda::mul_overflow implementation. With that also a cuda::__neg utility function is introduced and applied in cuda::uabs function.

Allowing type mixing of input/output parameters makes everything quite complicated..

davebayer avatar Apr 11 '25 13:04 davebayer

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

copy-pr-bot[bot] avatar Apr 11 '25 13:04 copy-pr-bot[bot]

/ok to test ce30a04

miscco avatar Apr 11 '25 14:04 miscco

/ok to test 3d47497

miscco avatar Apr 11 '25 15:04 miscco

/ok to test 3d47497

miscco avatar Apr 11 '25 15:04 miscco

🟨 CI finished in 6h 05m: Pass: 98%/170 | Total: 2d 09h | Avg: 20m 15s | Max: 1h 34m | Hits: 93%/264305
  • 🟨 libcudacxx: Pass: 95%/45 | Total: 12h 58m | Avg: 17m 18s | Max: 43m 10s | Hits: 87%/110597

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/43  | Total: 12h 26m | Avg: 17m 21s | Max: 43m 10s | Hits:  86%/104568
      🟩 arm64              Pass: 100%/2   | Total: 32m 16s | Avg: 16m 08s | Max: 16m 17s | Hits:  97%/6029  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 00s | Avg: 24m 00s | Max: 27m 07s | Hits:  27%/5989  
      🔍 nvcc               Pass:  95%/43  | Total: 12h 10m | Avg: 16m 59s | Max: 43m 10s | Hits:  90%/104608
    🚨 cxx: MSVC14.29 🚨
      🟩 Clang14            Pass: 100%/4   | Total:  1h 08m | Avg: 17m 08s | Max: 24m 02s | Hits:  79%/11946 
      🟩 Clang15            Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 16m 24s | Hits:  98%/5985  
      🟩 Clang16            Pass: 100%/2   | Total: 34m 05s | Avg: 17m 02s | Max: 17m 37s | Hits:  95%/5985  
      🟩 Clang17            Pass: 100%/2   | Total: 32m 21s | Avg: 16m 10s | Max: 16m 30s | Hits:  98%/5985  
      🟩 Clang18            Pass: 100%/2   | Total: 53m 22s | Avg: 26m 41s | Max: 32m 20s | Hits:  96%/5985  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 54m | Avg: 19m 02s | Max: 27m 07s | Hits:  68%/14988 
      🟩 GCC7               Pass: 100%/2   | Total: 17m 23s | Avg:  8m 41s | Max: 11m 10s | Hits:  85%/5921  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s | Hits:  98%/2971  
      🟩 GCC9               Pass: 100%/2   | Total: 27m 32s | Avg: 13m 46s | Max: 15m 23s | Hits:  98%/5933  
      🟩 GCC10              Pass: 100%/2   | Total: 31m 54s | Avg: 15m 57s | Max: 16m 14s | Hits:  97%/5991  
      🟩 GCC11              Pass: 100%/2   | Total: 32m 21s | Avg: 16m 10s | Max: 16m 56s | Hits:  97%/5987  
      🟩 GCC12              Pass: 100%/2   | Total: 34m 02s | Avg: 17m 01s | Max: 18m 24s | Hits:  98%/5987  
      🟩 GCC13              Pass: 100%/10  | Total:  2h 02m | Avg: 12m 16s | Max: 17m 56s | Hits:  98%/15250 
      🔥 MSVC14.29          Pass:   0%/2   | Total: 45m 51s | Avg: 22m 55s | Max: 23m 28s
      🟩 MSVC14.42          Pass: 100%/2   | Total: 47m 59s | Avg: 23m 59s | Max: 24m 35s | Hits:  98%/5708  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 19m | Avg: 39m 33s | Max: 43m 10s | Hits:  29%/5975  
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/18  | Total:  5h 35m | Avg: 18m 38s | Max: 32m 20s | Hits:  84%/50874 
      🟩 GCC                Pass: 100%/21  | Total:  4h 30m | Avg: 12m 52s | Max: 18m 24s | Hits:  96%/48040 
      🔍 MSVC               Pass:  50%/4   | Total:  1h 33m | Avg: 23m 27s | Max: 24m 35s | Hits:  98%/5708  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 19m | Avg: 39m 33s | Max: 43m 10s | Hits:  29%/5975  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max: 12m 31s | Hits:  99%/3104  
      🔍 rtx2080            Pass:  95%/43  | Total: 12h 39m | Avg: 17m 40s | Max: 43m 10s | Hits:  87%/107493
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  94%/39  | Total: 11h 43m | Avg: 18m 02s | Max: 43m 10s | Hits:  87%/110557
      🟩 NVRTC              Pass: 100%/2   | Total: 33m 44s | Avg: 16m 52s | Max: 17m 56s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 39m 09s | Avg: 13m 03s | Max: 17m 15s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s
    🔍 std: 17 🔍
      🔍 17                 Pass:  90%/22  | Total:  6h 33m | Avg: 17m 52s | Max: 35m 57s | Hits:  85%/56283 
      🟩 20                 Pass: 100%/22  | Total:  6h 23m | Avg: 17m 25s | Max: 43m 10s | Hits:  89%/54314 
    🟨 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 48m 00s | Avg: 24m 00s | Max: 27m 07s | Hits:  27%/5989  
      🟨 nvcc12.0           Pass:  80%/5   | Total:  1h 22m | Avg: 16m 28s | Max: 24m 02s | Hits:  76%/11878 
      🟨 nvcc12.8           Pass:  97%/38  | Total: 10h 48m | Avg: 17m 03s | Max: 43m 10s | Hits:  92%/92730 
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  1h 22m | Avg: 16m 28s | Max: 24m 02s | Hits:  76%/11878 
      🟨 12.8               Pass:  97%/40  | Total: 11h 36m | Avg: 17m 24s | Max: 43m 10s | Hits:  88%/98719 
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 33m 44s | Avg: 16m 52s | Max: 17m 56s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max: 12m 31s | Hits:  99%/3104  
      🟩 90;90a;100         Pass: 100%/1   | Total: 10m 38s | Avg: 10m 38s | Max: 10m 38s | Hits:  97%/3104  
    
  • 🟩 cub: Pass: 100%/47 | Total: 1d 06h | Avg: 38m 43s | Max: 1h 02m | Hits: 96%/56545

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 04h | Avg: 38m 27s | Max:  1h 02m | Hits:  96%/54087 
      🟩 arm64              Pass: 100%/2   | Total:  1h 28m | Avg: 44m 25s | Max: 45m 04s | Hits:  99%/2458  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 19m | Avg: 39m 52s | Max: 40m 39s | Hits:  96%/5974  
      🟩 12.8               Pass: 100%/42  | Total:  1d 03h | Avg: 38m 34s | Max:  1h 02m | Hits:  96%/50571 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 33m | Avg: 46m 41s | Max: 47m 03s | Hits:  99%/2120  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 19m | Avg: 39m 52s | Max: 40m 39s | Hits:  96%/5974  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 01h | Avg: 38m 10s | Max:  1h 02m | Hits:  96%/48451 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 33m | Avg: 46m 41s | Max: 47m 03s | Hits:  99%/2120  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 04h | Avg: 38m 21s | Max:  1h 02m | Hits:  96%/54425 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 38m | Avg: 39m 32s | Max: 41m 12s | Hits:  99%/4924  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 17m | Avg: 38m 47s | Max: 39m 18s | Hits:  98%/2458  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 14m | Avg: 37m 26s | Max: 38m 26s | Hits:  97%/2458  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 18m | Avg: 39m 17s | Max: 40m 19s | Hits:  95%/2458  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 13m | Avg: 36m 31s | Max: 37m 10s | Hits:  99%/2458  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 20m | Avg: 37m 10s | Max: 47m 03s | Hits:  99%/8265  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 52s | Max: 39m 17s | Hits:  98%/2462  
      🟩 GCC8               Pass: 100%/1   | Total: 36m 40s | Avg: 36m 40s | Max: 36m 40s | Hits:  98%/1231  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 44s | Max: 38m 51s | Hits:  95%/2462  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 33s | Max: 37m 53s | Hits:  96%/2462  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 17m | Avg: 38m 48s | Max: 39m 07s | Hits:  98%/2458  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 07s | Max: 40m 48s | Hits:  93%/2458  
      🟩 GCC13              Pass: 100%/11  | Total:  5h 58m | Avg: 32m 33s | Max:  1h 01m | Hits:  96%/13519 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 27s | Max:  1h 02m | Hits:  89%/2100  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 56s | Max: 57m 22s | Hits:  89%/2100  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 47s | Max: 53m 11s | Hits:  86%/2272  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 12h 02m | Avg: 38m 01s | Max: 47m 03s | Hits:  98%/23021 
      🟩 GCC                Pass: 100%/22  | Total: 13h 03m | Avg: 35m 35s | Max:  1h 01m | Hits:  96%/27052 
      🟩 MSVC               Pass: 100%/4   | Total:  3h 30m | Avg: 52m 42s | Max:  1h 02m | Hits:  89%/4200  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 43m | Avg: 51m 47s | Max: 53m 11s | Hits:  86%/2272  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 53m 58s | Avg: 17m 59s | Max: 26m 48s | Hits:  99%/3687  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 01h | Avg: 42m 28s | Max:  1h 02m | Hits:  95%/43026 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 56m | Avg: 29m 36s | Max: 39m 33s | Hits:  98%/9832  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 02h | Avg: 41m 17s | Max:  1h 02m | Hits:  95%/46713 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 31m 00s | Avg: 31m 00s | Max: 31m 00s | Hits:  99%/1229  
      🟩 GraphCapture       Pass: 100%/1   | Total: 23m 46s | Avg: 23m 46s | Max: 23m 46s | Hits:  99%/1229  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 22m | Avg: 27m 33s | Max: 30m 28s | Hits:  99%/3687  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 12m | Avg: 24m 00s | Max: 26m 51s | Hits:  99%/3687  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 53m 58s | Avg: 17m 59s | Max: 26m 48s | Hits:  99%/3687  
      🟩 90;90a;100         Pass: 100%/1   | Total: 45m 41s | Avg: 45m 41s | Max: 45m 41s | Hits:  97%/1229  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 14h 51m | Avg: 42m 27s | Max:  1h 02m | Hits:  94%/25026 
      🟩 20                 Pass: 100%/26  | Total: 15h 28m | Avg: 35m 42s | Max: 57m 22s | Hits:  97%/31519 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 9h 16m | Avg: 11m 50s | Max: 1h 01m | Hits: 96%/83463

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max: 11m 05s | Hits:  99%/3554  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  9h 06m | Avg: 12m 08s | Max:  1h 01m | Hits:  96%/79910 
      🟩 arm64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 11s | Hits:  99%/3553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 53m 16s | Avg: 10m 39s | Max: 33m 29s | Hits:  95%/8876  
      🟩 12.8               Pass: 100%/42  | Total:  8h 23m | Avg: 11m 59s | Max:  1h 01m | Hits:  97%/74587 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 13s | Hits: 100%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 16s | Avg: 10m 39s | Max: 33m 29s | Hits:  95%/8876  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  8h 13m | Avg: 12m 19s | Max:  1h 01m | Hits:  97%/71035 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 13s | Hits: 100%/3552  
      🟩 nvcc               Pass: 100%/45  | Total:  9h 06m | Avg: 12m 08s | Max:  1h 01m | Hits:  96%/79911 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 37s | Hits: 100%/7104  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 45s | Hits: 100%/3552  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 30s | Hits: 100%/3552  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 36s | Hits: 100%/3552  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 04s | Avg:  5m 32s | Max:  5m 32s | Hits: 100%/3552  
      🟩 Clang19            Pass: 100%/7   | Total: 44m 17s | Avg:  6m 19s | Max:  9m 42s | Hits: 100%/12432 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 26s | Hits:  99%/3554  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s | Hits:  99%/1777  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  6m 00s | Hits:  99%/3554  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  6m 20s | Hits:  99%/3554  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 13s | Avg:  8m 36s | Max: 11m 30s | Hits:  99%/3554  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  6m 25s | Hits:  99%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 18m | Avg:  7m 51s | Max: 12m 04s | Hits:  99%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 34m 54s | Hits:  82%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 55m | Avg: 38m 34s | Max: 45m 28s | Hits:  81%/5310  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 52s | Max:  1h 01m | Hits:  74%/3552  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 49m | Avg:  5m 46s | Max:  9m 42s | Hits: 100%/33744 
      🟩 GCC                Pass: 100%/21  | Total:  2h 27m | Avg:  7m 00s | Max: 12m 04s | Hits:  99%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 04m | Avg: 36m 49s | Max: 45m 28s | Hits:  81%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 52s | Max:  1h 01m | Hits:  74%/3552  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 52s | Avg:  8m 26s | Max: 12m 04s | Hits:  99%/3554  
      🟩 rtx2080            Pass: 100%/35  | Total:  6h 35m | Avg: 11m 17s | Max:  1h 01m | Hits:  96%/62156 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 24m | Avg: 14m 27s | Max: 45m 28s | Hits:  96%/17753 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  7h 44m | Avg: 11m 36s | Max:  1h 01m | Hits:  96%/71033 
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 07s | Avg: 16m 02s | Max: 31m 22s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 32s | Avg: 11m 08s | Max: 12m 04s | Hits:  99%/7107  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 52s | Avg:  8m 26s | Max: 12m 04s | Hits:  99%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 18s | Avg:  6m 18s | Max:  6m 18s | Hits:  99%/1777  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 28m | Avg: 12m 48s | Max:  1h 01m | Hits:  95%/37287 
      🟩 20                 Pass: 100%/24  | Total:  4h 29m | Avg: 11m 14s | Max: 54m 09s | Hits:  97%/42622 
    
  • 🟩 cudax: Pass: 100%/24 | Total: 2h 31m | Avg: 6m 18s | Max: 14m 34s | Hits: 99%/13372

    🟩 cpu
      🟩 amd64              Pass: 100%/20  | Total:  2h 19m | Avg:  6m 59s | Max: 14m 34s | Hits:  99%/11044 
      🟩 arm64              Pass: 100%/4   | Total: 11m 44s | Avg:  2m 56s | Max:  3m 05s | Hits:  99%/2328  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 42s | Avg: 11m 42s | Max: 11m 42s | Hits:  95%/284   
      🟩 12.8               Pass: 100%/23  | Total:  2h 19m | Avg:  6m 04s | Max: 14m 34s | Hits:  99%/13088 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 42s | Avg: 11m 42s | Max: 11m 42s | Hits:  95%/284   
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 19m | Avg:  6m 04s | Max: 14m 34s | Hits:  99%/13088 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/24  | Total:  2h 31m | Avg:  6m 18s | Max: 14m 34s | Hits:  99%/13372 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits: 100%/584   
      🟩 Clang15            Pass: 100%/1   | Total: 11m 43s | Avg: 11m 43s | Max: 11m 43s | Hits: 100%/582   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s | Hits: 100%/582   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s | Hits: 100%/582   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits: 100%/582   
      🟩 Clang19            Pass: 100%/4   | Total: 27m 03s | Avg:  6m 45s | Max: 13m 39s | Hits: 100%/2328  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s | Hits:  99%/584   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s | Hits:  99%/582   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s | Hits:  99%/582   
      🟩 GCC13              Pass: 100%/8   | Total: 46m 28s | Avg:  5m 48s | Max: 14m 34s | Hits:  99%/4656  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 42s | Avg: 11m 42s | Max: 11m 42s | Hits:  95%/284   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 57s | Avg: 11m 57s | Max: 11m 57s | Hits:  95%/284   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 17m 38s | Avg:  8m 49s | Max:  8m 53s | Hits:  97%/1160  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/9   | Total: 52m 34s | Avg:  5m 50s | Max: 13m 39s | Hits: 100%/5240  
      🟩 GCC                Pass: 100%/11  | Total: 57m 35s | Avg:  5m 14s | Max: 14m 34s | Hits:  99%/6404  
      🟩 MSVC               Pass: 100%/2   | Total: 23m 39s | Avg: 11m 49s | Max: 11m 57s | Hits:  95%/568   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 38s | Avg:  8m 49s | Max:  8m 53s | Hits:  97%/1160  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max: 14m 34s | Hits:  99%/1164  
      🟩 rtx2080            Pass: 100%/22  | Total:  2h 13m | Avg:  6m 04s | Max: 13m 39s | Hits:  99%/12208 
    🟩 jobs
      🟩 Build              Pass: 100%/21  | Total:  1h 50m | Avg:  5m 16s | Max: 11m 57s | Hits:  99%/11626 
      🟩 Test               Pass: 100%/3   | Total: 40m 34s | Avg: 13m 31s | Max: 14m 34s | Hits:  99%/1746  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 21m 05s | Avg:  7m 01s | Max: 14m 34s | Hits:  99%/1746  
      🟩 90a                Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s | Hits:  99%/582   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 00s | Avg:  4m 30s | Max:  8m 45s | Hits:  99%/2326  
      🟩 20                 Pass: 100%/20  | Total:  2h 13m | Avg:  6m 40s | Max: 14m 34s | Hits:  99%/11046 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 20m 05s | Avg: 5m 01s | Max: 5m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 26s | Avg:  5m 43s | Max:  5m 52s
      🟩 arm64              Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 44s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  5m 52s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 52s
      🟩 20                 Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  5m 34s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 21m 34s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/164   
      🟩 Test               Pass: 100%/1   | Total: 21m 34s | Avg: 21m 34s | Max: 21m 34s | Hits:  98%/164   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 34m | Avg: 1h 34m | Max: 1h 34m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 34m | Avg:  1h 34m | Max:  1h 34m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
121 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions[bot] avatar Apr 11 '25 21:04 github-actions[bot]

/ok to test 01c2c7e

miscco avatar Apr 16 '25 06:04 miscco

🟨 CI finished in 1h 59m: Pass: 97%/170 | Total: 1d 07h | Avg: 10m 56s | Max: 1h 35m | Hits: 93%/269900
  • 🟨 libcudacxx: Pass: 88%/45 | Total: 8h 56m | Avg: 11m 55s | Max: 35m 42s | Hits: 90%/116192

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  88%/43  | Total:  8h 37m | Avg: 12m 02s | Max: 35m 42s | Hits:  89%/110163
      🟩 arm64              Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 22s | Hits:  97%/6029  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 51m 59s | Avg: 10m 23s | Max: 20m 30s | Hits:  98%/14690 
      🔍 12.8               Pass:  87%/40  | Total:  8h 04m | Avg: 12m 06s | Max: 35m 42s | Hits:  88%/101502
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 44m 35s | Avg: 22m 17s | Max: 23m 24s | Hits:  27%/5989  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 51m 59s | Avg: 10m 23s | Max: 20m 30s | Hits:  98%/14690 
      🔍 nvcc12.8           Pass:  86%/38  | Total:  7h 19m | Avg: 11m 34s | Max: 35m 42s | Hits:  92%/95513 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 44m 35s | Avg: 22m 17s | Max: 23m 24s | Hits:  27%/5989  
      🔍 nvcc               Pass:  88%/43  | Total:  8h 11m | Avg: 11m 26s | Max: 35m 42s | Hits:  93%/110203
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 37m 29s | Avg:  9m 22s | Max:  9m 55s | Hits:  98%/11946 
      🟩 Clang15            Pass: 100%/2   | Total: 19m 17s | Avg:  9m 38s | Max:  9m 52s | Hits:  98%/5985  
      🟩 Clang16            Pass: 100%/2   | Total: 19m 32s | Avg:  9m 46s | Max:  9m 51s | Hits:  97%/5985  
      🟩 Clang17            Pass: 100%/2   | Total: 20m 00s | Avg: 10m 00s | Max: 10m 10s | Hits:  98%/5985  
      🟩 Clang18            Pass: 100%/2   | Total: 20m 44s | Avg: 10m 22s | Max: 10m 57s | Hits:  98%/5985  
      🟨 Clang19            Pass:  83%/6   | Total:  1h 22m | Avg: 13m 44s | Max: 23m 24s | Hits:  70%/14988 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 20s | Hits:  98%/5921  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s | Hits:  98%/2971  
      🟩 GCC9               Pass: 100%/2   | Total: 19m 41s | Avg:  9m 50s | Max:  9m 51s | Hits:  98%/5933  
      🟩 GCC10              Pass: 100%/2   | Total: 19m 33s | Avg:  9m 46s | Max: 10m 26s | Hits:  99%/5991  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  8m 59s | Hits:  98%/5987  
      🟩 GCC12              Pass: 100%/2   | Total: 19m 59s | Avg:  9m 59s | Max: 10m 03s | Hits:  98%/5987  
      🟨 GCC13              Pass:  60%/10  | Total:  1h 48m | Avg: 10m 53s | Max: 19m 51s | Hits:  98%/15210 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 20s | Avg: 21m 10s | Max: 21m 50s | Hits:  97%/5635  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 46m 45s | Avg: 23m 22s | Max: 23m 52s | Hits:  39%/5708  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 49m 27s | Avg: 24m 43s | Max: 35m 42s | Hits:  64%/5975  
    🟨 cxx_family
      🟨 Clang              Pass:  94%/18  | Total:  3h 19m | Avg: 11m 04s | Max: 23m 24s | Hits:  90%/50874 
      🟨 GCC                Pass:  80%/21  | Total:  3h 18m | Avg:  9m 26s | Max: 19m 51s | Hits:  98%/48000 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 29m | Avg: 22m 16s | Max: 23m 52s | Hits:  68%/11343 
      🟩 NVHPC              Pass: 100%/2   | Total: 49m 27s | Avg: 24m 43s | Max: 35m 42s | Hits:  64%/5975  
    🟨 jobs
      🟩 Build              Pass: 100%/39  | Total:  7h 40m | Avg: 11m 48s | Max: 35m 42s | Hits:  90%/116192
      🟥 NVRTC              Pass:   0%/2   | Total: 39m 04s | Avg: 19m 32s | Max: 19m 51s
      🟥 Test               Pass:   0%/3   | Total: 34m 45s | Avg: 11m 35s | Max: 13m 01s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 11s | Avg:  2m 11s | Max:  2m 11s
    🟨 sm
      🟥 75                 Pass:   0%/2   | Total: 39m 04s | Avg: 19m 32s | Max: 19m 51s
      🟨 90                 Pass:  50%/2   | Total: 18m 23s | Avg:  9m 11s | Max: 12m 25s | Hits:  99%/3104  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 29s | Avg:  8m 29s | Max:  8m 29s | Hits:  99%/3104  
    🟨 gpu
      🟨 h100               Pass:  50%/2   | Total: 18m 23s | Avg:  9m 11s | Max: 12m 25s | Hits:  99%/3104  
      🟨 rtx2080            Pass:  90%/43  | Total:  8h 38m | Avg: 12m 02s | Max: 35m 42s | Hits:  89%/113088
    🟨 std
      🟨 17                 Pass:  95%/22  | Total:  4h 41m | Avg: 12m 48s | Max: 35m 42s | Hits:  88%/61898 
      🟨 20                 Pass:  81%/22  | Total:  4h 12m | Avg: 11m 28s | Max: 23m 52s | Hits:  91%/54294 
    
  • 🟩 cub: Pass: 100%/47 | Total: 9h 36m | Avg: 12m 15s | Max: 37m 44s | Hits: 96%/56545

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  9h 23m | Avg: 12m 31s | Max: 37m 44s | Hits:  96%/54087 
      🟩 arm64              Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  6m 21s | Hits:  99%/2458  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 42m 59s | Avg:  8m 35s | Max: 18m 17s | Hits:  99%/5974  
      🟩 12.8               Pass: 100%/42  | Total:  8h 53m | Avg: 12m 41s | Max: 37m 44s | Hits:  96%/50571 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 12s | Avg:  5m 06s | Max:  5m 06s | Hits: 100%/2120  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 42m 59s | Avg:  8m 35s | Max: 18m 17s | Hits:  99%/5974  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  8h 42m | Avg: 13m 04s | Max: 37m 44s | Hits:  95%/48451 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 12s | Avg:  5m 06s | Max:  5m 06s | Hits: 100%/2120  
      🟩 nvcc               Pass: 100%/45  | Total:  9h 25m | Avg: 12m 34s | Max: 37m 44s | Hits:  96%/54425 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 53s | Avg:  6m 13s | Max:  6m 21s | Hits: 100%/4924  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  6m 50s | Hits: 100%/2458  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 08s | Avg:  6m 34s | Max:  6m 42s | Hits: 100%/2458  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 20s | Avg:  7m 10s | Max:  7m 33s | Hits: 100%/2458  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 44s | Avg:  6m 52s | Max:  7m 00s | Hits: 100%/2458  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 19m | Avg: 11m 18s | Max: 29m 06s | Hits: 100%/8265  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 55s | Avg:  6m 27s | Max:  6m 54s | Hits:  99%/2462  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 35s | Avg:  6m 35s | Max:  6m 35s | Hits:  99%/1231  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 57s | Avg:  6m 28s | Max:  6m 39s | Hits:  99%/2462  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 28s | Avg:  7m 14s | Max:  7m 30s | Hits:  99%/2462  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 01s | Avg:  7m 00s | Max:  7m 14s | Hits:  99%/2458  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max:  7m 10s | Hits:  99%/2458  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 36m | Avg: 19m 39s | Max: 37m 44s | Hits:  99%/13519 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 37m 54s | Avg: 18m 57s | Max: 19m 37s | Hits:  99%/2100  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 06s | Max: 35m 18s | Hits:  12%/2100  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 24m 15s | Avg: 12m 07s | Max: 12m 28s | Hits:  98%/2272  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 38m | Avg:  8m 20s | Max: 29m 06s | Hits: 100%/23021 
      🟩 GCC                Pass: 100%/22  | Total:  4h 51m | Avg: 13m 14s | Max: 37m 44s | Hits:  99%/27052 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 42m | Avg: 25m 31s | Max: 35m 18s | Hits:  55%/4200  
      🟩 NVHPC              Pass: 100%/2   | Total: 24m 15s | Avg: 12m 07s | Max: 12m 28s | Hits:  98%/2272  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 55m 13s | Avg: 18m 24s | Max: 27m 19s | Hits:  99%/3687  
      🟩 rtx2080            Pass: 100%/36  | Total:  5h 55m | Avg:  9m 52s | Max: 37m 44s | Hits:  95%/43026 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 45m | Avg: 20m 39s | Max: 29m 06s | Hits:  99%/9832  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 13m | Avg:  9m 35s | Max: 37m 44s | Hits:  95%/46713 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 26m 33s | Avg: 26m 33s | Max: 26m 33s | Hits:  99%/1229  
      🟩 GraphCapture       Pass: 100%/1   | Total: 23m 26s | Avg: 23m 26s | Max: 23m 26s | Hits:  99%/1229  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 24m | Avg: 28m 14s | Max: 29m 06s | Hits:  99%/3687  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 27s | Max: 23m 04s | Hits:  99%/3687  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 55m 13s | Avg: 18m 24s | Max: 27m 19s | Hits:  99%/3687  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 32s | Avg:  8m 32s | Max:  8m 32s | Hits:  99%/1229  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 41m | Avg: 10m 34s | Max: 37m 44s | Hits:  95%/25026 
      🟩 20                 Pass: 100%/26  | Total:  5h 54m | Avg: 13m 37s | Max: 35m 18s | Hits:  96%/31519 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 7h 56m | Avg: 10m 08s | Max: 46m 40s | Hits: 96%/83463

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 49s | Avg:  8m 54s | Max: 11m 44s | Hits:  99%/3554  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  7h 47m | Avg: 10m 22s | Max: 46m 40s | Hits:  96%/79910 
      🟩 arm64              Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  5m 13s | Hits:  99%/3553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 09s | Avg:  8m 13s | Max: 19m 57s | Hits:  99%/8876  
      🟩 12.8               Pass: 100%/42  | Total:  7h 15m | Avg: 10m 22s | Max: 46m 40s | Hits:  96%/74587 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 38s | Hits: 100%/3552  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 09s | Avg:  8m 13s | Max: 19m 57s | Hits:  99%/8876  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  7h 04m | Avg: 10m 36s | Max: 46m 40s | Hits:  95%/71035 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 38s | Hits: 100%/3552  
      🟩 nvcc               Pass: 100%/45  | Total:  7h 45m | Avg: 10m 21s | Max: 46m 40s | Hits:  96%/79911 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 30s | Avg:  5m 22s | Max:  5m 52s | Hits: 100%/7104  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  6m 13s | Hits: 100%/3552  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 24s | Avg:  6m 12s | Max:  6m 24s | Hits: 100%/3552  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 57s | Hits: 100%/3552  
      🟩 Clang18            Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  5m 26s | Hits: 100%/3552  
      🟩 Clang19            Pass: 100%/7   | Total: 46m 14s | Avg:  6m 36s | Max:  9m 42s | Hits: 100%/12432 
      🟩 GCC7               Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 50s | Hits:  99%/3554  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s | Hits:  99%/1777  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 41s | Avg:  6m 20s | Max:  7m 09s | Hits:  99%/3554  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 49s | Hits:  99%/3554  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 07s | Hits:  99%/3554  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 59s | Avg:  6m 29s | Max:  6m 35s | Hits:  99%/3554  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 17m | Avg:  7m 46s | Max: 11m 59s | Hits:  99%/17770 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 40m 05s | Avg: 20m 02s | Max: 20m 08s | Hits:  99%/3540  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 58m | Avg: 39m 35s | Max: 46m 40s | Hits:  46%/5310  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 16s | Max: 30m 56s | Hits:  99%/3552  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 54m | Avg:  6m 00s | Max:  9m 42s | Hits: 100%/33744 
      🟩 GCC                Pass: 100%/21  | Total:  2h 23m | Avg:  6m 50s | Max: 11m 59s | Hits:  99%/37317 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 38m | Avg: 31m 46s | Max: 46m 40s | Hits:  67%/8850  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 16s | Max: 30m 56s | Hits:  99%/3552  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 11m 59s | Hits:  99%/3554  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 19m | Avg:  9m 07s | Max: 45m 12s | Hits:  97%/62156 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 20m | Avg: 14m 04s | Max: 46m 40s | Hits:  91%/17753 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  6h 29m | Avg:  9m 43s | Max: 46m 40s | Hits:  95%/71033 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 18s | Avg: 14m 26s | Max: 26m 53s | Hits:  99%/5323  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 26s | Avg: 11m 06s | Max: 11m 59s | Hits:  99%/7107  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 11m 59s | Hits:  99%/3554  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 09s | Avg:  6m 09s | Max:  6m 09s | Hits:  99%/1777  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 35m | Avg: 10m 16s | Max: 45m 12s | Hits:  96%/37287 
      🟩 20                 Pass: 100%/24  | Total:  4h 03m | Avg: 10m 08s | Max: 46m 40s | Hits:  96%/42622 
    
  • 🟩 cudax: Pass: 100%/24 | Total: 2h 13m | Avg: 5m 33s | Max: 14m 14s | Hits: 98%/13372

    🟩 cpu
      🟩 amd64              Pass: 100%/20  | Total:  2h 01m | Avg:  6m 04s | Max: 14m 14s | Hits:  98%/11044 
      🟩 arm64              Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 00s | Hits:  99%/2328  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  95%/284   
      🟩 12.8               Pass: 100%/23  | Total:  2h 03m | Avg:  5m 22s | Max: 14m 14s | Hits:  98%/13088 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  95%/284   
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 03m | Avg:  5m 22s | Max: 14m 14s | Hits:  98%/13088 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/24  | Total:  2h 13m | Avg:  5m 33s | Max: 14m 14s | Hits:  98%/13372 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s | Hits: 100%/584   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s | Hits: 100%/582   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s | Hits: 100%/582   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s | Hits: 100%/582   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s | Hits: 100%/582   
      🟩 Clang19            Pass: 100%/4   | Total: 21m 29s | Avg:  5m 22s | Max: 11m 59s | Hits: 100%/2328  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s | Hits:  99%/584   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s | Hits:  99%/582   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s | Hits:  99%/582   
      🟩 GCC13              Pass: 100%/8   | Total: 46m 34s | Avg:  5m 49s | Max: 14m 14s | Hits:  99%/4656  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 35s | Avg:  9m 35s | Max:  9m 35s | Hits:  95%/284   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 10m 24s | Avg: 10m 24s | Max: 10m 24s | Hits:  43%/284   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 15m 46s | Avg:  7m 53s | Max:  8m 13s | Hits:  97%/1160  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/9   | Total: 39m 46s | Avg:  4m 25s | Max: 11m 59s | Hits: 100%/5240  
      🟩 GCC                Pass: 100%/11  | Total: 57m 50s | Avg:  5m 15s | Max: 14m 14s | Hits:  99%/6404  
      🟩 MSVC               Pass: 100%/2   | Total: 19m 59s | Avg:  9m 59s | Max: 10m 24s | Hits:  69%/568   
      🟩 NVHPC              Pass: 100%/2   | Total: 15m 46s | Avg:  7m 53s | Max:  8m 13s | Hits:  97%/1160  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 23s | Avg:  8m 41s | Max: 14m 14s | Hits:  99%/1164  
      🟩 rtx2080            Pass: 100%/22  | Total:  1h 55m | Avg:  5m 16s | Max: 12m 41s | Hits:  98%/12208 
    🟩 jobs
      🟩 Build              Pass: 100%/21  | Total:  1h 34m | Avg:  4m 29s | Max: 10m 24s | Hits:  98%/11626 
      🟩 Test               Pass: 100%/3   | Total: 38m 54s | Avg: 12m 58s | Max: 14m 14s | Hits:  99%/1746  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 20m 38s | Avg:  6m 52s | Max: 14m 14s | Hits:  99%/1746  
      🟩 90a                Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s | Hits:  99%/582   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 20s | Avg:  4m 20s | Max:  8m 13s | Hits:  99%/2326  
      🟩 20                 Pass: 100%/20  | Total:  1h 56m | Avg:  5m 48s | Max: 14m 14s | Hits:  98%/11046 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 59s | Avg: 4m 29s | Max: 5m 08s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 08s
      🟩 arm64              Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 06s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 59s | Avg:  4m 29s | Max:  5m 08s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  5m 08s
      🟩 20                 Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 54s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 21m 53s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 25s | Avg:  2m 25s | Max:  2m 25s | Hits:  98%/164   
      🟩 Test               Pass: 100%/1   | Total: 21m 53s | Avg: 21m 53s | Max: 21m 53s | Hits:  98%/164   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 35m | Avg: 1h 35m | Max: 1h 35m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 35m | Avg:  1h 35m | Max:  1h 35m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
121 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions[bot] avatar Apr 16 '25 08:04 github-actions[bot]

@davebayer FYI these two papers are related to the topic P3161 and "Unified integer overflow arithmetic"

fbusato avatar Apr 19 '25 02:04 fbusato

/ok to test 09a6087

miscco avatar Jul 16 '25 10:07 miscco

🟨 CI finished in 4h 40m: Pass: 77%/205 | Total: 4d 09h | Avg: 30m 54s | Max: 1h 47m | Hits: 70%/174012
  • 🟨 libcudacxx: Pass: 2%/48 | Total: 13h 09m | Avg: 16m 26s | Max: 56m 37s

    🟨 jobs
      🟥 Build              Pass:   0%/42  | Total: 12h 19m | Avg: 17m 37s | Max: 56m 37s
      🟥 NVRTC              Pass:   0%/2   | Total: 47m 31s | Avg: 23m 45s | Max: 24m 13s
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟨 cpu
      🟨 amd64              Pass:   2%/46  | Total: 12h 59m | Avg: 16m 56s | Max: 56m 37s
      🟥 arm64              Pass:   0%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 05s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  1h 05m | Avg: 13m 04s | Max: 47m 13s
      🟨 12.9               Pass:   2%/43  | Total: 12h 04m | Avg: 16m 50s | Max: 56m 37s
    🟨 cudacxx
      🟥 ClangCUDA19        Pass:   0%/2   | Total: 52m 46s | Avg: 26m 23s | Max: 27m 18s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  1h 05m | Avg: 13m 04s | Max: 47m 13s
      🟨 nvcc12.9           Pass:   2%/41  | Total: 11h 11m | Avg: 16m 22s | Max: 56m 37s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 52m 46s | Avg: 26m 23s | Max: 27m 18s
      🟨 nvcc               Pass:   2%/46  | Total: 12h 16m | Avg: 16m 00s | Max: 56m 37s
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 32m 18s | Avg:  8m 04s | Max: 18m 14s
      🟥 Clang15            Pass:   0%/2   | Total: 14m 13s | Avg:  7m 06s | Max:  9m 12s
      🟥 Clang16            Pass:   0%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  5m 33s
      🟥 Clang17            Pass:   0%/2   | Total: 31m 46s | Avg: 15m 53s | Max: 26m 47s
      🟥 Clang18            Pass:   0%/2   | Total: 21m 05s | Avg: 10m 32s | Max: 15m 54s
      🟥 Clang19            Pass:   0%/6   | Total:  1h 07m | Avg: 11m 18s | Max: 27m 18s
      🟥 GCC7               Pass:   0%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 05s
      🟥 GCC8               Pass:   0%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s
      🟥 GCC9               Pass:   0%/2   | Total: 27m 33s | Avg: 13m 46s | Max: 23m 00s
      🟥 GCC10              Pass:   0%/2   | Total: 30m 25s | Avg: 15m 12s | Max: 25m 43s
      🟥 GCC11              Pass:   0%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 04s
      🟥 GCC12              Pass:   0%/2   | Total: 29m 17s | Avg: 14m 38s | Max: 23m 42s
      🟨 GCC13              Pass:   9%/11  | Total:  1h 20m | Avg:  7m 21s | Max: 24m 13s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 43m | Avg: 51m 55s | Max: 56m 37s
      🟥 MSVC14.43          Pass:   0%/4   | Total:  3h 33m | Avg: 53m 21s | Max: 55m 37s
      🟥 NVHPC25.5          Pass:   0%/2   | Total:  1h 41m | Avg: 50m 34s | Max: 50m 45s
    🟨 cxx_family
      🟥 Clang              Pass:   0%/18  | Total:  2h 58m | Avg:  9m 53s | Max: 27m 18s
      🟨 GCC                Pass:   4%/22  | Total:  3h 12m | Avg:  8m 46s | Max: 25m 43s
      🟥 MSVC               Pass:   0%/6   | Total:  5h 17m | Avg: 52m 52s | Max: 56m 37s
      🟥 NVHPC              Pass:   0%/2   | Total:  1h 41m | Avg: 50m 34s | Max: 50m 45s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  5m 08s | Avg:  2m 34s | Max:  5m 08s
      🟨 rtx2080            Pass:   2%/46  | Total: 13h 04m | Avg: 17m 02s | Max: 56m 37s
    🟥 sm
      🟥 75                 Pass:   0%/2   | Total: 47m 31s | Avg: 23m 45s | Max: 24m 13s
      🟥 90                 Pass:   0%/2   | Total:  5m 08s | Avg:  2m 34s | Max:  5m 08s
      🟥 90;90a             Pass:   0%/2   | Total: 58m 10s | Avg: 29m 05s | Max: 52m 50s
      🟥 100;120            Pass:   0%/2   | Total: 58m 16s | Avg: 29m 08s | Max: 52m 44s
    🟥 std
      🟥 17                 Pass:   0%/22  | Total:  6h 26m | Avg: 17m 34s | Max: 56m 37s
      🟥 20                 Pass:   0%/25  | Total:  6h 41m | Avg: 16m 02s | Max: 55m 37s
    
  • 🟩 cub: Pass: 100%/50 | Total: 2d 01h | Avg: 59m 15s | Max: 1h 47m | Hits: 63%/61706

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 23h | Avg: 59m 16s | Max:  1h 47m | Hits:  63%/59190 
      🟩 arm64              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max:  1h 04m | Hits:  66%/2516  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 29m | Avg:  1h 05m | Max:  1h 36m | Hits:  55%/6186  
      🟩 12.9               Pass: 100%/45  | Total:  1d 19h | Avg: 58m 30s | Max:  1h 47m | Hits:  64%/55520 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 33m 09s | Hits:  72%/2165  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 29m | Avg:  1h 05m | Max:  1h 36m | Hits:  55%/6186  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 18h | Avg: 59m 44s | Max:  1h 47m | Hits:  64%/53355 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 10s | Max: 33m 09s | Hits:  72%/2165  
      🟩 nvcc               Pass: 100%/48  | Total:  2d 00h | Avg:  1h 00m | Max:  1h 47m | Hits:  63%/59541 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 53m | Avg: 58m 24s | Max:  1h 02m | Hits:  66%/5034  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 50s | Max:  1h 02m | Hits:  66%/2513  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 44s | Max: 57m 37s | Hits:  66%/2513  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m | Hits:  66%/2513  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 51m | Avg: 55m 35s | Max: 56m 40s | Hits:  66%/2513  
      🟩 Clang19            Pass: 100%/7   | Total:  5h 02m | Avg: 43m 11s | Max:  1h 04m | Hits:  77%/8449  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  65%/2516  
      🟩 GCC8               Pass: 100%/1   | Total: 58m 53s | Avg: 58m 53s | Max: 58m 53s | Hits:  65%/1258  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 08m | Hits:  65%/2516  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m | Hits:  65%/2517  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:  65%/2513  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 14m | Hits:  65%/2513  
      🟩 GCC13              Pass: 100%/12  | Total:  8h 51m | Avg: 44m 17s | Max:  1h 04m | Hits:  82%/15105 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  3h 10m | Avg:  1h 35m | Max:  1h 36m | Hits:  11%/2306  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  5h 54m | Avg:  1h 28m | Max:  1h 47m | Hits:  11%/4612  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 45m | Avg:  1h 22m | Max:  1h 23m | Hits:   9%/2315  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 16h 43m | Avg: 52m 49s | Max:  1h 04m | Hits:  70%/23535 
      🟩 GCC                Pass: 100%/23  | Total: 20h 48m | Avg: 54m 18s | Max:  1h 14m | Hits:  74%/28938 
      🟩 MSVC               Pass: 100%/6   | Total:  9h 04m | Avg:  1h 30m | Max:  1h 47m | Hits:  11%/6918  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 45m | Avg:  1h 22m | Max:  1h 23m | Hits:   9%/2315  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 32m 27s | Hits:  88%/3777  
      🟩 rtx2080            Pass: 100%/39  | Total:  1d 18h | Avg:  1h 04m | Max:  1h 47m | Hits:  55%/47863 
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 50m | Avg: 43m 45s | Max:  1h 04m | Hits:  91%/10066 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  1d 20h | Avg:  1h 03m | Max:  1h 47m | Hits:  56%/51638 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 40m 48s | Avg: 40m 48s | Max: 40m 48s | Hits:  99%/1259  
      🟩 GraphCapture       Pass: 100%/1   | Total: 32m 18s | Avg: 32m 18s | Max: 32m 18s | Hits:  99%/1259  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 44m | Avg: 34m 58s | Max: 40m 51s | Hits:  99%/3775  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 45m | Avg: 35m 07s | Max: 42m 07s | Hits:  99%/3775  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 32m 27s | Hits:  88%/3777  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 59m | Avg: 59m 55s | Max:  1h 14m | Hits:  39%/2412  
      🟩 100;120            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 49s | Max:  1h 13m | Hits:  39%/2412  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 23h 09m | Avg:  1h 06m | Max:  1h 39m | Hits:  56%/25810 
      🟩 20                 Pass: 100%/29  | Total:  1d 02h | Avg: 54m 14s | Max:  1h 47m | Hits:  68%/35896 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 1d 10h | Avg: 40m 58s | Max: 1h 44m | Hits: 72%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 50m 37s | Avg: 25m 18s | Max: 37m 28s | Hits:  89%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 09h | Avg: 41m 15s | Max:  1h 44m | Hits:  72%/91794 
      🟩 arm64              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 23s | Max: 36m 57s | Hits:  78%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 33m | Avg: 42m 44s | Max:  1h 15m | Hits:  64%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  1d 06h | Avg: 40m 47s | Max:  1h 44m | Hits:  73%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 57m 55s | Avg: 28m 57s | Max: 30m 11s | Hits:  78%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 33m | Avg: 42m 44s | Max:  1h 15m | Hits:  64%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 05h | Avg: 41m 20s | Max:  1h 44m | Hits:  73%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 55s | Avg: 28m 57s | Max: 30m 11s | Hits:  78%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  1d 09h | Avg: 41m 29s | Max:  1h 44m | Hits:  72%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 34s | Max: 38m 06s | Hits:  78%/7652  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 20s | Max: 36m 08s | Hits:  78%/3826  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 19m | Avg: 39m 36s | Max: 42m 44s | Hits:  78%/3826  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 14m | Avg: 37m 21s | Max: 38m 25s | Hits:  78%/3826  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 27s | Max: 38m 01s | Hits:  78%/3826  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 04m | Avg: 26m 17s | Max: 38m 25s | Hits:  84%/13391 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 18m | Avg: 39m 12s | Max: 40m 24s | Hits:  78%/3828  
      🟩 GCC8               Pass: 100%/1   | Total: 40m 18s | Avg: 40m 18s | Max: 40m 18s | Hits:  78%/1914  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 21m | Avg: 40m 53s | Max: 44m 56s | Hits:  78%/3828  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 17m | Avg: 38m 39s | Max: 39m 02s | Hits:  78%/3828  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 22s | Max: 43m 27s | Hits:  78%/3828  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 25m | Avg: 42m 42s | Max: 45m 12s | Hits:  78%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  4h 55m | Avg: 26m 53s | Max: 42m 21s | Hits:  86%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m | Hits:  24%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  5h 30m | Avg:  1h 06m | Max:  1h 25m | Hits:  43%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  3h 26m | Avg:  1h 43m | Max:  1h 44m | Hits:   6%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 10h 19m | Avg: 32m 37s | Max: 42m 44s | Hits:  80%/36347 
      🟩 GCC                Pass: 100%/22  | Total: 12h 19m | Avg: 33m 37s | Max: 45m 12s | Hits:  82%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  8h 02m | Avg:  1h 08m | Max:  1h 25m | Hits:  37%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total:  3h 26m | Avg:  1h 43m | Max:  1h 44m | Hits:   6%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 37m 39s | Avg: 18m 49s | Max: 22m 32s | Hits:  89%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  1d 04h | Avg: 45m 35s | Max:  1h 44m | Hits:  67%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 38m | Avg: 27m 53s | Max:  1h 14m | Hits:  88%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  1d 08h | Avg: 45m 17s | Max:  1h 44m | Hits:  68%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 11s | Avg: 16m 43s | Max: 32m 57s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 19s | Avg: 12m 49s | Max: 15m 07s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 37m 39s | Avg: 18m 49s | Max: 22m 32s | Hits:  89%/3828  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 41m | Avg: 50m 35s | Max:  1h 08m | Hits:  42%/3820  
      🟩 100;120            Pass: 100%/2   | Total:  1h 42m | Avg: 51m 00s | Max:  1h 09m | Hits:  63%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 16h 23m | Avg: 46m 51s | Max:  1h 42m | Hits:  66%/40160 
      🟩 20                 Pass: 100%/27  | Total: 16h 54m | Avg: 37m 34s | Max:  1h 44m | Hits:  75%/51633 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 4h 34m | Avg: 9m 47s | Max: 36m 07s | Hits: 81%/16190

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  4h 10m | Avg: 10m 26s | Max: 36m 07s | Hits:  79%/13706 
      🟩 arm64              Pass: 100%/4   | Total: 23m 33s | Avg:  5m 53s | Max:  6m 26s | Hits:  89%/2484  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 27m 14s | Avg:  9m 04s | Max: 16m 28s | Hits:  79%/1562  
      🟩 12.9               Pass: 100%/25  | Total:  4h 06m | Avg:  9m 52s | Max: 36m 07s | Hits:  81%/14628 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 27m 14s | Avg:  9m 04s | Max: 16m 28s | Hits:  79%/1562  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  4h 06m | Avg:  9m 52s | Max: 36m 07s | Hits:  81%/14628 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  4h 34m | Avg:  9m 47s | Max: 36m 07s | Hits:  81%/16190 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 57s | Hits:  90%/1244  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s | Hits:  89%/621   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 53s | Avg:  5m 53s | Max:  5m 53s | Hits:  89%/621   
      🟩 Clang17            Pass: 100%/1   | Total:  6m 10s | Avg:  6m 10s | Max:  6m 10s | Hits:  89%/621   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s | Hits:  89%/621   
      🟩 Clang19            Pass: 100%/4   | Total: 24m 59s | Avg:  6m 14s | Max:  7m 55s | Hits:  92%/2484  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 16s | Avg:  6m 08s | Max:  6m 30s | Hits:  89%/1244  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 57s | Avg:  6m 57s | Max:  6m 57s | Hits:  89%/621   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 26s | Avg:  6m 26s | Max:  6m 26s | Hits:  89%/621   
      🟩 GCC13              Pass: 100%/8   | Total: 55m 11s | Avg:  6m 53s | Max: 11m 02s | Hits:  92%/4968  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s | Hits:  37%/320   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 46m 32s | Avg: 15m 30s | Max: 17m 29s | Hits:  37%/966   
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 30s | Max: 36m 07s | Hits:  18%/1238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 59m 22s | Avg:  5m 56s | Max:  7m 55s | Hits:  90%/6212  
      🟩 GCC                Pass: 100%/12  | Total:  1h 20m | Avg:  6m 44s | Max: 11m 02s | Hits:  91%/7454  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 17m 29s | Hits:  37%/1286  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 30s | Max: 36m 07s | Hits:  18%/1238  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 12m 50s | Avg:  6m 25s | Max:  7m 18s | Hits:  94%/1242  
      🟩 rtx2080            Pass: 100%/26  | Total:  4h 21m | Avg: 10m 03s | Max: 36m 07s | Hits:  80%/14948 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  4h 07m | Avg:  9m 55s | Max: 36m 07s | Hits:  78%/14327 
      🟩 Test               Pass: 100%/3   | Total: 26m 15s | Avg:  8m 45s | Max: 11m 02s | Hits:  99%/1863  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 12m 50s | Avg:  6m 25s | Max:  7m 18s | Hits:  94%/1242  
      🟩 90;90a             Pass: 100%/2   | Total: 20m 36s | Avg: 10m 18s | Max: 14m 44s | Hits:  71%/943   
      🟩 100;120            Pass: 100%/2   | Total: 20m 16s | Avg: 10m 08s | Max: 14m 19s | Hits:  72%/943   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 46m 58s | Avg: 15m 39s | Max: 34m 53s | Hits:  65%/1861  
      🟩 20                 Pass: 100%/25  | Total:  3h 47m | Avg:  9m 05s | Max: 36m 07s | Hits:  83%/14329 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 20m | Avg: 11m 09s | Max: 23m 30s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 20m | Avg: 11m 09s | Max: 23m 30s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 24m | Avg: 10m 31s | Max: 20m 46s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 56m | Avg: 11m 40s | Max: 23m 30s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 18m 55s | Avg:  9m 27s | Max:  9m 44s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 13m | Avg: 18m 19s | Max: 23m 30s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 18m 59s | Avg:  4m 44s | Max:  6m 03s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 17m 33s | Avg:  4m 23s | Max:  5m 19s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 12m | Avg: 18m 03s | Max: 20m 46s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 47m | Avg: 11m 55s | Max: 23m 30s
      🟩 3.13               Pass: 100%/9   | Total:  1h 33m | Avg: 10m 24s | Max: 22m 21s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 17m 54s | Avg: 4m 28s | Max: 5m 38s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  5m 38s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 38s
      🟩 12.9               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 11s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 38s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  5m 38s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 38s
      🟩 GCC                Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 29s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  5m 38s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 17m 54s | Avg:  4m 28s | Max:  5m 38s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 15m 53s | Avg: 3m 58s | Max: 4m 08s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 49s
      🟩 arm64              Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 08s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 08s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 08s
      🟩 20                 Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 07s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 27m 54s | Avg: 9m 18s | Max: 14m 04s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 27m 54s | Avg:  9m 18s | Max: 14m 04s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 14m 04s | Avg: 14m 04s | Max: 14m 04s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 50s | Avg:  6m 55s | Max: 11m 20s | Hits:  98%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 30s | Avg:  2m 30s | Max:  2m 30s | Hits:  98%/165   
      🟩 Test               Pass: 100%/2   | Total: 25m 24s | Avg: 12m 42s | Max: 14m 04s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions[bot] avatar Jul 16 '25 14:07 github-actions[bot]