cccl icon indicating copy to clipboard operation
cccl copied to clipboard

Improve CUDA macros

Open davebayer opened this issue 8 months ago • 1 comments

This PR introduces several macros to improve the code clarity. Plus this PR changes _CCCL_PTX_ARCH macro to a function-like macro.

After this PR we can replace:

#if !_CCCL_COMPILER(NVRTC)
_CCCL_HOST int host_fn(...)
{
  // ...
}
#endif // !_CCCL_COMPILER(NVRTC)

#if _CCCL_HAS_CUDA_COMPILER() && (defined(__CUDA_ARCH__) || defined(_NVHPC_CUDA))
_CCCL_DEVICE int device_fn(...)
{
  // ...
}
#endif // _CCCL_HAS_CUDA_COMPILER() && (defined(__CUDA_ARCH__) || defined(_NVHPC_CUDA))

with:

#if _CCCL_HOST_COMPILATION()
_CCCL_HOST int host_fn(...)
{
  // ...
}
#endif // _CCCL_HOST_COMPILATION()

#if _CCCL_DEVICE_COMPILATION()
_CCCL_DEVICE int device_fn(...)
{
  // ...
}
#endif // _CCCL_DEVICE_COMPILATION()

and many others.

davebayer avatar Apr 26 '25 18:04 davebayer

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

copy-pr-bot[bot] avatar Apr 26 '25 18:04 copy-pr-bot[bot]

I love this PR. It makes the CUDA macros much more expressive and clearer.

fbusato avatar Apr 28 '25 16:04 fbusato

/ok to test cee36bc

miscco avatar Apr 29 '25 06:04 miscco

🟩 CI finished in 1h 45m: Pass: 100%/174 | Total: 1d 16h | Avg: 14m 03s | Max: 1h 30m | Hits: 87%/272197
  • 🟩 cub: Pass: 100%/47 | Total: 14h 53m | Avg: 19m 01s | Max: 1h 26m | Hits: 89%/56811

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 14h 41m | Avg: 19m 35s | Max:  1h 26m | Hits:  89%/54341 
      🟩 arm64              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 16s | Hits:  99%/2470  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 38m | Avg: 19m 41s | Max:  1h 14m | Hits:  85%/6002  
      🟩 12.8               Pass: 100%/42  | Total: 13h 15m | Avg: 18m 56s | Max:  1h 26m | Hits:  90%/50809 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 28s | Avg:  5m 14s | Max:  5m 23s | Hits: 100%/2128  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 38m | Avg: 19m 41s | Max:  1h 14m | Hits:  85%/6002  
      🟩 nvcc12.8           Pass: 100%/40  | Total: 13h 04m | Avg: 19m 37s | Max:  1h 26m | Hits:  90%/48681 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 28s | Avg:  5m 14s | Max:  5m 23s | Hits: 100%/2128  
      🟩 nvcc               Pass: 100%/45  | Total: 14h 43m | Avg: 19m 37s | Max:  1h 26m | Hits:  89%/54683 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 20s | Avg:  5m 50s | Max:  6m 09s | Hits: 100%/4948  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 47s | Hits: 100%/2470  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 31s | Avg:  6m 45s | Max:  6m 47s | Hits: 100%/2470  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 13s | Avg:  6m 36s | Max:  6m 50s | Hits: 100%/2470  
      🟩 Clang18            Pass: 100%/2   | Total: 12m 56s | Avg:  6m 28s | Max:  6m 39s | Hits: 100%/2470  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 17m | Avg: 11m 06s | Max: 26m 36s | Hits: 100%/8303  
      🟩 GCC7               Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 32s | Hits:  99%/2474  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 35s | Avg:  6m 35s | Max:  6m 35s | Hits:  99%/1237  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 04s | Avg:  6m 32s | Max:  6m 37s | Hits:  99%/2474  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 32s | Avg:  6m 46s | Max:  6m 53s | Hits:  99%/2474  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 12s | Avg:  7m 06s | Max:  7m 19s | Hits:  99%/2470  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 13s | Avg:  7m 06s | Max:  7m 21s | Hits:  99%/2470  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 06m | Avg: 16m 54s | Max: 28m 10s | Hits:  99%/13585 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 40m | Avg:  1h 20m | Max:  1h 26m | Hits:  15%/2108  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 43m | Avg:  1h 21m | Max:  1h 24m | Hits:  15%/2108  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2280  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 33m | Avg:  8m 05s | Max: 26m 36s | Hits: 100%/23131 
      🟩 GCC                Pass: 100%/22  | Total:  4h 20m | Avg: 11m 51s | Max: 28m 10s | Hits:  99%/27184 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 24m | Avg:  1h 21m | Max:  1h 26m | Hits:  15%/4216  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 34m | Avg:  1h 17m | Max:  1h 18m | Hits:  10%/2280  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 55m 20s | Avg: 18m 26s | Max: 27m 41s | Hits:  99%/3705  
      🟩 rtx2080            Pass: 100%/36  | Total: 11h 13m | Avg: 18m 42s | Max:  1h 26m | Hits:  86%/43226 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 44m | Avg: 20m 36s | Max: 28m 10s | Hits:  99%/9880  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total: 11h 32m | Avg: 17m 45s | Max:  1h 26m | Hits:  87%/46931 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 10s | Avg: 28m 10s | Max: 28m 10s | Hits:  99%/1235  
      🟩 GraphCapture       Pass: 100%/1   | Total: 22m 11s | Avg: 22m 11s | Max: 22m 11s | Hits:  99%/1235  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 21m | Avg: 27m 16s | Max: 27m 41s | Hits:  99%/3705  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 58s | Max: 24m 12s | Hits:  99%/3705  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 55m 20s | Avg: 18m 26s | Max: 27m 41s | Hits:  99%/3705  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 41s | Avg:  7m 41s | Max:  7m 41s | Hits:  99%/1235  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  7h 11m | Avg: 20m 33s | Max:  1h 26m | Hits:  85%/25142 
      🟩 20                 Pass: 100%/26  | Total:  7h 42m | Avg: 17m 46s | Max:  1h 19m | Hits:  93%/31669 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 11h 19m | Avg: 14m 26s | Max: 1h 30m | Hits: 89%/83557

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 59s | Avg:  8m 59s | Max: 11m 31s | Hits:  99%/3558  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 11h 09m | Avg: 14m 52s | Max:  1h 30m | Hits:  88%/80000 
      🟩 arm64              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  5m 14s | Hits:  99%/3557  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 13m | Avg: 14m 39s | Max: 52m 43s | Hits:  84%/8886  
      🟩 12.8               Pass: 100%/42  | Total: 10h 05m | Avg: 14m 25s | Max:  1h 30m | Hits:  90%/74671 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 24s | Hits: 100%/3556  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 13m | Avg: 14m 39s | Max: 52m 43s | Hits:  84%/8886  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  9h 55m | Avg: 14m 52s | Max:  1h 30m | Hits:  89%/71115 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 24s | Hits: 100%/3556  
      🟩 nvcc               Pass: 100%/45  | Total: 11h 08m | Avg: 14m 51s | Max:  1h 30m | Hits:  88%/80001 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 32s | Avg:  5m 23s | Max:  5m 48s | Hits: 100%/7112  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 35s | Hits: 100%/3556  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 56s | Hits: 100%/3556  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 40s | Hits: 100%/3556  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  5m 56s | Hits: 100%/3556  
      🟩 Clang19            Pass: 100%/7   | Total: 44m 49s | Avg:  6m 24s | Max: 10m 21s | Hits: 100%/12446 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 34s | Hits:  99%/3558  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s | Hits:  99%/1779  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  6m 17s | Hits:  99%/3558  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 47s | Avg:  5m 53s | Max:  6m 12s | Hits:  99%/3558  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 46s | Hits:  99%/3558  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 32s | Hits:  99%/3558  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 18m | Avg:  7m 53s | Max: 11m 36s | Hits:  99%/17790 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 45m | Avg: 52m 41s | Max: 52m 43s | Hits:  34%/3544  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 19m | Avg: 46m 39s | Max: 57m 00s | Hits:  45%/5316  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 59m | Avg:  1h 29m | Max:  1h 30m | Hits:   0%/3556  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 51m | Avg:  5m 51s | Max: 10m 21s | Hits: 100%/33782 
      🟩 GCC                Pass: 100%/21  | Total:  2h 23m | Avg:  6m 49s | Max: 11m 36s | Hits:  99%/37359 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 05m | Avg: 49m 04s | Max: 57m 00s | Hits:  40%/8860  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 59m | Avg:  1h 29m | Max:  1h 30m | Hits:   0%/3556  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 11m 00s | Hits:  99%/3558  
      🟩 rtx2080            Pass: 100%/35  | Total:  8h 29m | Avg: 14m 33s | Max:  1h 30m | Hits:  88%/62226 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 32m | Avg: 15m 17s | Max: 57m 00s | Hits:  91%/17773 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  9h 51m | Avg: 14m 47s | Max:  1h 30m | Hits:  87%/71113 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 12s | Avg: 14m 24s | Max: 27m 13s | Hits:  99%/5329  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 28s | Avg: 11m 07s | Max: 11m 36s | Hits:  99%/7115  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 11m 00s | Hits:  99%/3558  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s | Hits:  99%/1779  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  5h 47m | Avg: 16m 33s | Max:  1h 29m | Hits:  85%/37329 
      🟩 20                 Pass: 100%/24  | Total:  5h 13m | Avg: 13m 03s | Max:  1h 30m | Hits:  92%/42670 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 9h 43m | Avg: 12m 58s | Max: 45m 33s | Hits: 85%/116887

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  9h 18m | Avg: 12m 58s | Max: 45m 33s | Hits:  85%/110824
      🟩 arm64              Pass: 100%/2   | Total: 25m 29s | Avg: 12m 44s | Max: 21m 17s | Hits:  74%/6063  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 50m 32s | Avg: 10m 06s | Max: 34m 20s | Hits:  98%/14773 
      🟩 12.8               Pass: 100%/40  | Total:  8h 53m | Avg: 13m 19s | Max: 45m 33s | Hits:  83%/102114
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 46m 44s | Avg: 23m 22s | Max: 23m 23s | Hits:  27%/6023  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 50m 32s | Avg: 10m 06s | Max: 34m 20s | Hits:  98%/14773 
      🟩 nvcc12.8           Pass: 100%/38  | Total:  8h 06m | Avg: 12m 47s | Max: 45m 33s | Hits:  86%/96091 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 44s | Avg: 23m 22s | Max: 23m 23s | Hits:  27%/6023  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 56m | Avg: 12m 29s | Max: 45m 33s | Hits:  88%/110864
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  4m 50s | Hits:  98%/12014 
      🟩 Clang15            Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  5m 06s | Hits:  98%/6019  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 03s | Avg:  5m 01s | Max:  5m 06s | Hits:  98%/6019  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 18s | Hits:  98%/6019  
      🟩 Clang18            Pass: 100%/2   | Total: 43m 41s | Avg: 21m 50s | Max: 23m 15s | Hits:  51%/6019  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 31m | Avg: 15m 15s | Max: 23m 23s | Hits:  61%/15073 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 59s | Hits:  99%/5955  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 04s | Avg:  4m 04s | Max:  4m 04s | Hits:  98%/2988  
      🟩 GCC9               Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 08s | Hits:  98%/5967  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 21s | Hits:  98%/6025  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 44s | Hits:  98%/6021  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 56s | Hits:  98%/6021  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 48m | Avg: 10m 53s | Max: 21m 17s | Hits:  88%/15335 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 16m | Avg: 38m 19s | Max: 42m 19s | Hits:  98%/5665  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 27s | Max: 45m 33s | Hits:  97%/5738  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 22m | Avg: 41m 28s | Max: 43m 58s | Hits:   2%/6009  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  3h 03m | Avg: 10m 10s | Max: 23m 23s | Hits:  82%/51163 
      🟩 GCC                Pass: 100%/21  | Total:  2h 35m | Avg:  7m 25s | Max: 21m 17s | Hits:  95%/48312 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 41m | Avg: 40m 23s | Max: 45m 33s | Hits:  98%/11403 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 28s | Max: 43m 58s | Hits:   2%/6009  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 29s | Avg:  8m 14s | Max: 12m 24s | Hits:  99%/3121  
      🟩 rtx2080            Pass: 100%/43  | Total:  9h 27m | Avg: 13m 11s | Max: 45m 33s | Hits:  84%/113766
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  8h 25m | Avg: 12m 57s | Max: 45m 33s | Hits:  85%/116847
      🟩 NVRTC              Pass: 100%/2   | Total: 41m 00s | Avg: 20m 30s | Max: 20m 53s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 35m 15s | Avg: 11m 45s | Max: 13m 20s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 41m 00s | Avg: 20m 30s | Max: 20m 53s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 16m 29s | Avg:  8m 14s | Max: 12m 24s | Hits:  99%/3121  
      🟩 90;90a;100         Pass: 100%/1   | Total:  4m 50s | Avg:  4m 50s | Max:  4m 50s | Hits:  98%/3121  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  5h 02m | Avg: 13m 46s | Max: 42m 19s | Hits:  86%/62269 
      🟩 20                 Pass: 100%/22  | Total:  4h 38m | Avg: 12m 39s | Max: 45m 33s | Hits:  83%/54618 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 24m | Avg: 7m 52s | Max: 35m 46s | Hits: 90%/14616

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  3h 13m | Avg:  8m 46s | Max: 35m 46s | Hits:  89%/12276 
      🟩 arm64              Pass: 100%/4   | Total: 11m 49s | Avg:  2m 57s | Max:  3m 09s | Hits:  99%/2340  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 20m 39s | Avg:  6m 53s | Max: 12m 42s | Hits:  85%/1460  
      🟩 12.8               Pass: 100%/23  | Total:  3h 04m | Avg:  8m 00s | Max: 35m 46s | Hits:  91%/13156 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 20m 39s | Avg:  6m 53s | Max: 12m 42s | Hits:  85%/1460  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  3h 04m | Avg:  8m 00s | Max: 35m 46s | Hits:  91%/13156 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 24m | Avg:  7m 52s | Max: 35m 46s | Hits:  90%/14616 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 28s | Avg:  3m 14s | Max:  3m 28s | Hits: 100%/1174  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s | Hits: 100%/585   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s | Hits: 100%/585   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s | Hits: 100%/585   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s | Hits: 100%/585   
      🟩 Clang19            Pass: 100%/4   | Total: 25m 32s | Avg:  6m 23s | Max: 16m 19s | Hits: 100%/2340  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 22s | Avg:  4m 11s | Max:  4m 57s | Hits:  95%/1174  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits:  99%/585   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s | Hits:  99%/585   
      🟩 GCC13              Pass: 100%/8   | Total: 47m 24s | Avg:  5m 55s | Max: 14m 23s | Hits:  99%/4680  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 42s | Avg: 12m 42s | Max: 12m 42s | Hits:  46%/286   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 13m 22s | Avg: 13m 22s | Max: 13m 22s | Hits:  46%/286   
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 35m 46s | Hits:  19%/1166  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 46m 08s | Avg:  4m 36s | Max: 16m 19s | Hits: 100%/5854  
      🟩 GCC                Pass: 100%/12  | Total:  1h 02m | Avg:  5m 14s | Max: 14m 23s | Hits:  98%/7024  
      🟩 MSVC               Pass: 100%/2   | Total: 26m 04s | Avg: 13m 02s | Max: 13m 22s | Hits:  46%/572   
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 35m 46s | Hits:  19%/1166  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 31s | Avg:  8m 45s | Max: 14m 23s | Hits:  99%/1170  
      🟩 rtx2080            Pass: 100%/24  | Total:  3h 07m | Avg:  7m 48s | Max: 35m 46s | Hits:  90%/13446 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 41m | Avg:  7m 00s | Max: 35m 46s | Hits:  89%/12861 
      🟩 Test               Pass: 100%/3   | Total: 43m 51s | Avg: 14m 37s | Max: 16m 19s | Hits:  99%/1755  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 20m 46s | Avg:  6m 55s | Max: 14m 23s | Hits:  99%/1755  
      🟩 90a                Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits:  99%/585   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 43m 05s | Avg: 10m 46s | Max: 34m 06s | Hits:  79%/2338  
      🟩 20                 Pass: 100%/22  | Total:  2h 41m | Avg:  7m 21s | Max: 35m 46s | Hits:  93%/12278 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 19m 24s | Avg: 4m 51s | Max: 6m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  6m 01s
      🟩 arm64              Pass: 100%/2   | Total:  8m 16s | Avg:  4m 08s | Max:  4m 12s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  6m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  6m 01s
      🟩 20                 Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  5m 07s
    
  • 🟩 python: Pass: 100%/3 | Total: 29m 17s | Avg: 9m 45s | Max: 20m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 29m 17s | Avg:  9m 45s | Max: 20m 34s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 20m 34s | Avg: 20m 34s | Max: 20m 34s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  6m 03s | Avg:  6m 03s | Max:  6m 03s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits: 98%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 34m 20s | Hits:  98%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s | Hits:  98%/163   
      🟩 Test               Pass: 100%/1   | Total: 34m 20s | Avg: 34m 20s | Max: 34m 20s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

# Runner
123 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
10 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions[bot] avatar Apr 29 '25 08:04 github-actions[bot]

/ok to test cee36bc

miscco avatar Apr 29 '25 08:04 miscco

/ok to test cee36bc

@miscco, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

copy-pr-bot[bot] avatar Apr 29 '25 08:04 copy-pr-bot[bot]

/ok to test 1e97f9d

miscco avatar Apr 30 '25 06:04 miscco

/ok to test 1e97f9d

@miscco, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

copy-pr-bot[bot] avatar Apr 30 '25 06:04 copy-pr-bot[bot]

/ok to test 929f02e

miscco avatar Apr 30 '25 06:04 miscco

🟩 CI finished in 1h 38m: Pass: 100%/174 | Total: 3d 10h | Avg: 28m 33s | Max: 1h 31m | Hits: 65%/277123
  • 🟩 cub: Pass: 100%/47 | Total: 1d 15h | Avg: 50m 32s | Max: 1h 14m | Hits: 30%/57134

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 13h | Avg: 50m 31s | Max:  1h 14m | Hits:  30%/54650 
      🟩 arm64              Pass: 100%/2   | Total:  1h 42m | Avg: 51m 15s | Max: 52m 29s | Hits:  16%/2484  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 51m | Avg: 58m 21s | Max:  1h 04m | Hits:  15%/6036  
      🟩 12.8               Pass: 100%/42  | Total:  1d 10h | Avg: 49m 37s | Max:  1h 14m | Hits:  31%/51098 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 07m | Avg: 33m 49s | Max: 34m 36s | Hits:  14%/2140  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 51m | Avg: 58m 21s | Max:  1h 04m | Hits:  15%/6036  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 09h | Avg: 50m 24s | Max:  1h 14m | Hits:  32%/48958 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 07m | Avg: 33m 49s | Max: 34m 36s | Hits:  14%/2140  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 14h | Avg: 51m 17s | Max:  1h 14m | Hits:  30%/54994 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 41m | Avg: 55m 25s | Max: 59m 36s | Hits:  16%/4976  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 47m | Avg: 53m 34s | Max: 54m 06s | Hits:  16%/2484  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 44m | Avg: 52m 26s | Max: 52m 35s | Hits:  16%/2484  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 46m | Avg: 53m 08s | Max: 53m 19s | Hits:  16%/2484  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 50m | Avg: 55m 13s | Max: 56m 34s | Hits:  16%/2484  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 42m | Avg: 40m 23s | Max: 58m 39s | Hits:  40%/8350  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 43s | Max: 56m 23s | Hits:  15%/2488  
      🟩 GCC8               Pass: 100%/1   | Total: 51m 32s | Avg: 51m 32s | Max: 51m 32s | Hits:  16%/1244  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 31s | Max:  1h 04m | Hits:  16%/2488  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 28s | Max: 59m 40s | Hits:  16%/2488  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 59s | Max: 59m 53s | Hits:  15%/2484  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 54s | Max:  1h 02m | Hits:  15%/2484  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 37m | Avg: 36m 09s | Max: 57m 56s | Hits:  61%/13662 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m | Hits:  12%/2120  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m | Hits:  12%/2120  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 14m | Hits:  11%/2294  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 15h 33m | Avg: 49m 06s | Max: 59m 36s | Hits:  25%/23262 
      🟩 GCC                Pass: 100%/22  | Total: 17h 10m | Avg: 46m 50s | Max:  1h 04m | Hits:  38%/27338 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 26m | Avg:  1h 06m | Max:  1h 08m | Hits:  12%/4240  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 14m | Hits:  11%/2294  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 18m | Avg: 26m 00s | Max: 27m 35s | Hits:  71%/3726  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 09h | Avg: 56m 35s | Max:  1h 14m | Hits:  15%/43472 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 20m | Avg: 32m 33s | Max: 58m 39s | Hits:  78%/9936  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 12h | Avg: 55m 52s | Max:  1h 14m | Hits:  15%/47198 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 26m 52s | Avg: 26m 52s | Max: 26m 52s | Hits:  99%/1242  
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 58s | Avg: 20m 58s | Max: 20m 58s | Hits:  99%/1242  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 21m | Avg: 27m 01s | Max: 28m 02s | Hits:  99%/3726  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 40s | Max: 23m 25s | Hits:  99%/3726  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 18m | Avg: 26m 00s | Max: 27m 35s | Hits:  71%/3726  
      🟩 90;90a;100         Pass: 100%/1   | Total: 55m 48s | Avg: 55m 48s | Max: 55m 48s | Hits:  15%/1242  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 20h 10m | Avg: 57m 38s | Max:  1h 11m | Hits:  15%/25285 
      🟩 20                 Pass: 100%/26  | Total: 19h 25m | Avg: 44m 48s | Max:  1h 14m | Hits:  41%/31849 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 04h | Avg: 36m 18s | Max: 1h 31m | Hits: 53%/83698

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 44m 53s | Avg: 22m 26s | Max: 33m 19s | Hits:  73%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 03h | Avg: 36m 30s | Max:  1h 31m | Hits:  53%/80135 
      🟩 arm64              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 56s | Max: 32m 55s | Hits:  47%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 22m | Avg: 40m 33s | Max:  1h 03m | Hits:  51%/8901  
      🟩 12.8               Pass: 100%/42  | Total:  1d 01h | Avg: 35m 48s | Max:  1h 31m | Hits:  53%/74797 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 02m | Avg: 31m 01s | Max: 33m 17s | Hits:  47%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 22m | Avg: 40m 33s | Max:  1h 03m | Hits:  51%/8901  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 00h | Avg: 36m 02s | Max:  1h 31m | Hits:  53%/71235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 01s | Max: 33m 17s | Hits:  47%/3562  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 03h | Avg: 36m 32s | Max:  1h 31m | Hits:  53%/80136 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 36s | Max: 34m 35s | Hits:  57%/7124  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 27s | Max: 39m 41s | Hits:  47%/3562  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 38m 34s | Hits:  47%/3562  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 06s | Max: 36m 42s | Hits:  47%/3562  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 12s | Max: 36m 59s | Hits:  47%/3562  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 02m | Avg: 26m 01s | Max: 37m 16s | Hits:  64%/12467 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 06s | Max: 34m 08s | Hits:  55%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 36m 31s | Avg: 36m 31s | Max: 36m 31s | Hits:  47%/1782  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 16m | Avg: 38m 01s | Max: 38m 23s | Hits:  60%/3564  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 04s | Max: 38m 57s | Hits:  47%/3564  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 07s | Max: 39m 02s | Hits:  47%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 04s | Max: 38m 33s | Hits:  47%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 55m | Avg: 23m 31s | Max: 36m 42s | Hits:  74%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:   9%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 38m | Avg: 52m 51s | Max:  1h 06m | Hits:  35%/5325  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 59m | Avg:  1h 29m | Max:  1h 31m | Hits:   3%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 10h 02m | Avg: 31m 42s | Max: 39m 41s | Hits:  55%/33839 
      🟩 GCC                Pass: 100%/21  | Total: 10h 40m | Avg: 30m 30s | Max: 39m 02s | Hits:  62%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 57s | Max:  1h 06m | Hits:  25%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 59m | Avg:  1h 29m | Max:  1h 31m | Hits:   3%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 35m 13s | Avg: 17m 36s | Max: 23m 18s | Hits:  73%/3564  
      🟩 rtx2080            Pass: 100%/35  | Total: 23h 47m | Avg: 40m 47s | Max:  1h 31m | Hits:  45%/62331 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 03m | Avg: 24m 22s | Max:  1h 06m | Hits:  77%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 02h | Avg: 40m 25s | Max:  1h 31m | Hits:  45%/71233 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 42s | Avg: 14m 34s | Max: 27m 35s | Hits:  99%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 43s | Avg: 11m 25s | Max: 11m 55s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 35m 13s | Avg: 17m 36s | Max: 23m 18s | Hits:  73%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 35s | Avg: 32m 35s | Max: 32m 35s | Hits:  74%/1782  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 15h 06m | Avg: 43m 09s | Max:  1h 31m | Hits:  43%/37392 
      🟩 20                 Pass: 100%/24  | Total: 12h 35m | Avg: 31m 29s | Max:  1h 27m | Hits:  60%/42742 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 9h 46m | Avg: 13m 01s | Max: 46m 09s | Hits: 89%/121297

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  9h 38m | Avg: 13m 26s | Max: 46m 09s | Hits:  88%/115006
      🟩 arm64              Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 20s | Hits:  98%/6291  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 53m 05s | Avg: 10m 37s | Max: 36m 38s | Hits:  98%/15333 
      🟩 12.8               Pass: 100%/40  | Total:  8h 53m | Avg: 13m 20s | Max: 46m 09s | Hits:  87%/105964
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 26m 37s | Hits:  27%/6251  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 05s | Avg: 10m 37s | Max: 36m 38s | Hits:  98%/15333 
      🟩 nvcc12.8           Pass: 100%/38  | Total:  8h 03m | Avg: 12m 44s | Max: 46m 09s | Hits:  91%/99713 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 26m 37s | Hits:  27%/6251  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 57m | Avg: 12m 29s | Max: 46m 09s | Hits:  92%/115046
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 18m 17s | Avg:  4m 34s | Max:  4m 55s | Hits:  99%/12462 
      🟩 Clang15            Pass: 100%/2   | Total: 30m 09s | Avg: 15m 04s | Max: 25m 10s | Hits:  74%/6247  
      🟩 Clang16            Pass: 100%/2   | Total: 23m 10s | Avg: 11m 35s | Max: 18m 28s | Hits:  77%/6247  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  4m 52s | Hits:  99%/6247  
      🟩 Clang18            Pass: 100%/2   | Total:  9m 23s | Avg:  4m 41s | Max:  4m 42s | Hits:  99%/6247  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 44m | Avg: 17m 27s | Max: 41m 34s | Hits:  70%/15643 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  3m 55s | Hits:  99%/6179  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s | Hits:  99%/3100  
      🟩 GCC9               Pass: 100%/2   | Total:  8m 35s | Avg:  4m 17s | Max:  4m 34s | Hits:  99%/6191  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  4m 38s | Hits:  98%/6253  
      🟩 GCC11              Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 36s | Hits:  99%/6249  
      🟩 GCC12              Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  4m 33s | Hits:  99%/6249  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 36m | Avg:  9m 39s | Max: 22m 35s | Hits:  98%/15891 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 16m | Avg: 38m 15s | Max: 39m 53s | Hits:  98%/5889  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 28m | Avg: 44m 21s | Max: 46m 09s | Hits:  98%/5966  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 22m | Avg: 41m 07s | Max: 43m 05s | Hits:  23%/6237  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  3h 15m | Avg: 10m 51s | Max: 41m 34s | Hits:  85%/53093 
      🟩 GCC                Pass: 100%/21  | Total:  2h 23m | Avg:  6m 50s | Max: 22m 35s | Hits:  99%/50112 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 45m | Avg: 41m 18s | Max: 46m 09s | Hits:  98%/11855 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 07s | Max: 43m 05s | Hits:  23%/6237  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 14s | Avg:  9m 37s | Max: 14m 35s | Hits:  98%/3228  
      🟩 rtx2080            Pass: 100%/43  | Total:  9h 27m | Avg: 13m 11s | Max: 46m 09s | Hits:  88%/118069
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  7h 51m | Avg: 12m 04s | Max: 46m 09s | Hits:  89%/121257
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 45s | Avg: 22m 22s | Max: 22m 35s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total:  1h 08m | Avg: 22m 47s | Max: 41m 34s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 44m 45s | Avg: 22m 22s | Max: 22m 35s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 19m 14s | Avg:  9m 37s | Max: 14m 35s | Hits:  98%/3228  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 00s | Avg:  5m 00s | Max:  5m 00s | Hits:  98%/3228  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  5h 08m | Avg: 14m 00s | Max: 42m 34s | Hits:  87%/64645 
      🟩 20                 Pass: 100%/22  | Total:  4h 36m | Avg: 12m 32s | Max: 46m 09s | Hits:  90%/56652 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 36m | Avg: 8m 19s | Max: 32m 23s | Hits: 84%/14668

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  3h 16m | Avg:  8m 56s | Max: 32m 23s | Hits:  83%/12320 
      🟩 arm64              Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 13s | Hits:  90%/2348  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 23m 22s | Avg:  7m 47s | Max: 13m 11s | Hits:  82%/1466  
      🟩 12.8               Pass: 100%/23  | Total:  3h 13m | Avg:  8m 23s | Max: 32m 23s | Hits:  84%/13202 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 23m 22s | Avg:  7m 47s | Max: 13m 11s | Hits:  82%/1466  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  3h 13m | Avg:  8m 23s | Max: 32m 23s | Hits:  84%/13202 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 36m | Avg:  8m 19s | Max: 32m 23s | Hits:  84%/14668 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  5m 10s | Hits:  90%/1178  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s | Hits:  90%/587   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s | Hits:  90%/587   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s | Hits:  90%/587   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s | Hits:  90%/587   
      🟩 Clang19            Pass: 100%/4   | Total: 23m 54s | Avg:  5m 58s | Max:  9m 05s | Hits:  92%/2348  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 28s | Hits:  90%/1178  
      🟩 GCC11              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s | Hits:  90%/587   
      🟩 GCC12              Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s | Hits:  90%/587   
      🟩 GCC13              Pass: 100%/8   | Total: 47m 50s | Avg:  5m 58s | Max: 10m 56s | Hits:  92%/4696  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s | Hits:  48%/288   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 13m 04s | Avg: 13m 04s | Max: 13m 04s | Hits:  47%/288   
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 16s | Max: 32m 23s | Hits:  24%/1170  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 56m 08s | Avg:  5m 36s | Max:  9m 05s | Hits:  91%/5874  
      🟩 GCC                Pass: 100%/12  | Total:  1h 09m | Avg:  5m 47s | Max: 10m 56s | Hits:  91%/7048  
      🟩 MSVC               Pass: 100%/2   | Total: 26m 15s | Avg: 13m 07s | Max: 13m 11s | Hits:  48%/576   
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 16s | Max: 32m 23s | Hits:  24%/1170  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  7m 34s | Hits:  94%/1174  
      🟩 rtx2080            Pass: 100%/24  | Total:  3h 24m | Avg:  8m 30s | Max: 32m 23s | Hits:  83%/13494 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  3h 08m | Avg:  8m 12s | Max: 32m 23s | Hits:  82%/12907 
      🟩 Test               Pass: 100%/3   | Total: 27m 35s | Avg:  9m 11s | Max: 10m 56s | Hits:  99%/1761  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 16m 21s | Avg:  5m 27s | Max:  7m 34s | Hits:  93%/1761  
      🟩 90a                Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s | Hits:  90%/587   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 46m 09s | Avg: 11m 32s | Max: 32m 09s | Hits:  73%/2346  
      🟩 20                 Pass: 100%/22  | Total:  2h 50m | Avg:  7m 44s | Max: 32m 23s | Hits:  86%/12322 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 21m 04s | Avg: 5m 16s | Max: 5m 57s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 49s | Avg:  5m 54s | Max:  5m 57s
      🟩 arm64              Pass: 100%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  4m 56s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 21m 04s | Avg:  5m 16s | Max:  5m 57s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 57s
      🟩 20                 Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 52s
    
  • 🟩 python: Pass: 100%/3 | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 30m 58s | Avg: 10m 19s | Max: 16m 17s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  5m 02s | Avg:  5m 02s | Max:  5m 02s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 16m 17s | Avg: 16m 17s | Max: 16m 17s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  9m 39s | Avg:  9m 39s | Max:  9m 39s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits: 93%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 31m 32s | Avg: 15m 46s | Max: 28m 01s | Hits:  93%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s | Hits:  87%/163   
      🟩 Test               Pass: 100%/1   | Total: 28m 01s | Avg: 28m 01s | Max: 28m 01s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

# Runner
123 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
10 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

github-actions[bot] avatar Apr 30 '25 08:04 github-actions[bot]