vision icon indicating copy to clipboard operation
vision copied to clipboard

Add gpu decode

Open deekay42 opened this issue 1 year ago • 1 comments

Over 8000 imgs/s on 1 A100 GPU


Platform: Linux-5.12.0-0_fbk7_zion_6511_gd766966f605a-x86_64-with-glibc2.34
Logical CPUs: 192

CUDA device: NVIDIA PG509-210
Total Memory: 84.99 GB

Mean image size: 551x676
[---------------------------------------------------------------- Image Decoding ----------------------------------------------------------------]
                                                                                                        |  1 images  |  100 images  |  1000 images
1 threads: ---------------------------------------------------------------------------------------------------------------------------------------
      CPU (unfused): [torchvision.io.decode_jpeg(img, device='cpu') for img in encoded_images_trunc]    |   3301.9   |   271141.6   |   2541465.3 
      CPU (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cpu')                       |   3239.7   |   288522.8   |   2596394.3 
      CUDA (unfused): [torchvision.io.decode_jpeg(img, device='cuda') for img in encoded_images_trunc]  |    603.7   |    60097.8   |    573783.4 
      CUDA (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cuda')                     |    600.6   |    12972.6   |    127654.8 
12 threads: --------------------------------------------------------------------------------------------------------------------------------------
      CPU (unfused): [torchvision.io.decode_jpeg(img, device='cpu') for img in encoded_images_trunc]    |   3330.5   |   272498.9   |   2552944.3 
      CPU (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cpu')                       |   3339.7   |   257796.7   |   2511005.4 
      CUDA (unfused): [torchvision.io.decode_jpeg(img, device='cuda') for img in encoded_images_trunc]  |    603.8   |    59138.0   |    588341.4 
      CUDA (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cuda')                     |    605.0   |    13163.7   |    127891.4 
24 threads: --------------------------------------------------------------------------------------------------------------------------------------
      CPU (unfused): [torchvision.io.decode_jpeg(img, device='cpu') for img in encoded_images_trunc]    |   3227.5   |   276357.8   |   2518914.3 
      CPU (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cpu')                       |   3277.7   |   257554.9   |   2497894.3 
      CUDA (unfused): [torchvision.io.decode_jpeg(img, device='cuda') for img in encoded_images_trunc]  |    607.9   |    58306.1   |    583932.6 
      CUDA (fused): torchvision.io.decode_jpeg(encoded_images_trunc, device='cuda')                     |    653.2   |    12604.1   |    124130.5 

Times are in microseconds (us).```

deekay42 avatar Jun 17 '24 19:06 deekay42

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/8496

Note: Links to docs will display an error until the docs builds have been completed.

:x: 13 New Failures, 6 Unrelated Failures

As of commit efa746d8b4892925c65d370e89f9357fb267dac9 with merge base 5242d6ac27c5eae7b74b65f28bd1373de955327e (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot[bot] avatar Jun 17 '24 19:06 pytorch-bot[bot]

Hey @NicolasHug!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

github-actions[bot] avatar Aug 07 '24 14:08 github-actions[bot]