Yan Wang comments

Results: 78 comments of Yan Wang

Add resnet50 benchmark (#443)

Status update: Thanks to @jjsjann123's patch https://github.com/Lightning-AI/lightning-thunder/pull/706, the failure in https://github.com/Lightning-AI/lightning-thunder/pull/451#issuecomment-2186631228 is gone, but there is now an nvFuser failure: "Unsupported loop structure. Two loops are mapped together.bS323{1}...

Add resnet50 benchmark (#443)

Hi all, thanks to @jjsjann123's fix, ResNet50 is working now. Please review again, thanks.

When comparing Thunder Torch Executor to Torch Eager, the ResNet18 gradients are not close for FP32.

Run this script:

```
import torch
import torchvision
import os
os.environ["NVIDIA_TF32_OVERRIDE"] = "0"
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
torch.manual_seed(42)
import random
random.seed(42)
torch.use_deterministic_algorithms(True)
model = torchvision.models.resnet18(weights=None).to(device="cuda", dtype=torch.float32)
x = torch.randn((1, 3, 224, 224), dtype=torch.float32, device="cuda", requires_grad=True)
...
```

OOM for ThunderFX and Thunder with DDP for Mistral-7B-v0.1

Reduces the test time

Adds SymTypes to tree_flatten

Additional ThunderFX benchmark backend options

Additional ThunderFX benchmark backend options

Additional ThunderFX benchmark backend options