MagicSource
@janikstfub this augmentation is used to resize along the shortest side relative to the original image. If you skip this step, your images are used raw as input, which should work fine except your...
I am currently using DeepSpeed ZeRO-3. I have a model which needs at least 40GB of GPU memory in total, but I only have 32GB. Using DeepSpeed ZeRO-3 might reduce...
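For a rough sense of whether ZeRO-3 can close that gap: per the ZeRO paper's accounting, fp16 training with Adam costs about 16 bytes per parameter of model states (2 B fp16 params + 2 B fp16 grads + 12 B fp32 optimizer states), and ZeRO-3 shards all of it across ranks; activations and temporary buffers are extra and are not sharded. A back-of-envelope sketch (the 7B parameter count is purely an illustrative assumption):

```python
def zero3_model_states_gib_per_gpu(n_params: float, n_gpus: int) -> float:
    """Per-GPU model-state memory under ZeRO-3, in GiB.

    16 bytes/param = fp16 params (2) + fp16 grads (2) + fp32 Adam
    master params, momentum and variance (12); ZeRO-3 shards all
    three across the data-parallel group.
    """
    return 16 * n_params / n_gpus / 2**30

# Hypothetical 7B-parameter model split over 2 GPUs:
print(round(zero3_model_states_gib_per_gpu(7e9, 2), 1))  # 52.2 GiB each
```

So sharding alone may not be enough; this is why the HF integration also exposes optimizer/parameter offload to CPU.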
I think it might be due to the forward pass, since I am able to train it at all... I am using a ZeRO-3 config like this:

```
{
  "fp16": {
    "enabled": "auto",
    "loss_scale": 0,
    "loss_scale_window": 1000,
    ...
```
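For reference, a complete ZeRO-3 config in the shape the HF Trainer integration expects (the `"auto"` values are filled in by the Trainer; the offload entries are optional and only worth enabling when GPU memory is the bottleneck):

```json
{
  "fp16": {
    "enabled": "auto",
    "loss_scale": 0,
    "loss_scale_window": 1000,
    "initial_scale_power": 16,
    "hysteresis": 2,
    "min_loss_scale": 1
  },
  "zero_optimization": {
    "stage": 3,
    "offload_optimizer": { "device": "cpu", "pin_memory": true },
    "offload_param": { "device": "cpu", "pin_memory": true },
    "overlap_comm": true,
    "contiguous_gradients": true,
    "stage3_gather_16bit_weights_on_model_save": true
  },
  "train_batch_size": "auto",
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto"
}
```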
So it can be concluded that if one cannot train a model with ZeRO-3 even with bs = 1, then one won't be able to do so with FSDP as...
Yes, how do I enable tensor parallelism? It seems I need to split the model across 2 GPUs and have both compute a single batch of data together. It looks like there are no default settings...
Hi, is there any built-in implementation in transformers to enable TP with a single config? It looks like users need to configure every single layer to use TP?
It looks like torchtitan can do tensor parallelism by default?
Does it support Qwen models? Also, can multimodal models such as LLaVA be supported?
Where should I add it?