richcmwang
The loss goes up with the default parameters/setup very early in training, even before the discriminator kicks in. I am puzzled by this.
For people who have successfully trained VQGAN, do you see the quantization loss increase over time?
Generating seems to be tricky because DeepSpeed, DataParallel, etc. only dispatch work through an `nn.Module`'s `forward`. But the following code works for me to balance the GPUs...
It does not improve the speed of a single batch on one GPU, but with 2 (or more) GPUs it does improve the speed. In my test case, the running time ratio...
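The idea, roughly, is a thin `nn.Module` wrapper whose `forward` calls `generate_images`, so `nn.DataParallel` can scatter the text batch across GPUs. This is a minimal sketch of that idea (not the exact code, and it assumes `generate_images(text, filter_thres=...)` matches the `DALLE` version you are on):

```python
from torch import nn

class GenerateWrapper(nn.Module):
    """Route generation through forward() so nn.DataParallel can split the batch."""
    def __init__(self, dalle):
        super().__init__()
        self.dalle = dalle

    def forward(self, text, filter_thres = 0.9):
        # each replica receives its own slice of the text batch
        return self.dalle.generate_images(text, filter_thres = filter_thres)

# usage sketch: dalle is a trained model, text is a (batch, text_seq_len) LongTensor on cuda:0
# wrapper = nn.DataParallel(GenerateWrapper(dalle).cuda())
# images  = wrapper(text)   # outputs are gathered back to cuda:0
```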
@afiaka87 Please feel free to incorporate this. I tried [inference](https://github.com/microsoft/DeepSpeed/blob/master/docs/_tutorials/inference-tutorial.md) but get either an incorrect key "checkpoint_path" or an unknown type "DeepSpeed" error message. Not sure the doc is accurate. My `checkpoint.json`: ...
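For context, the call I was attempting looks roughly like the sketch below. It follows the tutorial linked above, but the argument values and the checkpoint-json handling are my assumptions, so treat it as a sketch rather than a verified recipe:

```python
import torch
import deepspeed

# sketch only: dalle is the trained model (a plain nn.Module); checkpoint is a
# placeholder for a checkpoint-description json as described in the tutorial
ds_engine = deepspeed.init_inference(
    dalle,
    mp_size = 1,               # model-parallel degree
    dtype = torch.half,        # or torch.float
    checkpoint = None,         # or the path to the checkpoint json
    replace_method = 'auto',   # let DeepSpeed decide where to inject kernels
)
model = ds_engine.module       # run generation on the wrapped module afterwards
```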
@afiaka87 Thanks for all the information! The in-depth video discussion is really interesting. I was training `VQGAN` and found it difficult to train compared to `DiscreteVAE`, so I started looking into...
@afiaka87 @bob80333 Thanks for pointing this out. Yes, I probably need to train for a very long time from scratch before seeing reasonable reconstructions. I loaded the pretrained checkpoint and...
`sparse_attn` does not seem to have any effect on the `Dalle` or `Transformer`. I am also interested in the difference between `full` (`Attention`) and `sparse` (`SparseAttention`). Are they different implementations...
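For reference, my (unverified) understanding is that the per-layer attention type is selected via an `attn_types` argument when constructing the model; a sketch, assuming that keyword and with placeholder hyperparameters:

```python
from dalle_pytorch import DALLE

# sketch: vae is an already-trained VAE wrapper, numbers are placeholders
dalle = DALLE(
    dim = 512,
    vae = vae,
    num_text_tokens = 10000,
    text_seq_len = 256,
    depth = 4,
    heads = 8,
    attn_types = ('full', 'sparse')   # cycled across layers; 'sparse' maps to SparseAttention
)
```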