rq-vae-transformer issues

Possible to share your training logs?

3

As the training takes super long time, would you mind uploading the training logs corresponding to [the released commands](https://github.com/kakaobrain/rq-vae-transformer#training-of-rq-vaes)?

XiSHEN0220

parentheses typo in download_cc3m.py

There exists a very trivial typo when writing {args.split}_error_list.txt

bzantium

The usage of soft codes

Thanks for your work! Does this function only work during the training of the transformer at stage 2? https://github.com/kakaobrain/rq-vae-transformer/blob/2bf6ece4b85608cfae4c0e2969b17f75495e1639/rqvae/models/rqvae/quantizations.py#L372

edward3862

Minimum GPU memory size for training RQ-Transformer

1

First of all, thank you all the authors for releasing this remarkable researches and models! I tried to finetune this RQ-Transformer model(3.9B) at certain domain. (I'm already aware that it...

Baekpica

add web demo/model to Huggingface

1

Hi, would you be interested in adding rq-vae-transformer to Hugging Face? The Hub offers free hosting, and it would make your work more accessible and visible to the rest of...

AK391

Support notebook execution on smaller devices

I was able to get the full-size parameter set working locally on my personal dev machine (16 samples on a RTX 3090,) but I had to disable mixed precision and...

ttt733

make it safer input argument type check in sampling method

1

In my case, during T2I sampling In the part of declaring the following variable When using as int type like this ``` top_k=1024 top_p=1 ``` I met the following error...

wbaek

Confused by this LogitMask

rqvae/models/rqtransformer/primitives.py class LogitMask(nn.Module): def __init__(self, vocab_size: Iterable[int], value=-1e6): super().__init__() self.vocab_size = vocab_size self.mask_cond = [vocab_size[0]]*len(vocab_size) != vocab_size self.value = value def forward(self, logits: Tensor) -> Tensor: if not self.mask_cond: return...

JJASMINE22

Train rq-vae on FFHQ failed when using the default ffhq256-rqvae-8x8x4.yaml

2

I trt to retrain rq-vae on FFHQ with the default ffhq256-rqvae-8x8x4.yaml. The training loss first decrease and then increase. ![image](https://user-images.githubusercontent.com/32598987/223308206-0dc2cab7-9583-48c2-a229-16041b544f6e.png) and Then I compute rfid using the model with the...

xiapengchng

How to use pretrained model to reconstruct？

I want to get the reconstruct image on my own dataset，but I just find the code to compute rFID. Which code can I use to pretrained reconstruct？

Catrin-baze

rq-vae-transformer
rq-vae-transformer copied to clipboard

Metadata

Possible to share your training logs?

parentheses typo in download_cc3m.py

The usage of soft codes

Minimum GPU memory size for training RQ-Transformer

add web demo/model to Huggingface

Support notebook execution on smaller devices

make it safer input argument type check in sampling method

Confused by this LogitMask

Train rq-vae on FFHQ failed when using the default ffhq256-rqvae-8x8x4.yaml

How to use pretrained model to reconstruct？

← Metadata

Owner

Metadata

rq-vae-transformer rq-vae-transformer copied to clipboard

Metadata

← Metadata

Owner

Metadata

rq-vae-transformer
rq-vae-transformer copied to clipboard