taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Hi! Thanks for this wonderful codebase. In the file containing the Encoder and Decoder definitions, I found that `num_res_blocks` was used in the Encoder, and `num_res_blocks + 1` in the...
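For context, a minimal sketch of that asymmetry is below; the block contents are simplified placeholders rather than the repository's actual `ResnetBlock`, and only the loop counts mirror what the question describes.

```python
import torch.nn as nn

def make_res_stack(num_res_blocks, channels, is_decoder=False):
    # Per the question: the Encoder stacks num_res_blocks blocks per resolution,
    # while the Decoder stacks num_res_blocks + 1. The blocks here are stand-ins.
    n = num_res_blocks + (1 if is_decoder else 0)
    return nn.Sequential(*[
        nn.Sequential(nn.Conv2d(channels, channels, 3, padding=1), nn.SiLU())
        for _ in range(n)
    ])

encoder_stack = make_res_stack(num_res_blocks=2, channels=128)
decoder_stack = make_res_stack(num_res_blocks=2, channels=128, is_decoder=True)
print(len(encoder_stack), len(decoder_stack))  # 2 3
```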
This pull request contains the following changes: 1. Imports in the `taming` module are now relative. 2. Missing `__init__.py` files added. We can install it as a plain Python package through `setup.py`...
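For anyone wanting to reproduce that setup locally, a minimal `setup.py` along these lines usually suffices; the metadata and dependency list below are placeholders, not necessarily what the pull request ships.

```python
# setup.py: minimal sketch; name, version and requirements are placeholders.
from setuptools import setup, find_packages

setup(
    name="taming-transformers",
    version="0.0.1",
    packages=find_packages(),   # picks up the `taming` package once __init__.py files exist
    install_requires=["torch", "numpy", "omegaconf", "einops"],
)
```

With that in place, `pip install -e .` makes `import taming` resolve without sys.path hacks.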
I have trained a transformer model that can generate _256×256_ images. How can I use the **sliding attention window** mechanism mentioned in the paper to generate high-resolution images? It...
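For what it's worth, the basic idea can be sketched as follows: generate tokens for a latent grid larger than the training size, conditioning each new token only on a local window of already-generated tokens. This is a rough illustration with an assumed `predict_next` stand-in for the trained transformer, not the repository's actual sampling routine.

```python
import torch

def sliding_window_sample(predict_next, grid_h, grid_w, win=16, vocab_size=1024):
    # Raster-scan generation over a grid_h x grid_w token grid; each step sees only
    # a win x win crop ending at the current position (simplified illustration).
    grid = torch.zeros(grid_h, grid_w, dtype=torch.long)
    for i in range(grid_h):
        for j in range(grid_w):
            top, left = max(0, i - win + 1), max(0, j - win + 1)
            context = grid[top:i + 1, left:j + 1].reshape(1, -1)[:, :-1]  # drop the not-yet-generated target
            logits = predict_next(context)                 # assumed to return (1, vocab_size)
            probs = torch.softmax(logits, dim=-1)
            grid[i, j] = torch.multinomial(probs, 1).item()
    return grid

# Smoke test with a dummy "model" that returns uniform logits over the codebook.
out = sliding_window_sample(lambda ctx: torch.zeros(1, 1024), grid_h=32, grid_w=32)
print(out.shape)  # torch.Size([32, 32])
```

The full token grid is then decoded to pixels with the VQGAN decoder as usual.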
I tried to run the code from the latent diffusion model, which clones taming-transformers, but it fails with `ModuleNotFoundError: No module named 'cutlit'`. cutlit is not a Python package; according to...
Fix variable naming errors, embedding_dim and n_embed
The transformer takes in the quantized image tokens generated by VQGAN, whose codebook has indices (0 to n_embed-1), and the transformer's sos token is also set to zero by default. Could you tell...
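One common way to avoid that collision, sketched below purely as an illustration (the offset value and embedding size are assumptions, not necessarily what this repository does), is to reserve an index outside the codebook range for the sos token:

```python
import torch

n_embed = 1024                    # VQGAN codebook indices are 0 .. n_embed - 1
sos_token = n_embed               # assumed workaround: place sos outside that range

# The transformer's token embedding then needs n_embed + 1 rows.
tok_emb = torch.nn.Embedding(n_embed + 1, 256)

image_tokens = torch.randint(0, n_embed, (1, 255))   # quantized indices from VQGAN
seq = torch.cat([torch.full((1, 1), sos_token, dtype=torch.long), image_tokens], dim=1)
print(tok_emb(seq).shape)         # torch.Size([1, 256, 256])
```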
Can anyone help me with this? I am training on a custom dataset and I have changed the image dataset class as per my needs. The error below appears when I...
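For comparison, a minimal custom dataset that matches what the training code expects (a dict with an `"image"` key holding an HWC float32 array scaled to [-1, 1]) might look like the sketch below; the folder layout and resize logic are placeholders.

```python
import glob
import numpy as np
from PIL import Image
from torch.utils.data import Dataset

class CustomImages(Dataset):
    """Minimal sketch; folder path, file patterns and resizing are placeholders."""
    def __init__(self, folder, size=256):
        self.paths = sorted(glob.glob(f"{folder}/*.jpg") + glob.glob(f"{folder}/*.png"))
        self.size = size

    def __len__(self):
        return len(self.paths)

    def __getitem__(self, i):
        img = Image.open(self.paths[i]).convert("RGB").resize((self.size, self.size))
        img = np.array(img).astype(np.float32) / 127.5 - 1.0  # HWC, values in [-1, 1]
        return {"image": img}                                  # key the training loop reads
```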
Hi, I found that all of the reconstructed images saved under the directory ./logs/ during training are white images (shown below). Can anyone tell me the potential reason?
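A first thing worth checking (just a guess at the cause, not a confirmed diagnosis) is whether the decoder outputs contain NaNs or have saturated well above 1.0 before the logger converts them to uint8, since such values end up as pure white pixels:

```python
import torch

def inspect_reconstruction(xrec):
    # xrec: decoder output, expected roughly in [-1, 1]
    print("any NaN:", torch.isnan(xrec).any().item())
    print("min/max:", xrec.min().item(), xrec.max().item())
    # The image logger typically maps [-1, 1] to [0, 255]; values at or above 1.0 become white.
    as_uint8 = ((xrec.clamp(-1, 1) + 1.0) * 127.5).to(torch.uint8)
    print("fraction of pure-white pixels:", (as_uint8 == 255).float().mean().item())
```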
Hi. There is one print statement in the Decoder class: `print("Working with z of shape {} = {} dimensions.".format(self.z_shape, np.prod(self.z_shape)))` Here the letter z seems to represent the latent vector by...
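For reference, that shape is assembled from the config before the decoder starts upsampling, roughly as in the sketch below; the config values shown are examples, not the only possibility.

```python
import numpy as np

# Example config values (e.g. a 256x256 VQGAN); adjust to match your YAML.
resolution = 256
z_channels = 256
ch_mult = (1, 1, 2, 2, 4)                 # one entry per resolution level
num_resolutions = len(ch_mult)

curr_res = resolution // 2 ** (num_resolutions - 1)      # 256 // 16 = 16
z_shape = (1, z_channels, curr_res, curr_res)
print("Working with z of shape {} = {} dimensions.".format(z_shape, np.prod(z_shape)))
# Working with z of shape (1, 256, 16, 16) = 65536 dimensions.
```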
So far, I've seen that there are many flags that check for torch with CUDA, and many CUDA-specific calls. I've commented the checks out and converted the devices to...
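A less invasive pattern than commenting out each check (a generic PyTorch idiom, not something specific to this repository) is to select the device once and move modules and tensors to it:

```python
import torch

# Falls back to CPU automatically when CUDA is unavailable.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

model = torch.nn.Linear(8, 8).to(device)     # placeholder module
x = torch.randn(2, 8, device=device)
print(model(x).device)
```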