Kim Seonghyeon
Maybe you can do it by conditioning on previous frames. I don't know much about video generation, sorry.
I think you can change the shape argument of PixelSNAIL.
Yes, a perceptual loss would be easy to try. But I think you can get quite nice results with MSE loss alone.
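A minimal sketch of what "try a perceptual loss on top of MSE" could look like. This is illustrative only, not the repo's training code: `features` here is a dummy stand-in for a real pretrained feature extractor (e.g. VGG activations), and `recon_loss`, `perceptual_weight` are hypothetical names.

```python
import numpy as np

def mse(a, b):
    # plain mean squared error in pixel space
    return ((a - b) ** 2).mean()

def features(x):
    # Placeholder "feature extractor": a fixed random projection.
    # In practice this would be activations from a pretrained network.
    rng = np.random.default_rng(42)
    w = rng.standard_normal((x.shape[-1], 16))
    return x @ w

def recon_loss(recon, target, perceptual_weight=0.0):
    loss = mse(recon, target)  # MSE alone already works reasonably well
    if perceptual_weight > 0:
        # optional perceptual term: MSE in feature space
        loss += perceptual_weight * mse(features(recon), features(target))
    return loss

x = np.ones((4, 32))
print(recon_loss(x, x))  # 0.0 for a perfect reconstruction
```

With `perceptual_weight=0` this reduces to the plain MSE objective mentioned above.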
I think it will be safer to use fp32 for the entire quantize operation.
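A sketch of the idea of keeping the quantize step in fp32 even when the rest of the network runs in reduced precision: up-cast both the activations and the codebook before the nearest-neighbor lookup. The names (`quantize_fp32`, `codebook`, `z`) are illustrative, not the repo's API, and numpy stands in for the actual mixed-precision framework.

```python
import numpy as np

def quantize_fp32(z, codebook):
    """Nearest-codebook lookup forced to fp32, regardless of input dtype."""
    z32 = z.astype(np.float32)          # up-cast fp16 activations
    cb32 = codebook.astype(np.float32)  # up-cast fp16 codebook entries
    # squared L2 distance from each vector to each codebook entry, in fp32
    dist = ((z32[:, None, :] - cb32[None, :, :]) ** 2).sum(-1)
    idx = dist.argmin(axis=1)
    return cb32[idx], idx

codebook = np.eye(4, dtype=np.float16)           # 4 toy one-hot code vectors
z = np.array([[0.9, 0.1, 0, 0]], dtype=np.float16)
q, idx = quantize_fp32(z, codebook)
print(idx)  # [0] -- the nearest code
```

In a PyTorch mixed-precision setup the equivalent move would be to disable autocast around the quantize call so these distance computations stay in fp32.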
Yes. It may work.
If it suffices to reproduce the results of fp32 training, it would definitely be nice to have.
Yes, it will generate samples of latent codes for the VQ-VAE. I checked that it can make some samples if you train it long enough. But you will need to use a quite large...
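A toy sketch of how an autoregressive prior samples a grid of latent codes one position at a time, which the VQ-VAE decoder would then turn into an image. `toy_logits` is a dummy stand-in for the real PixelSNAIL forward pass; the grid shape, vocabulary size, and function names are all assumptions, not the repo's interface.

```python
import numpy as np

def toy_logits(codes, r, c, n_codes):
    # Dummy conditional distribution over the next code, depending only on
    # the previously sampled code (a real model conditions on all of them).
    prev = codes[r, c - 1] if c > 0 else (codes[r - 1, -1] if r > 0 else 0)
    logits = np.zeros(n_codes)
    logits[(prev + 1) % n_codes] = 5.0
    return logits

def sample_codes(shape, n_codes, rng):
    codes = np.zeros(shape, dtype=np.int64)
    for r in range(shape[0]):            # raster-scan order: row by row,
        for c in range(shape[1]):        # left to right within each row
            logits = toy_logits(codes, r, c, n_codes)
            p = np.exp(logits - logits.max())
            p /= p.sum()                 # softmax over the code vocabulary
            codes[r, c] = rng.choice(n_codes, p=p)
    return codes

rng = np.random.default_rng(0)
codes = sample_codes((8, 8), 512, rng)
print(codes.shape)  # (8, 8)
```

The sampled integer grid would then be looked up in the VQ-VAE codebook and decoded to pixels.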
Not very nice, but it is from a somewhat smaller model than the model in the paper.
##### Top
* channel: 512
* n_block: 4
* n_res_block: 5
* res_channel: 512
* n_cond_res_block: 0
* n_out_res_block: 5
* attention: True
* dropout: 0.1
* batch size: 63...
@k-eak I have used 4 V100s with mixed precision training.