Ng Kam Woh

Results 2 comments of Ng Kam Woh

my solution is to use the state dict from the encoder/decoder online ``` from dall_e import Encoder, Decoder # This can be changed to a GPU, e.g. 'cuda:0'. dev =...

I think you can try adding some non-linear layers before the hash layer. sometimes it overfits faster so I think it might help to converge.