Ng Kam Woh
Results: 2 comments by Ng Kam Woh
My solution is to use the state dict from the encoder/decoder online:

```
from dall_e import Encoder, Decoder

# This can be changed to a GPU, e.g. 'cuda:0'.
dev =...
```
I think you can try adding some non-linear layers before the hash layer. Sometimes it overfits faster, so I think it might help convergence.
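
For illustration, here is a rough PyTorch sketch of what "non-linear layers before the hash layer" could look like. The class name `HashHead`, the helper `binarize`, the layer sizes, and the choice of a single `Linear` + `ReLU` block are all assumptions made for this example, not the layout of any particular repo.

```python
import torch
import torch.nn as nn


class HashHead(nn.Module):
    """Hypothetical hashing head: a small non-linear block, then the hash layer."""

    def __init__(self, in_features: int = 512, hidden: int = 256, nbit: int = 64):
        super().__init__()
        # Non-linear layers placed before the hash layer (sizes are assumptions).
        self.pre_hash = nn.Sequential(
            nn.Linear(in_features, hidden),
            nn.ReLU(),
        )
        # The hash layer itself: projects to nbit continuous codes.
        self.hash_layer = nn.Linear(hidden, nbit)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.hash_layer(self.pre_hash(features))


def binarize(codes: torch.Tensor) -> torch.Tensor:
    # Sign of the continuous codes; torch.sign maps an exact 0 to 0.
    return torch.sign(codes)


if __name__ == "__main__":
    head = HashHead()
    features = torch.randn(8, 512)  # stand-in for backbone features
    codes = head(features)          # continuous codes, one row per sample
    print(codes.shape, binarize(codes).shape)
```

Whether the extra layers actually help will depend on the dataset and the loss, so it is worth comparing against the plain single-layer head.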